SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Applying social network analysis
to Parliamentary Proceedings
Automatic discovery of meaningful cliques



Author:
Justin van Wees

Supervisors:
Dr. Maarten Marx
Dr. Johan van Doornik

June 23, 2011
Why?
Motivation and research question
Research question
    Can we discover communities of politicians
      that debate on a speci c policy area?

                         Motivation
•   It’s unknown which member is responsible for a certain
    policy area

•   Discover what issues are discussed within a policy area

•   Serve as example application of social network analysis
    techniques
How?
Background and methodology
<root>
  <docinfo>...</docinfo>
  <meta>...</meta>
  <proceedings>
    <topic>
       <scene type="speaker" speaker="Hamer" party="PvdA" function="Mevrouw"
              role="mp" title="Mevrouw Hamer (PvdA)" MPid="02221">
         <speech party="PvdA" speaker="Hamer" function="Mevrouw"
                 role="mp" MPid="02221">
           <p>Dat is helemaal niet waar. U bewijst nu voor de derde keer
                 dat u niet ...</p>
         </speech>
         <speech type="interruption" party="Verdonk" speaker="Verdonk"
                 function="Mevrouw" role="mp" MPid="02995">
           <p>Mag ik even uitpraten? Dank u. Zo werkt dat, gewoon fatsoen.
                 Dank u wel. [...]</p>
         </speech>
       </scence>
    </topic>
  </proceedings>
</root>
A simple graph
A directed graph
42


                  32
                                21
                       12
        84



  100
             10
                            8
                                 15




A weighted directed graph
.8&&%9":3()(;&/%3<"3='()(,-


                                               8
  456",,%#()(+77()(,-
                                           8
                        2
                                 4
                                                         !"#$%&'()(**+()(,-
                    2


4,"2'()(B1$A()(,-
                                     >":#%1%#$)456/?2%3()(@+A()(,-
                .//0%&1/&'2()(0/1%&3,%32


               A single debate represented in a graph
Debates during Cabinet Kok II
A community
   A group of nodes that are relatively densely
connected to each other but sparsely connected to
       other dense groups in the network
A k-clique (k = 4)   K-clique communities (k = 4)
Finding issues that a community is discussing

•   Retrieve all ‘community text’

•   Tokenized at word level

•   Lemmatize

•   Use parsimonious language models to nd most
    ‘descriptive’ terms
What?
Results and conclusion
General network statistics of Kok II

              No distinction With distinction
             between MP/MG between MP/MG
                  roles           roles
Nodes               211               218
Edges              3594              3615
Density            0,081             0,076
Finding k-clique communties

•   By default, found groups are note ‘cohesive’

•   Filter out ‘noise’ by setting a threshold on edge weights

•   At 15 interruptions: 197 nodes, 741 edges, 31 k-clique
    communities
Finding k-clique communties

•   All k-clique communities could be traced back to a single
    policy area

•   Except for more ‘general’ policy areas

•   92% of the community members directly related to the policy
    area covered by the community

•   85% of top 20 ‘issue terms’ relevant to policy area

•   K-clique community detection and parsimonious language
    models are successful methods for automatic discovery of
    communities within debate networks
Discussion
... and future research
•   Method for setting edge weight threshold

•   Reviewing of k-cliques done by single person

•   Used four years of data, shorter time-window possible?

•   Focused on Cabinet Kok II, what about other (earlier)
    cabinets?

•   Completely different data?
Questions?
For detailed results, datasets and programs see:
 http://justinvanwees.nl/goto/bachelorscriptie

Mais conteúdo relacionado

Semelhante a Applying social network analysis to Parliamentary Proceedings: Automatic discovery of meaningful cliques

Multi-mediated community structure in a socio-technical network
Multi-mediated community structure in a socio-technical networkMulti-mediated community structure in a socio-technical network
Multi-mediated community structure in a socio-technical networksuthers
 
Temporal and semantic analysis of richly typed social networks from user-gene...
Temporal and semantic analysis of richly typed social networks from user-gene...Temporal and semantic analysis of richly typed social networks from user-gene...
Temporal and semantic analysis of richly typed social networks from user-gene...Zide Meng
 
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Symeon Papadopoulos
 
Self-modeling and self-reflection of E-learning communities
Self-modeling and self-reflection of E-learning communitiesSelf-modeling and self-reflection of E-learning communities
Self-modeling and self-reflection of E-learning communitiesZina Petrushyna
 
Topic map for Topic Maps case examples
Topic map for Topic Maps case examplesTopic map for Topic Maps case examples
Topic map for Topic Maps case examplestmra
 
Dorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECT
Dorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECTDorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECT
Dorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECTHelga Dorner
 
Lecture_2_Stats.pdf
Lecture_2_Stats.pdfLecture_2_Stats.pdf
Lecture_2_Stats.pdfpaijitk
 
Enabling reuse of arguments and opinions in open collaboration systems PhD vi...
Enabling reuse of arguments and opinions in open collaboration systems PhD vi...Enabling reuse of arguments and opinions in open collaboration systems PhD vi...
Enabling reuse of arguments and opinions in open collaboration systems PhD vi...jodischneider
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning AnalyticsXavier Ochoa
 
Measurement and modeling of the web and related data sets
Measurement and modeling of the web and related data setsMeasurement and modeling of the web and related data sets
Measurement and modeling of the web and related data setsMark J. Feldman
 
Open Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice ConferenceOpen Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice ConferenceDawn Foster
 
SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...
SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...
SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...Christoph Johann Stettina
 
A Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemA Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemSease
 
Visualising data: Seeing is Believing - CS Forum 2012
Visualising data: Seeing is Believing - CS Forum 2012Visualising data: Seeing is Believing - CS Forum 2012
Visualising data: Seeing is Believing - CS Forum 2012Richard Ingram
 
Text mining and social network analysis of twitter data part 1
Text mining and social network analysis of twitter data part 1Text mining and social network analysis of twitter data part 1
Text mining and social network analysis of twitter data part 1Johan Blomme
 
Domain-Driven Design at ZendCon 2012
Domain-Driven Design at ZendCon 2012Domain-Driven Design at ZendCon 2012
Domain-Driven Design at ZendCon 2012Bradley Holt
 
Immersive Recommendation
Immersive RecommendationImmersive Recommendation
Immersive Recommendation承剛 謝
 
Semantic-based Process Analysis
Semantic-based Process AnalysisSemantic-based Process Analysis
Semantic-based Process AnalysisMauro Dragoni
 
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Charalampos Chelmis
 

Semelhante a Applying social network analysis to Parliamentary Proceedings: Automatic discovery of meaningful cliques (20)

Multi-mediated community structure in a socio-technical network
Multi-mediated community structure in a socio-technical networkMulti-mediated community structure in a socio-technical network
Multi-mediated community structure in a socio-technical network
 
Temporal and semantic analysis of richly typed social networks from user-gene...
Temporal and semantic analysis of richly typed social networks from user-gene...Temporal and semantic analysis of richly typed social networks from user-gene...
Temporal and semantic analysis of richly typed social networks from user-gene...
 
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
 
Self-modeling and self-reflection of E-learning communities
Self-modeling and self-reflection of E-learning communitiesSelf-modeling and self-reflection of E-learning communities
Self-modeling and self-reflection of E-learning communities
 
Manos
ManosManos
Manos
 
Topic map for Topic Maps case examples
Topic map for Topic Maps case examplesTopic map for Topic Maps case examples
Topic map for Topic Maps case examples
 
Dorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECT
Dorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECTDorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECT
Dorner Helga: INVESTIGATING PATTERNS OF INTERACTION IN THE CALIBRATE PROJECT
 
Lecture_2_Stats.pdf
Lecture_2_Stats.pdfLecture_2_Stats.pdf
Lecture_2_Stats.pdf
 
Enabling reuse of arguments and opinions in open collaboration systems PhD vi...
Enabling reuse of arguments and opinions in open collaboration systems PhD vi...Enabling reuse of arguments and opinions in open collaboration systems PhD vi...
Enabling reuse of arguments and opinions in open collaboration systems PhD vi...
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
Measurement and modeling of the web and related data sets
Measurement and modeling of the web and related data setsMeasurement and modeling of the web and related data sets
Measurement and modeling of the web and related data sets
 
Open Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice ConferenceOpen Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice Conference
 
SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...
SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...
SIGDOC 2011 - Necessary and Neglected? An Empirical Study of Internal Documen...
 
A Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemA Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking Problem
 
Visualising data: Seeing is Believing - CS Forum 2012
Visualising data: Seeing is Believing - CS Forum 2012Visualising data: Seeing is Believing - CS Forum 2012
Visualising data: Seeing is Believing - CS Forum 2012
 
Text mining and social network analysis of twitter data part 1
Text mining and social network analysis of twitter data part 1Text mining and social network analysis of twitter data part 1
Text mining and social network analysis of twitter data part 1
 
Domain-Driven Design at ZendCon 2012
Domain-Driven Design at ZendCon 2012Domain-Driven Design at ZendCon 2012
Domain-Driven Design at ZendCon 2012
 
Immersive Recommendation
Immersive RecommendationImmersive Recommendation
Immersive Recommendation
 
Semantic-based Process Analysis
Semantic-based Process AnalysisSemantic-based Process Analysis
Semantic-based Process Analysis
 
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
 

Último

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 

Último (20)

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 

Applying social network analysis to Parliamentary Proceedings: Automatic discovery of meaningful cliques

  • 1. Applying social network analysis to Parliamentary Proceedings Automatic discovery of meaningful cliques Author: Justin van Wees Supervisors: Dr. Maarten Marx Dr. Johan van Doornik June 23, 2011
  • 3. Research question Can we discover communities of politicians that debate on a speci c policy area? Motivation • It’s unknown which member is responsible for a certain policy area • Discover what issues are discussed within a policy area • Serve as example application of social network analysis techniques
  • 5.
  • 6.
  • 7. <root> <docinfo>...</docinfo> <meta>...</meta> <proceedings> <topic> <scene type="speaker" speaker="Hamer" party="PvdA" function="Mevrouw" role="mp" title="Mevrouw Hamer (PvdA)" MPid="02221"> <speech party="PvdA" speaker="Hamer" function="Mevrouw" role="mp" MPid="02221"> <p>Dat is helemaal niet waar. U bewijst nu voor de derde keer dat u niet ...</p> </speech> <speech type="interruption" party="Verdonk" speaker="Verdonk" function="Mevrouw" role="mp" MPid="02995"> <p>Mag ik even uitpraten? Dank u. Zo werkt dat, gewoon fatsoen. Dank u wel. [...]</p> </speech> </scence> </topic> </proceedings> </root>
  • 10. 42 32 21 12 84 100 10 8 15 A weighted directed graph
  • 11. .8&&%9":3()(;&/%3<"3='()(,- 8 456",,%#()(+77()(,- 8 2 4 !"#$%&'()(**+()(,- 2 4,"2'()(B1$A()(,- >":#%1%#$)456/?2%3()(@+A()(,- .//0%&1/&'2()(0/1%&3,%32 A single debate represented in a graph
  • 13. A community A group of nodes that are relatively densely connected to each other but sparsely connected to other dense groups in the network
  • 14. A k-clique (k = 4) K-clique communities (k = 4)
  • 15. Finding issues that a community is discussing • Retrieve all ‘community text’ • Tokenized at word level • Lemmatize • Use parsimonious language models to nd most ‘descriptive’ terms
  • 17. General network statistics of Kok II No distinction With distinction between MP/MG between MP/MG roles roles Nodes 211 218 Edges 3594 3615 Density 0,081 0,076
  • 18. Finding k-clique communties • By default, found groups are note ‘cohesive’ • Filter out ‘noise’ by setting a threshold on edge weights • At 15 interruptions: 197 nodes, 741 edges, 31 k-clique communities
  • 19.
  • 20.
  • 21. Finding k-clique communties • All k-clique communities could be traced back to a single policy area • Except for more ‘general’ policy areas • 92% of the community members directly related to the policy area covered by the community • 85% of top 20 ‘issue terms’ relevant to policy area • K-clique community detection and parsimonious language models are successful methods for automatic discovery of communities within debate networks
  • 23. Method for setting edge weight threshold • Reviewing of k-cliques done by single person • Used four years of data, shorter time-window possible? • Focused on Cabinet Kok II, what about other (earlier) cabinets? • Completely different data?
  • 24. Questions? For detailed results, datasets and programs see: http://justinvanwees.nl/goto/bachelorscriptie