SlideShare uma empresa Scribd logo
1 de 22
Social Network Analysis with
Content and Graphs
徐凡耘
Resource
• Unit: LINCOLN LABORATORY JOURNAL
• Date: VOLUME 20, NUMBER 1, 2013
• Download:
https://www.ll.mit.edu/publications/journal/p
df/vol20_no1/20_1_5_Campbell.pdf
Authors
• William M. Campbell
– Match learning, mathematics in CU
• Charlie K. Dagli
– Computer engineering in UIUC
• Clifford J. Weinstein
– Electrical engineering in MIT
Agenda
• Introduction
– A consequence of changing economic
– Graph Construction
• Community Detection
– Modularity optimization
– Infomap
– Spectral Clustering
• Summary
We don’t cover
• Community Dynamics
– Latent semantic indexing(LSI)
– SVD
– Tensor
• Time-Profile-specific sub-network
– Classification(C4.5, PETS)
– Relational probability tree(RPT)
A consequence of
changing economic
• Big Data from variety and bunch of data
source of large-scale, real-world
sociographic data.
• Constructing social network, analyzing
the structure and dynamics of a
community and developing inferences
from social network
Graph Construction
• Challenge: too many factors
– Ambiguity of human language
– Multiple aliases for the same user
– Incompatible representations of information
– Ambiguity of relationship between
individuals
Data Source and
Information Extraction
• Newswire and sensor
– Smartphone and proximity devices provide
dynamic interactions
• Communications and social media
– Followers in Twitter, people who are
related by current news topics. Even FB…
– Email related to Enron’s bankruptcy
Introduction -
Information Extraction from Text
• Named-entity
recognition (NER)
extracts people, places
and orgs.
• Use links based upon
the co-occurrence of
entities in a documents
https://web.cs.umass.edu/publication/docs/2012/UM-CS-2012-015.pdf
Representation
• Knowledge Representation
– Participates(Bob, M)
Member(Bob, KarateClub)
– Ontology based on Automated Content
Extraction(ACE)
Community Detection
• High connectivity within a group and low
connectivity across groups
• Modularity
optimization(Clauset/Newman/Moore)
• Infomap
• Spectral clustering
DataSet
• Name: ISVG (Institute for the Study of
Violent Groups)
• Cover: terrorist and criminal activity
from open-source docs, including news
articles, court doc, police rpt.
• More than 100,000 incidents
• More than 1,500 hand-annotated types
• Nearly 30,000 individuals and 3,000
groups
Modularity optimization
(Clauset/Newman/Moore)
• Missing link prediction (e.g.
recommend friendship,
Folding@home)
• Using similarity probability
to associate nodes
• Strong similarity has
tendency to be linked?
(Men vs Men in sex-
network)
Modularity optimization
(Clauset/Newman/Moore)
• Even better performance in Terrorist
association and Grassland species network
• Both dataset has explicit level orgs.
• It is considered in large-scale network for big
calculating
• Convert a graph to Markov model by
random walk
• Using entropy to determine clusters
Infomap
Spectral Clustering (SC)
• Convergence in global optimal value with
arbitrary shape
• Few clusters, non-flat geometry
• Laplacian matrix for engenvalue/vector
Eigenvalue & Eigenvectors
• After image transformation, red line
keeps the same direction but yellow line
change to opposite direction
• Red line (eigenvalue = 1)
• Yellow line (eigenvalue = -1)
• They are orthogonal
• Av = λv (λ=Eigenvalue, v=Eigenvector)
Spectral Clustering (SC)
• Normalized cuts
– Divide and conquer
– NP-Complete
Precision and Recall
• Trade-off to fit the application
• Average color
• Segmentation
• Normalized Cut in Region Adjacency Graphs(RAG)
by removing edge
Normalized cuts
• L = D – A
Laplacian Matrix
A =
L =
Reference
• http://www.coolaler.com/showthread.php/184742-
%E4%BD%95%E8%AC%82-Folding-home-
%E8%9B%8B%E7%99%BD%E8%B3%AA%E6%91%BA%E7%
96%8A
• http://blog.sciencenet.cn/blog-3075-275710.html
• http://arxiv.org/pdf/0811.0484.pdf
• http://blog.sciencenet.cn/blog-329471-318268.html
• http://learnmath.pixnet.net/blog/post/44430775-
%E7%89%B9%E5%BE%B5%E5%80%BC%EF%BC%8C%E7
%89%B9%E5%BE%B5%E5%90%91%E9%87%8F
• http://www.fnlp.org/archives/4053
• http://dufu.math.ncu.edu.tw/calculus/calculus_eng/node162.html

Mais conteúdo relacionado

Mais procurados

News Sharing on Twitter: A Nationally Comparative Study
News Sharing on Twitter: A Nationally Comparative StudyNews Sharing on Twitter: A Nationally Comparative Study
News Sharing on Twitter: A Nationally Comparative StudyAxel Bruns
 
Analysing the Norwegian Twittersphere
Analysing the Norwegian TwittersphereAnalysing the Norwegian Twittersphere
Analysing the Norwegian TwittersphereAxel Bruns
 
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Jean Burgess
 
Social Media in Selected Australian Federal and State Election Campaigns, 201...
Social Media in Selected Australian Federal and State Election Campaigns, 201...Social Media in Selected Australian Federal and State Election Campaigns, 201...
Social Media in Selected Australian Federal and State Election Campaigns, 201...Axel Bruns
 
Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)
Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)
Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)Axel Bruns
 
Collaboration and Social Networking (KCB202 Week 2 Podcast)
Collaboration and Social Networking (KCB202 Week 2 Podcast)Collaboration and Social Networking (KCB202 Week 2 Podcast)
Collaboration and Social Networking (KCB202 Week 2 Podcast)Axel Bruns
 
Social Media in Australia: A ‘Big Data’ Perspective on Twitter
Social Media in Australia: A ‘Big Data’ Perspective on TwitterSocial Media in Australia: A ‘Big Data’ Perspective on Twitter
Social Media in Australia: A ‘Big Data’ Perspective on TwitterAxel Bruns
 
Data Science career mixer poster
Data Science career mixer posterData Science career mixer poster
Data Science career mixer posterTom Jeon
 
9th TripleHelix: Politicians Twitter network - a case of S. Korea
9th TripleHelix: Politicians Twitter network - a case of S. Korea9th TripleHelix: Politicians Twitter network - a case of S. Korea
9th TripleHelix: Politicians Twitter network - a case of S. KoreaHan Woo PARK
 
Comparison of Elementary Dynamic Network Models Using Empirical Data
Comparison of Elementary Dynamic Network Models Using Empirical DataComparison of Elementary Dynamic Network Models Using Empirical Data
Comparison of Elementary Dynamic Network Models Using Empirical DataRichard Oliver Legendi
 
Social Media in Australia: The Case of Twitter
Social Media in Australia: The Case of TwitterSocial Media in Australia: The Case of Twitter
Social Media in Australia: The Case of TwitterAxel Bruns
 
From Geographic Location to Network Location: The Potential of Big Social Data
From Geographic Location to Network Location: The Potential of Big Social DataFrom Geographic Location to Network Location: The Potential of Big Social Data
From Geographic Location to Network Location: The Potential of Big Social DataAxel Bruns
 
Amplifying Impact: Developing Indicators of Public Value in Public Communicat...
Amplifying Impact: Developing Indicators of Public Value in Public Communicat...Amplifying Impact: Developing Indicators of Public Value in Public Communicat...
Amplifying Impact: Developing Indicators of Public Value in Public Communicat...Axel Bruns
 
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other SourcesAxel Bruns
 
Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...
Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...
Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...Axel Bruns
 
Exploring the Global Demographics of Twitter
Exploring the Global Demographics of TwitterExploring the Global Demographics of Twitter
Exploring the Global Demographics of TwitterAxel Bruns
 
Phd Colloquium Spatial Analysis
Phd Colloquium Spatial AnalysisPhd Colloquium Spatial Analysis
Phd Colloquium Spatial Analysisalistairleak
 
#ICCSS2015 - Computational Human Security Analytics using "Big Data"
#ICCSS2015 - Computational Human Security Analytics using "Big Data"#ICCSS2015 - Computational Human Security Analytics using "Big Data"
#ICCSS2015 - Computational Human Security Analytics using "Big Data"Pete Burnap
 
Twitter in Germany: A Big Data Perspective
Twitter in Germany: A Big Data PerspectiveTwitter in Germany: A Big Data Perspective
Twitter in Germany: A Big Data PerspectiveAxel Bruns
 

Mais procurados (20)

News Sharing on Twitter: A Nationally Comparative Study
News Sharing on Twitter: A Nationally Comparative StudyNews Sharing on Twitter: A Nationally Comparative Study
News Sharing on Twitter: A Nationally Comparative Study
 
Analysing the Norwegian Twittersphere
Analysing the Norwegian TwittersphereAnalysing the Norwegian Twittersphere
Analysing the Norwegian Twittersphere
 
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...
 
Social Media in Selected Australian Federal and State Election Campaigns, 201...
Social Media in Selected Australian Federal and State Election Campaigns, 201...Social Media in Selected Australian Federal and State Election Campaigns, 201...
Social Media in Selected Australian Federal and State Election Campaigns, 201...
 
Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)
Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)
Building Spaces for Hyperlocal Citizen Journalism (AoIR 2008)
 
Collaboration and Social Networking (KCB202 Week 2 Podcast)
Collaboration and Social Networking (KCB202 Week 2 Podcast)Collaboration and Social Networking (KCB202 Week 2 Podcast)
Collaboration and Social Networking (KCB202 Week 2 Podcast)
 
Social Media in Australia: A ‘Big Data’ Perspective on Twitter
Social Media in Australia: A ‘Big Data’ Perspective on TwitterSocial Media in Australia: A ‘Big Data’ Perspective on Twitter
Social Media in Australia: A ‘Big Data’ Perspective on Twitter
 
Data Science career mixer poster
Data Science career mixer posterData Science career mixer poster
Data Science career mixer poster
 
9th TripleHelix: Politicians Twitter network - a case of S. Korea
9th TripleHelix: Politicians Twitter network - a case of S. Korea9th TripleHelix: Politicians Twitter network - a case of S. Korea
9th TripleHelix: Politicians Twitter network - a case of S. Korea
 
Comparison of Elementary Dynamic Network Models Using Empirical Data
Comparison of Elementary Dynamic Network Models Using Empirical DataComparison of Elementary Dynamic Network Models Using Empirical Data
Comparison of Elementary Dynamic Network Models Using Empirical Data
 
Social Media in Australia: The Case of Twitter
Social Media in Australia: The Case of TwitterSocial Media in Australia: The Case of Twitter
Social Media in Australia: The Case of Twitter
 
From Geographic Location to Network Location: The Potential of Big Social Data
From Geographic Location to Network Location: The Potential of Big Social DataFrom Geographic Location to Network Location: The Potential of Big Social Data
From Geographic Location to Network Location: The Potential of Big Social Data
 
Amplifying Impact: Developing Indicators of Public Value in Public Communicat...
Amplifying Impact: Developing Indicators of Public Value in Public Communicat...Amplifying Impact: Developing Indicators of Public Value in Public Communicat...
Amplifying Impact: Developing Indicators of Public Value in Public Communicat...
 
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
 
Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...
Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...
Journalism-as-a-Service: Amplifying Public Intellectual Contributions through...
 
Exploring the Global Demographics of Twitter
Exploring the Global Demographics of TwitterExploring the Global Demographics of Twitter
Exploring the Global Demographics of Twitter
 
Phd Colloquium Spatial Analysis
Phd Colloquium Spatial AnalysisPhd Colloquium Spatial Analysis
Phd Colloquium Spatial Analysis
 
#ICCSS2015 - Computational Human Security Analytics using "Big Data"
#ICCSS2015 - Computational Human Security Analytics using "Big Data"#ICCSS2015 - Computational Human Security Analytics using "Big Data"
#ICCSS2015 - Computational Human Security Analytics using "Big Data"
 
Data Power
Data PowerData Power
Data Power
 
Twitter in Germany: A Big Data Perspective
Twitter in Germany: A Big Data PerspectiveTwitter in Germany: A Big Data Perspective
Twitter in Germany: A Big Data Perspective
 

Semelhante a Social network analysis

Data mining based social network
Data mining based social networkData mining based social network
Data mining based social networkFiras Husseini
 
Characterizing Data and Software for Social Science Research
Characterizing Data and Software for Social Science ResearchCharacterizing Data and Software for Social Science Research
Characterizing Data and Software for Social Science ResearchMicah Altman
 
A boring presentation about social mobile communication patterns and opportun...
A boring presentation about social mobile communication patterns and opportun...A boring presentation about social mobile communication patterns and opportun...
A boring presentation about social mobile communication patterns and opportun...Save Manos
 
System dynamics prof nagurney
System dynamics prof nagurneySystem dynamics prof nagurney
System dynamics prof nagurneyHouw Liong The
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 TutorialAlexander Pico
 
Christoph Barrett - Policy Informatics at Societal Scale
Christoph Barrett - Policy Informatics at Societal ScaleChristoph Barrett - Policy Informatics at Societal Scale
Christoph Barrett - Policy Informatics at Societal ScaleGlobal Risk Forum GRFDavos
 
Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)tm1966
 
WIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network ScienceWIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network ScienceColleen Farrelly
 
Social Networks and Computer Science
Social Networks and Computer ScienceSocial Networks and Computer Science
Social Networks and Computer Sciencedragonmeteor
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012CameliaN
 
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...giuseppe_futia
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesKathirvel Ayyaswamy
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)Duke Network Analysis Center
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measuresdnac
 

Semelhante a Social network analysis (20)

Data mining based social network
Data mining based social networkData mining based social network
Data mining based social network
 
Characterizing Data and Software for Social Science Research
Characterizing Data and Software for Social Science ResearchCharacterizing Data and Software for Social Science Research
Characterizing Data and Software for Social Science Research
 
Network Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and ApplicationsNetwork Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and Applications
 
A boring presentation about social mobile communication patterns and opportun...
A boring presentation about social mobile communication patterns and opportun...A boring presentation about social mobile communication patterns and opportun...
A boring presentation about social mobile communication patterns and opportun...
 
System dynamics prof nagurney
System dynamics prof nagurneySystem dynamics prof nagurney
System dynamics prof nagurney
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
Christoph Barrett - Policy Informatics at Societal Scale
Christoph Barrett - Policy Informatics at Societal ScaleChristoph Barrett - Policy Informatics at Societal Scale
Christoph Barrett - Policy Informatics at Societal Scale
 
Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)
 
WIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network ScienceWIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network Science
 
Social Networks and Computer Science
Social Networks and Computer ScienceSocial Networks and Computer Science
Social Networks and Computer Science
 
06 Community Detection
06 Community Detection06 Community Detection
06 Community Detection
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012
 
Spatio Temporal Data Mining
Spatio Temporal Data MiningSpatio Temporal Data Mining
Spatio Temporal Data Mining
 
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
 
Big Data
Big Data Big Data
Big Data
 
DBMS
DBMSDBMS
DBMS
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research Opportunities
 
social.pptx
social.pptxsocial.pptx
social.pptx
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 

Mais de FEG

Pytorch cnn netowork introduction 20240318
Pytorch cnn netowork introduction 20240318Pytorch cnn netowork introduction 20240318
Pytorch cnn netowork introduction 20240318FEG
 
2023 Decision Tree analysis in business practices
2023 Decision Tree analysis in business practices2023 Decision Tree analysis in business practices
2023 Decision Tree analysis in business practicesFEG
 
2023 Clustering analysis using Python from scratch
2023 Clustering analysis using Python from scratch2023 Clustering analysis using Python from scratch
2023 Clustering analysis using Python from scratchFEG
 
2023 Data visualization using Python from scratch
2023 Data visualization using Python from scratch2023 Data visualization using Python from scratch
2023 Data visualization using Python from scratchFEG
 
2023 Supervised Learning for Orange3 from scratch
2023 Supervised Learning for Orange3 from scratch2023 Supervised Learning for Orange3 from scratch
2023 Supervised Learning for Orange3 from scratchFEG
 
2023 Supervised_Learning_Association_Rules
2023 Supervised_Learning_Association_Rules2023 Supervised_Learning_Association_Rules
2023 Supervised_Learning_Association_RulesFEG
 
202312 Exploration Data Analysis Visualization (English version)
202312 Exploration Data Analysis Visualization (English version)202312 Exploration Data Analysis Visualization (English version)
202312 Exploration Data Analysis Visualization (English version)FEG
 
202312 Exploration of Data Analysis Visualization
202312 Exploration of Data Analysis Visualization202312 Exploration of Data Analysis Visualization
202312 Exploration of Data Analysis VisualizationFEG
 
Transfer Learning (20230516)
Transfer Learning (20230516)Transfer Learning (20230516)
Transfer Learning (20230516)FEG
 
Image Classification (20230411)
Image Classification (20230411)Image Classification (20230411)
Image Classification (20230411)FEG
 
Google CoLab (20230321)
Google CoLab (20230321)Google CoLab (20230321)
Google CoLab (20230321)FEG
 
Supervised Learning
Supervised LearningSupervised Learning
Supervised LearningFEG
 
UnSupervised Learning Clustering
UnSupervised Learning ClusteringUnSupervised Learning Clustering
UnSupervised Learning ClusteringFEG
 
Data Visualization in Excel
Data Visualization in ExcelData Visualization in Excel
Data Visualization in ExcelFEG
 
6_Association_rule_碩士班第六次.pdf
6_Association_rule_碩士班第六次.pdf6_Association_rule_碩士班第六次.pdf
6_Association_rule_碩士班第六次.pdfFEG
 
5_Neural_network_碩士班第五次.pdf
5_Neural_network_碩士班第五次.pdf5_Neural_network_碩士班第五次.pdf
5_Neural_network_碩士班第五次.pdfFEG
 
4_Regression_analysis.pdf
4_Regression_analysis.pdf4_Regression_analysis.pdf
4_Regression_analysis.pdfFEG
 
3_Decision_tree.pdf
3_Decision_tree.pdf3_Decision_tree.pdf
3_Decision_tree.pdfFEG
 
2_Clustering.pdf
2_Clustering.pdf2_Clustering.pdf
2_Clustering.pdfFEG
 
1_大二班_資料視覺化_20221028.pdf
1_大二班_資料視覺化_20221028.pdf1_大二班_資料視覺化_20221028.pdf
1_大二班_資料視覺化_20221028.pdfFEG
 

Mais de FEG (20)

Pytorch cnn netowork introduction 20240318
Pytorch cnn netowork introduction 20240318Pytorch cnn netowork introduction 20240318
Pytorch cnn netowork introduction 20240318
 
2023 Decision Tree analysis in business practices
2023 Decision Tree analysis in business practices2023 Decision Tree analysis in business practices
2023 Decision Tree analysis in business practices
 
2023 Clustering analysis using Python from scratch
2023 Clustering analysis using Python from scratch2023 Clustering analysis using Python from scratch
2023 Clustering analysis using Python from scratch
 
2023 Data visualization using Python from scratch
2023 Data visualization using Python from scratch2023 Data visualization using Python from scratch
2023 Data visualization using Python from scratch
 
2023 Supervised Learning for Orange3 from scratch
2023 Supervised Learning for Orange3 from scratch2023 Supervised Learning for Orange3 from scratch
2023 Supervised Learning for Orange3 from scratch
 
2023 Supervised_Learning_Association_Rules
2023 Supervised_Learning_Association_Rules2023 Supervised_Learning_Association_Rules
2023 Supervised_Learning_Association_Rules
 
202312 Exploration Data Analysis Visualization (English version)
202312 Exploration Data Analysis Visualization (English version)202312 Exploration Data Analysis Visualization (English version)
202312 Exploration Data Analysis Visualization (English version)
 
202312 Exploration of Data Analysis Visualization
202312 Exploration of Data Analysis Visualization202312 Exploration of Data Analysis Visualization
202312 Exploration of Data Analysis Visualization
 
Transfer Learning (20230516)
Transfer Learning (20230516)Transfer Learning (20230516)
Transfer Learning (20230516)
 
Image Classification (20230411)
Image Classification (20230411)Image Classification (20230411)
Image Classification (20230411)
 
Google CoLab (20230321)
Google CoLab (20230321)Google CoLab (20230321)
Google CoLab (20230321)
 
Supervised Learning
Supervised LearningSupervised Learning
Supervised Learning
 
UnSupervised Learning Clustering
UnSupervised Learning ClusteringUnSupervised Learning Clustering
UnSupervised Learning Clustering
 
Data Visualization in Excel
Data Visualization in ExcelData Visualization in Excel
Data Visualization in Excel
 
6_Association_rule_碩士班第六次.pdf
6_Association_rule_碩士班第六次.pdf6_Association_rule_碩士班第六次.pdf
6_Association_rule_碩士班第六次.pdf
 
5_Neural_network_碩士班第五次.pdf
5_Neural_network_碩士班第五次.pdf5_Neural_network_碩士班第五次.pdf
5_Neural_network_碩士班第五次.pdf
 
4_Regression_analysis.pdf
4_Regression_analysis.pdf4_Regression_analysis.pdf
4_Regression_analysis.pdf
 
3_Decision_tree.pdf
3_Decision_tree.pdf3_Decision_tree.pdf
3_Decision_tree.pdf
 
2_Clustering.pdf
2_Clustering.pdf2_Clustering.pdf
2_Clustering.pdf
 
1_大二班_資料視覺化_20221028.pdf
1_大二班_資料視覺化_20221028.pdf1_大二班_資料視覺化_20221028.pdf
1_大二班_資料視覺化_20221028.pdf
 

Último

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 

Último (20)

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 

Social network analysis

  • 1. Social Network Analysis with Content and Graphs 徐凡耘
  • 2. Resource • Unit: LINCOLN LABORATORY JOURNAL • Date: VOLUME 20, NUMBER 1, 2013 • Download: https://www.ll.mit.edu/publications/journal/p df/vol20_no1/20_1_5_Campbell.pdf
  • 3. Authors • William M. Campbell – Match learning, mathematics in CU • Charlie K. Dagli – Computer engineering in UIUC • Clifford J. Weinstein – Electrical engineering in MIT
  • 4. Agenda • Introduction – A consequence of changing economic – Graph Construction • Community Detection – Modularity optimization – Infomap – Spectral Clustering • Summary
  • 5. We don’t cover • Community Dynamics – Latent semantic indexing(LSI) – SVD – Tensor • Time-Profile-specific sub-network – Classification(C4.5, PETS) – Relational probability tree(RPT)
  • 6. A consequence of changing economic • Big Data from variety and bunch of data source of large-scale, real-world sociographic data. • Constructing social network, analyzing the structure and dynamics of a community and developing inferences from social network
  • 7. Graph Construction • Challenge: too many factors – Ambiguity of human language – Multiple aliases for the same user – Incompatible representations of information – Ambiguity of relationship between individuals
  • 8. Data Source and Information Extraction • Newswire and sensor – Smartphone and proximity devices provide dynamic interactions • Communications and social media – Followers in Twitter, people who are related by current news topics. Even FB… – Email related to Enron’s bankruptcy
  • 9. Introduction - Information Extraction from Text • Named-entity recognition (NER) extracts people, places and orgs. • Use links based upon the co-occurrence of entities in a documents https://web.cs.umass.edu/publication/docs/2012/UM-CS-2012-015.pdf
  • 10. Representation • Knowledge Representation – Participates(Bob, M) Member(Bob, KarateClub) – Ontology based on Automated Content Extraction(ACE)
  • 11. Community Detection • High connectivity within a group and low connectivity across groups • Modularity optimization(Clauset/Newman/Moore) • Infomap • Spectral clustering
  • 12. DataSet • Name: ISVG (Institute for the Study of Violent Groups) • Cover: terrorist and criminal activity from open-source docs, including news articles, court doc, police rpt. • More than 100,000 incidents • More than 1,500 hand-annotated types • Nearly 30,000 individuals and 3,000 groups
  • 13. Modularity optimization (Clauset/Newman/Moore) • Missing link prediction (e.g. recommend friendship, Folding@home) • Using similarity probability to associate nodes • Strong similarity has tendency to be linked? (Men vs Men in sex- network)
  • 14. Modularity optimization (Clauset/Newman/Moore) • Even better performance in Terrorist association and Grassland species network • Both dataset has explicit level orgs. • It is considered in large-scale network for big calculating
  • 15. • Convert a graph to Markov model by random walk • Using entropy to determine clusters Infomap
  • 16. Spectral Clustering (SC) • Convergence in global optimal value with arbitrary shape • Few clusters, non-flat geometry • Laplacian matrix for engenvalue/vector
  • 17. Eigenvalue & Eigenvectors • After image transformation, red line keeps the same direction but yellow line change to opposite direction • Red line (eigenvalue = 1) • Yellow line (eigenvalue = -1) • They are orthogonal • Av = λv (λ=Eigenvalue, v=Eigenvector)
  • 18. Spectral Clustering (SC) • Normalized cuts – Divide and conquer – NP-Complete
  • 19. Precision and Recall • Trade-off to fit the application
  • 20. • Average color • Segmentation • Normalized Cut in Region Adjacency Graphs(RAG) by removing edge Normalized cuts
  • 21. • L = D – A Laplacian Matrix A = L =
  • 22. Reference • http://www.coolaler.com/showthread.php/184742- %E4%BD%95%E8%AC%82-Folding-home- %E8%9B%8B%E7%99%BD%E8%B3%AA%E6%91%BA%E7% 96%8A • http://blog.sciencenet.cn/blog-3075-275710.html • http://arxiv.org/pdf/0811.0484.pdf • http://blog.sciencenet.cn/blog-329471-318268.html • http://learnmath.pixnet.net/blog/post/44430775- %E7%89%B9%E5%BE%B5%E5%80%BC%EF%BC%8C%E7 %89%B9%E5%BE%B5%E5%90%91%E9%87%8F • http://www.fnlp.org/archives/4053 • http://dufu.math.ncu.edu.tw/calculus/calculus_eng/node162.html

Notas do Editor

  1. Stanford University research project (Folding@home)
  2. 矩陣乘以一個不為零的向量,相當於將此向量做一些平移、旋轉、伸展、推移之後的結果
  3. Divide and Conquer: a big problem divide to some small problem. Solve the small problems and then combine them. Such like map reduce, quick sort, merge sort