SlideShare a Scribd company logo
1 of 26
Download to read offline
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Formal Concept Analysis applied to
Professional Social Networks
Paula Silva, S´ergio M. Dias, Wladmir Brand˜ao, Mark Song,
Luis Z´arate (supervisor)
Pontifical Catholic University of Minas Gerais, Brazil
ICEIS, 2017
ICEIS, 2017 1 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Context and Motivation
Online social networks for users oriented to business
The LinkedIn is the one of the largest and most popular online
professional social network
The size and diversity of users generated content data
The need to help professionals to increase skills and reach job positions
The Formal Concept Analysis (FCA) as mathematical formulation for data
analysis, applied to find patterns of professional competence
ICEIS, 2017 2 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Goal
Identify professional behaviors through data scraped from LinkedIn
Find the minimum set of skills that is necessary to reach job positions
For example: statistic, machine learning, databases → data scientist
Implications rules, specifically the set of proper implications
ICEIS, 2017 3 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Contributions
Our contributions are:
The domain problem mapping the model of competences
The professionals data set scraped from LinkedIn
The FCA-based approach
The set of experiments to apply FCA for professional career analysis
ICEIS, 2017 4 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Formal Concept Analysis
Based on the notions of concept and conceptual hierarchy
A mathematical way to look at data and knowledge, their acquisition
process and analysis based on lattices.
There are three main principles:
Formal context
Formal concept
Implications
ICEIS, 2017 5 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Formal Context
Formal context (G, M, I)
a set G of objects
a set M of attributes
a binary relation I ⊆ G ×M
a b c d e f g
18 x x
19 x x x
20 x x x x
21 x x x
22 x x
23 x x x x x
24 x x
Table: Example context of an user’s LinkedIn skills.
ICEIS, 2017 6 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Formal Concept
Derivation operators:
For A ⊆ G and B ⊆ M
A := {m ∈ M|gIm∀g ∈ A}
B := {g ∈ G|gIm∀m ∈ B}
Formal concept (A, B)
A ⊆ G B ⊆ M
A = B B = A
A is concept extent and B is concept intent
e.g. formal concept ({18, 21, 24}, {software engineering, data bases})
ICEIS, 2017 7 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Implication Rules
Being a formal context (G, M, I), an implication over M is P → Q, where
P,Q ⊆ M.
P is the premise
Q is the conclusion
P → Q has to be such that P ⊆ Q , so the sets of attributes P and Q
share the same subset of objects.
ICEIS, 2017 8 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
The Set of Proper Implications
the right hand side of each implication is unitary: if P → m ∈ I , then
m ∈ M;
superfluous implications are not allowed: if P → m ∈ I , then m /∈ P;
specializations are not allowed, i.e. left hand sides are minimal: if
P → m ∈ I , then there is not any Q → m ∈ I such that Q ⊂ P.
For example, {e} → {a} is a proper implication, but {e, g} → {a} is not a
proper implication.
ICEIS, 2017 9 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
FCA-based Approach
ICEIS, 2017 10 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
1 - Problem Domain
Building the conceptual model according to the problem to be treated
The classification of informational categories, based on Model of
Competence
Figure: Duran(1998) adapted model
ICEIS, 2017 11 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
1 - Problem Domain
We identified 3 dimensions, 14 aspects and 51 variables
Figure: Professional identification through model of competence ICEIS, 2017 12 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
2 - Scrapping
The scrapping process was divided into two phases.
Selecting initial seeds randomly
Collecting the public profiles data: People undergraduate in IT coursers in
Minas Gerais, Brazil
ICEIS, 2017 13 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
3 - Preprocessing
We only considered the variables skills and experience
The ETL process:
String cleaning: UTF-8 encoding correction, accent removal,
standardization of all terms for the English language through Google
Translate API
Attribute reductions: reductions based on semantic relevance
Generic term Specific term
Java
JPA
JSF
Software developer
Developer
Programmer
Program developer
Formal context with 366 attributes and 970 objects
ICEIS, 2017 14 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
4 - Knowledge Extractor
The PropIm algorithm
Finding supersets: If A → b, so ↑ |(A → b) | and ↓ |A|
It computes proper implications with support > 0
Easily scalable strategy
Allows to set the attributes of interest for implications’ conclusions
Problem complexity: O(|M||I |(|G||M|+|I ||M|))
Pruning heuristic to reduce combinations possibilities among attributes
ICEIS, 2017 15 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
4 - Knowledge Extractor
Input : Formal context (G,M,I)
Output: Set of proper implications I with support
greater than 0
1 I = /0
2 foreach m ∈ M do
3 P = m
4 size = 1
5 Pa = /0
6 while size < |P| do
7 C = P
size
8 PC = getCandidate(C,Pa)
9 foreach P1 ⊂ PC do
10 if P1 = /0 and P1 ⊂ m then
11 Pa = Pa ∪{P1}
12 I = I ∪{P1 → m}
13 end
14 end
15 size ++
16 end
17 end
18 return I
1 Function getCandidate (C,Pa)
2 D = /0
3 foreach a ∈ A|A ⊂ Pa do
4 foreach B ⊂ C do
5 if a /∈ B then
6 D = PC B
7 end
8 end
9 end
10 return D
ICEIS, 2017 16 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Experiments
The goal was to answer the following questions:
How do proper implications identify relations between skills and positions?
Could we find intersections among sets of skills, and what do these
intersections represent?
ICEIS, 2017 17 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Proper Implications to Competence Identification
We selected 20 positions and their 180 skills
It was extracted 895 proper implications with PropIm algorithm
Figure: Proper implications network. AD = Administrative Director, ITC = IT Consultant
ICEIS, 2017 18 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Proper Implications to Competence Identification
Nodes with high in-degree value represent positions which have more
diversification of sets of skills
163 proper implications
{entrepreneurship, human resources, information management} →
{administrative director}
Figure: AD = Administrative Director
ICEIS, 2017 19 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Proper Implications to Competence Identification
Nodes with low in-degree value represent jobs positions that demand
more specific sets of skills
3 proper implications
{ABAP, agile methodology, BI} → {it consultant}
Figure: ITC = IT Consultant
ICEIS, 2017 20 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Intersection Between Skills and Job Positions
Top 3 best jobs in Information Technology area according to Career Cast
research (Cast, 2016)
Edges weight = relative frequency F =
Fi
Fp
, where F is the relative
frequency, Fi is the implication absolute frequency and Fp is |m |
Why relative frequency?
{java frameworks}→{software
engineer}
Support: 2.47%
Relative frequency: 75.56%
Figure: P1: data scientist, P2: information security
analyst and P3: software engineer job position
ICEIS, 2017 21 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Intersection Between Skills and Job Positions
Nodes P1 and P2 share two set of skills:
{agile methodology} → {data scientist}
{agile methodology} → {information security analyst}
Figure: P1: data scientist, P2: information security analyst and P3: software engineer job position
ICEIS, 2017 22 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Intersection Between Skills and Job Positions
Skills and positions related to hierarchical transition:
P1: {.NET, automation systems} → IT analyst
P2: {.NET, data base, ERP, it governance} → IT coordinator
P3: {BPM, cloud computing, CRM} → IT manager
P4: {assets management, BI, business management, consulting} → IT
director
Figure: P1 to P4 represents the following job positions: P1 is IT analyst, P2 is IT manager, P3 is IT
coordinator and P4 is IT director
ICEIS, 2017 23 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Conclusions
FCA-based approach to identify the minimum sets of skills that is
necessary to achieve a job position
Set of experiments for apply FCA to professional competences analysis
Computational strategy to find proper implications without loss of
information
In-degree of conclusions nodes mean the diversification among sets of
skills for the same job position
Sharing sets of skills do not determine the job position, but show positions
that can be achieved
The disjointed sets (representing hierarchy) show that is necessary
develop skills of different natures to progress in career
ICEIS, 2017 24 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Future Works
Experiments will be replicated for other areas
Exploring another algorithms which extract implications from concept
lattice, or from the set of formal concepts
Expanding the analysis to all dimensions from model of competence
Implementing an web environment with this FCA-based approach, for help
professionals to increase skills and look for possible job positions
Preprocessing: attribute fusion through correlation analysis
Temporal analysis of career evolution
Estimate the remuneration of people based on posts and professional data
ICEIS, 2017 25 / 26
Introduction FCA FCA-based Approach Experiments Conclusions Future Works
Any questions?
paula.raissa@sga.pucminas.br
paula.csraissa@gmail.com
ICEIS, 2017 26 / 26

More Related Content

Similar to Formal Concept Analysis applied to Professional Social Networks

Massive scale analytics with Stratosphere using R
Massive scale analytics with Stratosphere using RMassive scale analytics with Stratosphere using R
Massive scale analytics with Stratosphere using RJose Luis Lopez Pino
 
Synthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML TechniquesSynthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML TechniquesQuantUniversity
 
A data science observatory based on RAMP - rapid analytics and model prototyping
A data science observatory based on RAMP - rapid analytics and model prototypingA data science observatory based on RAMP - rapid analytics and model prototyping
A data science observatory based on RAMP - rapid analytics and model prototypingAkin Osman Kazakci
 
ODSC West 2021 – Composition in ML
ODSC West 2021 – Composition in MLODSC West 2021 – Composition in ML
ODSC West 2021 – Composition in MLBryan Bischof
 
Paper Explained: Deep learning framework for measuring the digital strategy o...
Paper Explained: Deep learning framework for measuring the digital strategy o...Paper Explained: Deep learning framework for measuring the digital strategy o...
Paper Explained: Deep learning framework for measuring the digital strategy o...Devansh16
 
PMIS Course Part 2 – PMIS in Practice
PMIS Course Part 2 – PMIS in PracticePMIS Course Part 2 – PMIS in Practice
PMIS Course Part 2 – PMIS in PracticeSasan Hosseyni
 
Chapter 03 overview-of project planning Step Wise
Chapter 03 overview-of project planning Step WiseChapter 03 overview-of project planning Step Wise
Chapter 03 overview-of project planning Step WiseRohanMistry15
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at ScaleWalid Mehanna
 
Unit 2 Planning_b System Development Initiation and Planning .pdf
Unit 2 Planning_b System Development Initiation and Planning .pdfUnit 2 Planning_b System Development Initiation and Planning .pdf
Unit 2 Planning_b System Development Initiation and Planning .pdfSuryaBasnet3
 
Arg-XAI: a Tool for Explaining Machine Learning Results
Arg-XAI: a Tool for Explaining Machine Learning ResultsArg-XAI: a Tool for Explaining Machine Learning Results
Arg-XAI: a Tool for Explaining Machine Learning ResultsCarlo Taticchi
 
Learning Discrete Representations via Information Maximizing Self-Augmented T...
Learning Discrete Representations via Information Maximizing Self-Augmented T...Learning Discrete Representations via Information Maximizing Self-Augmented T...
Learning Discrete Representations via Information Maximizing Self-Augmented T...Shunsuke KITADA
 
Laura sebu icstcc_merging
Laura sebu icstcc_mergingLaura sebu icstcc_merging
Laura sebu icstcc_merginglaurasebu
 
STARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis driftSTARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis driftPark JunPyo
 
Linear Regression with R programming.pptx
Linear Regression with R programming.pptxLinear Regression with R programming.pptx
Linear Regression with R programming.pptxanshikagoel52
 
Performance evaluation of Wearable Computing Frameworks
Performance evaluation of Wearable Computing FrameworksPerformance evaluation of Wearable Computing Frameworks
Performance evaluation of Wearable Computing FrameworksMichelle Guerreiro C Silva
 
Grcm Presentation On Aug 4 2009
Grcm Presentation On Aug 4 2009Grcm Presentation On Aug 4 2009
Grcm Presentation On Aug 4 2009rabett
 
Chatzikonstantinou c ai-se2013_
Chatzikonstantinou c ai-se2013_Chatzikonstantinou c ai-se2013_
Chatzikonstantinou c ai-se2013_caise2013vlc
 
Leveraging The Machine: The Future of People Data Is Now
Leveraging The Machine: The Future of People Data Is Now Leveraging The Machine: The Future of People Data Is Now
Leveraging The Machine: The Future of People Data Is Now Cornerstone OnDemand
 
Predire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataPredire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataData Driven Innovation
 

Similar to Formal Concept Analysis applied to Professional Social Networks (20)

Massive scale analytics with Stratosphere using R
Massive scale analytics with Stratosphere using RMassive scale analytics with Stratosphere using R
Massive scale analytics with Stratosphere using R
 
Synthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML TechniquesSynthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML Techniques
 
A data science observatory based on RAMP - rapid analytics and model prototyping
A data science observatory based on RAMP - rapid analytics and model prototypingA data science observatory based on RAMP - rapid analytics and model prototyping
A data science observatory based on RAMP - rapid analytics and model prototyping
 
ODSC West 2021 – Composition in ML
ODSC West 2021 – Composition in MLODSC West 2021 – Composition in ML
ODSC West 2021 – Composition in ML
 
Paper Explained: Deep learning framework for measuring the digital strategy o...
Paper Explained: Deep learning framework for measuring the digital strategy o...Paper Explained: Deep learning framework for measuring the digital strategy o...
Paper Explained: Deep learning framework for measuring the digital strategy o...
 
PMIS Course Part 2 – PMIS in Practice
PMIS Course Part 2 – PMIS in PracticePMIS Course Part 2 – PMIS in Practice
PMIS Course Part 2 – PMIS in Practice
 
Dubai meetup 2016
Dubai meetup 2016Dubai meetup 2016
Dubai meetup 2016
 
Chapter 03 overview-of project planning Step Wise
Chapter 03 overview-of project planning Step WiseChapter 03 overview-of project planning Step Wise
Chapter 03 overview-of project planning Step Wise
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at Scale
 
Unit 2 Planning_b System Development Initiation and Planning .pdf
Unit 2 Planning_b System Development Initiation and Planning .pdfUnit 2 Planning_b System Development Initiation and Planning .pdf
Unit 2 Planning_b System Development Initiation and Planning .pdf
 
Arg-XAI: a Tool for Explaining Machine Learning Results
Arg-XAI: a Tool for Explaining Machine Learning ResultsArg-XAI: a Tool for Explaining Machine Learning Results
Arg-XAI: a Tool for Explaining Machine Learning Results
 
Learning Discrete Representations via Information Maximizing Self-Augmented T...
Learning Discrete Representations via Information Maximizing Self-Augmented T...Learning Discrete Representations via Information Maximizing Self-Augmented T...
Learning Discrete Representations via Information Maximizing Self-Augmented T...
 
Laura sebu icstcc_merging
Laura sebu icstcc_mergingLaura sebu icstcc_merging
Laura sebu icstcc_merging
 
STARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis driftSTARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis drift
 
Linear Regression with R programming.pptx
Linear Regression with R programming.pptxLinear Regression with R programming.pptx
Linear Regression with R programming.pptx
 
Performance evaluation of Wearable Computing Frameworks
Performance evaluation of Wearable Computing FrameworksPerformance evaluation of Wearable Computing Frameworks
Performance evaluation of Wearable Computing Frameworks
 
Grcm Presentation On Aug 4 2009
Grcm Presentation On Aug 4 2009Grcm Presentation On Aug 4 2009
Grcm Presentation On Aug 4 2009
 
Chatzikonstantinou c ai-se2013_
Chatzikonstantinou c ai-se2013_Chatzikonstantinou c ai-se2013_
Chatzikonstantinou c ai-se2013_
 
Leveraging The Machine: The Future of People Data Is Now
Leveraging The Machine: The Future of People Data Is Now Leveraging The Machine: The Future of People Data Is Now
Leveraging The Machine: The Future of People Data Is Now
 
Predire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataPredire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big Data
 

More from Sérgio Dias

Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...Sérgio Dias
 
Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...
Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...
Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...Sérgio Dias
 
Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...Sérgio Dias
 
Introdução a ciência de dados com aplicações em dados educacionais
Introdução a ciência de dados com aplicações em dados educacionaisIntrodução a ciência de dados com aplicações em dados educacionais
Introdução a ciência de dados com aplicações em dados educacionaisSérgio Dias
 
Big Data Analytics em Governo
Big Data Analytics em GovernoBig Data Analytics em Governo
Big Data Analytics em GovernoSérgio Dias
 
Proposta de Tese: Redução de Reticulados Conceituais
Proposta de Tese: Redução de Reticulados ConceituaisProposta de Tese: Redução de Reticulados Conceituais
Proposta de Tese: Redução de Reticulados ConceituaisSérgio Dias
 
Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...
Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...
Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...Sérgio Dias
 
Redução de Reticulados Conceituais
Redução de Reticulados ConceituaisRedução de Reticulados Conceituais
Redução de Reticulados ConceituaisSérgio Dias
 
Ciência de Dados em Governo
Ciência de Dados em GovernoCiência de Dados em Governo
Ciência de Dados em GovernoSérgio Dias
 
Algoritmos para Geração de Reticulados Conceituais
Algoritmos para Geração de Reticulados ConceituaisAlgoritmos para Geração de Reticulados Conceituais
Algoritmos para Geração de Reticulados ConceituaisSérgio Dias
 
Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...
Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...
Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...Sérgio Dias
 

More from Sérgio Dias (11)

Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Terceira Lista de Exercícios - Aula sobre...
 
Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...
Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...
Fundamentos da Teoria da Computação Segunda Lista de Exercícios - Aula sobre ...
 
Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...
Fundamentos da Teoria da Computação Primeira Lista de Exercícios - Aula sobre...
 
Introdução a ciência de dados com aplicações em dados educacionais
Introdução a ciência de dados com aplicações em dados educacionaisIntrodução a ciência de dados com aplicações em dados educacionais
Introdução a ciência de dados com aplicações em dados educacionais
 
Big Data Analytics em Governo
Big Data Analytics em GovernoBig Data Analytics em Governo
Big Data Analytics em Governo
 
Proposta de Tese: Redução de Reticulados Conceituais
Proposta de Tese: Redução de Reticulados ConceituaisProposta de Tese: Redução de Reticulados Conceituais
Proposta de Tese: Redução de Reticulados Conceituais
 
Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...
Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...
Edital SERPRO/UFMG 01/2013 - “Práticas de Mineração de Dados Escalável para S...
 
Redução de Reticulados Conceituais
Redução de Reticulados ConceituaisRedução de Reticulados Conceituais
Redução de Reticulados Conceituais
 
Ciência de Dados em Governo
Ciência de Dados em GovernoCiência de Dados em Governo
Ciência de Dados em Governo
 
Algoritmos para Geração de Reticulados Conceituais
Algoritmos para Geração de Reticulados ConceituaisAlgoritmos para Geração de Reticulados Conceituais
Algoritmos para Geração de Reticulados Conceituais
 
Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...
Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...
Extração e Representação de Conhecimento de Redes Neurais Artificiais Utiliza...
 

Recently uploaded

GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Presentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxPresentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxRosabel UA
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxElton John Embodo
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
TEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxTEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxruthvilladarez
 

Recently uploaded (20)

GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Presentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxPresentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptx
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docx
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
TEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxTEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docx
 

Formal Concept Analysis applied to Professional Social Networks

  • 1. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Formal Concept Analysis applied to Professional Social Networks Paula Silva, S´ergio M. Dias, Wladmir Brand˜ao, Mark Song, Luis Z´arate (supervisor) Pontifical Catholic University of Minas Gerais, Brazil ICEIS, 2017 ICEIS, 2017 1 / 26
  • 2. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Context and Motivation Online social networks for users oriented to business The LinkedIn is the one of the largest and most popular online professional social network The size and diversity of users generated content data The need to help professionals to increase skills and reach job positions The Formal Concept Analysis (FCA) as mathematical formulation for data analysis, applied to find patterns of professional competence ICEIS, 2017 2 / 26
  • 3. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Goal Identify professional behaviors through data scraped from LinkedIn Find the minimum set of skills that is necessary to reach job positions For example: statistic, machine learning, databases → data scientist Implications rules, specifically the set of proper implications ICEIS, 2017 3 / 26
  • 4. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Contributions Our contributions are: The domain problem mapping the model of competences The professionals data set scraped from LinkedIn The FCA-based approach The set of experiments to apply FCA for professional career analysis ICEIS, 2017 4 / 26
  • 5. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Formal Concept Analysis Based on the notions of concept and conceptual hierarchy A mathematical way to look at data and knowledge, their acquisition process and analysis based on lattices. There are three main principles: Formal context Formal concept Implications ICEIS, 2017 5 / 26
  • 6. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Formal Context Formal context (G, M, I) a set G of objects a set M of attributes a binary relation I ⊆ G ×M a b c d e f g 18 x x 19 x x x 20 x x x x 21 x x x 22 x x 23 x x x x x 24 x x Table: Example context of an user’s LinkedIn skills. ICEIS, 2017 6 / 26
  • 7. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Formal Concept Derivation operators: For A ⊆ G and B ⊆ M A := {m ∈ M|gIm∀g ∈ A} B := {g ∈ G|gIm∀m ∈ B} Formal concept (A, B) A ⊆ G B ⊆ M A = B B = A A is concept extent and B is concept intent e.g. formal concept ({18, 21, 24}, {software engineering, data bases}) ICEIS, 2017 7 / 26
  • 8. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Implication Rules Being a formal context (G, M, I), an implication over M is P → Q, where P,Q ⊆ M. P is the premise Q is the conclusion P → Q has to be such that P ⊆ Q , so the sets of attributes P and Q share the same subset of objects. ICEIS, 2017 8 / 26
  • 9. Introduction FCA FCA-based Approach Experiments Conclusions Future Works The Set of Proper Implications the right hand side of each implication is unitary: if P → m ∈ I , then m ∈ M; superfluous implications are not allowed: if P → m ∈ I , then m /∈ P; specializations are not allowed, i.e. left hand sides are minimal: if P → m ∈ I , then there is not any Q → m ∈ I such that Q ⊂ P. For example, {e} → {a} is a proper implication, but {e, g} → {a} is not a proper implication. ICEIS, 2017 9 / 26
  • 10. Introduction FCA FCA-based Approach Experiments Conclusions Future Works FCA-based Approach ICEIS, 2017 10 / 26
  • 11. Introduction FCA FCA-based Approach Experiments Conclusions Future Works 1 - Problem Domain Building the conceptual model according to the problem to be treated The classification of informational categories, based on Model of Competence Figure: Duran(1998) adapted model ICEIS, 2017 11 / 26
  • 12. Introduction FCA FCA-based Approach Experiments Conclusions Future Works 1 - Problem Domain We identified 3 dimensions, 14 aspects and 51 variables Figure: Professional identification through model of competence ICEIS, 2017 12 / 26
  • 13. Introduction FCA FCA-based Approach Experiments Conclusions Future Works 2 - Scrapping The scrapping process was divided into two phases. Selecting initial seeds randomly Collecting the public profiles data: People undergraduate in IT coursers in Minas Gerais, Brazil ICEIS, 2017 13 / 26
  • 14. Introduction FCA FCA-based Approach Experiments Conclusions Future Works 3 - Preprocessing We only considered the variables skills and experience The ETL process: String cleaning: UTF-8 encoding correction, accent removal, standardization of all terms for the English language through Google Translate API Attribute reductions: reductions based on semantic relevance Generic term Specific term Java JPA JSF Software developer Developer Programmer Program developer Formal context with 366 attributes and 970 objects ICEIS, 2017 14 / 26
  • 15. Introduction FCA FCA-based Approach Experiments Conclusions Future Works 4 - Knowledge Extractor The PropIm algorithm Finding supersets: If A → b, so ↑ |(A → b) | and ↓ |A| It computes proper implications with support > 0 Easily scalable strategy Allows to set the attributes of interest for implications’ conclusions Problem complexity: O(|M||I |(|G||M|+|I ||M|)) Pruning heuristic to reduce combinations possibilities among attributes ICEIS, 2017 15 / 26
  • 16. Introduction FCA FCA-based Approach Experiments Conclusions Future Works 4 - Knowledge Extractor Input : Formal context (G,M,I) Output: Set of proper implications I with support greater than 0 1 I = /0 2 foreach m ∈ M do 3 P = m 4 size = 1 5 Pa = /0 6 while size < |P| do 7 C = P size 8 PC = getCandidate(C,Pa) 9 foreach P1 ⊂ PC do 10 if P1 = /0 and P1 ⊂ m then 11 Pa = Pa ∪{P1} 12 I = I ∪{P1 → m} 13 end 14 end 15 size ++ 16 end 17 end 18 return I 1 Function getCandidate (C,Pa) 2 D = /0 3 foreach a ∈ A|A ⊂ Pa do 4 foreach B ⊂ C do 5 if a /∈ B then 6 D = PC B 7 end 8 end 9 end 10 return D ICEIS, 2017 16 / 26
  • 17. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Experiments The goal was to answer the following questions: How do proper implications identify relations between skills and positions? Could we find intersections among sets of skills, and what do these intersections represent? ICEIS, 2017 17 / 26
  • 18. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Proper Implications to Competence Identification We selected 20 positions and their 180 skills It was extracted 895 proper implications with PropIm algorithm Figure: Proper implications network. AD = Administrative Director, ITC = IT Consultant ICEIS, 2017 18 / 26
  • 19. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Proper Implications to Competence Identification Nodes with high in-degree value represent positions which have more diversification of sets of skills 163 proper implications {entrepreneurship, human resources, information management} → {administrative director} Figure: AD = Administrative Director ICEIS, 2017 19 / 26
  • 20. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Proper Implications to Competence Identification Nodes with low in-degree value represent jobs positions that demand more specific sets of skills 3 proper implications {ABAP, agile methodology, BI} → {it consultant} Figure: ITC = IT Consultant ICEIS, 2017 20 / 26
  • 21. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Intersection Between Skills and Job Positions Top 3 best jobs in Information Technology area according to Career Cast research (Cast, 2016) Edges weight = relative frequency F = Fi Fp , where F is the relative frequency, Fi is the implication absolute frequency and Fp is |m | Why relative frequency? {java frameworks}→{software engineer} Support: 2.47% Relative frequency: 75.56% Figure: P1: data scientist, P2: information security analyst and P3: software engineer job position ICEIS, 2017 21 / 26
  • 22. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Intersection Between Skills and Job Positions Nodes P1 and P2 share two set of skills: {agile methodology} → {data scientist} {agile methodology} → {information security analyst} Figure: P1: data scientist, P2: information security analyst and P3: software engineer job position ICEIS, 2017 22 / 26
  • 23. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Intersection Between Skills and Job Positions Skills and positions related to hierarchical transition: P1: {.NET, automation systems} → IT analyst P2: {.NET, data base, ERP, it governance} → IT coordinator P3: {BPM, cloud computing, CRM} → IT manager P4: {assets management, BI, business management, consulting} → IT director Figure: P1 to P4 represents the following job positions: P1 is IT analyst, P2 is IT manager, P3 is IT coordinator and P4 is IT director ICEIS, 2017 23 / 26
  • 24. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Conclusions FCA-based approach to identify the minimum sets of skills that is necessary to achieve a job position Set of experiments for apply FCA to professional competences analysis Computational strategy to find proper implications without loss of information In-degree of conclusions nodes mean the diversification among sets of skills for the same job position Sharing sets of skills do not determine the job position, but show positions that can be achieved The disjointed sets (representing hierarchy) show that is necessary develop skills of different natures to progress in career ICEIS, 2017 24 / 26
  • 25. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Future Works Experiments will be replicated for other areas Exploring another algorithms which extract implications from concept lattice, or from the set of formal concepts Expanding the analysis to all dimensions from model of competence Implementing an web environment with this FCA-based approach, for help professionals to increase skills and look for possible job positions Preprocessing: attribute fusion through correlation analysis Temporal analysis of career evolution Estimate the remuneration of people based on posts and professional data ICEIS, 2017 25 / 26
  • 26. Introduction FCA FCA-based Approach Experiments Conclusions Future Works Any questions? paula.raissa@sga.pucminas.br paula.csraissa@gmail.com ICEIS, 2017 26 / 26