SlideShare uma empresa Scribd logo
1 de 25
Modeling Professional Similarity by
Mining Professional Career Trajectories
Zang Li @ ACMKDD 2014, New_York_City
Outline
• Motivation
• Problem Statement
• Approach
• Evaluation
 Future Works
Similar People
How you rank for
profile views
People You May Hire Lead Recommendations
Similar Profiles Recommender
Recruiter Sales Navigator Subscription
Linkedin Profiles
• SimilarProfiles:
– Source profile -> return K target profiles with similar skills, titles, companies…
– keyword-based text matching
• Temporal information in a Linkedin profile
Rerank Top K SimProfiles:
Career trajectory Similarity
Outline
• Motivation
• Problem Statement
• Approach
• Evaluation
 Future Works
Title : Software Engineer, Research
Engineer, Research Assistant
Specialty : Machine Learning, Data
Analysis, Hadoop, Networks
Company: Cisco, Linkedin, Penn
State
Summary: Software Engineer,
Research Engineer, Machine
Learning, Data Analysis, Networks,
Research Assistant
How to model a profile with career trajectory?
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: Networks
- Company: Penn State
- Title: Research Assistant
- Duration: (2006.9-2010.6)
Keywords Profile Model Sequence Profile Model
How to match two profiles ?
Title : Software Engineer, RA
Specialty: ML, Networks…
Company: Cisco, Linkedin, Penn
State
Summary: Software Engineer
ML, Networks, …
Title : Software Engineer, Ph.D.
Specialty: ML, DM, Mobile…
Company: Yahoo, Linkedin,
Intel, Dartmouth
Summary: Software Engineer
ML, DM, Mobile …
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: ML
- Company: Linkedin
- Title: Software Engineer
- Duration: (2012.7-2013.3)
- Summary: Mobile
- Company: Intel
- Title: Software Engineer
- Duration: (2010.5-2012.3)
- Summary: Hadoop, DM
- Company: Yahoo
- Title: Research Scientist
- Duration: (2008.8-2010.4)
Similar Profiles
Similar Career Paths
?
Outline
• Motivation
• Problem Statement
• Approach
• Evaluation
 Future Works
Approach: Sequence Alignment
• Match and align positions from two
career sequences according to the
similarity between them.
1. What is position-level similarity
function?
2. How to identify optimal match of
pairs of positions?
Position Similarity Function
• Measure for position similarity
– Build a Logistic regression model
– Feature interaction
• company, industry, title, seniority, job function, summary
• E.g., title: whether the two positions share the same title name
– Feature weight learning
• Label: InMail via Similar Profiles
– Use recruiter InMail to infer position similarity label.
– Assumption: If recruiters send inMails to a target user from
SimilarProfiles results of the source user, then the current positions
(between source and target) are similar.
– Amongst the rest of the recommended profiles, all the ones that were
ranked higher but were not reached out by any recruiter are regarded as
negative profile-pairs
Sequence Alignment
PathSim[1:i, 1:j] =
PathSim[1:i-1, 1:j-1] + PosSim[i, j]
PathSim[1:i-1, 1:j] – penalty
PathSim[1:i, 1:j-1] - penalty
max
Match Pos i and Pos j
Skip Pos j
Skip Pos i
Sequence of Positions
• Profile 1
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: Networks
- Company: Penn State
- Title: Research Assistant
- Duration: (2006.9-2010.6)
• Profile 2
- Summary: ML
- Company: Linkedin
- Title: Software Engineer
- Duration: (2012.7-2013.3)
- Summary: Mobile
- Company: Intel
- Title: Software Engineer
- Duration: (2010.5-2012.3)
- Summary: Hadoop, DM
- Company: Yahoo
- Title: Research Scientist
- Duration: (2008.8-2010.4)
- Summary: Sensor
- Company: Dartmouth
- Title: Ph.D. Student
- Duration: (2002.9-2008.7)
Sequence of Compositions
• Profile 1
- Summary: Data Analysis
- Company: Cisco
- Title: Research Engineer
- Duration: (2010.7-2011.4)
- Summary: Networks
- Company: Penn State
- Title: Research Assistant
- Duration: (2006.9-2010.6)
- Summary: ML, Hadoop
- Company: Linkedin
- Title: Software Engineer
- Duration: (2011.5-2013.3)
• Profile 2
- Summary: Mobile
- Company: Intel
- Title: Software Engineer
- Duration: (2010.5-2012.3)
- Summary: Hadoop, DM
- Company: Yahoo
- Title: Research Scientist
- Duration: (2008.8-2010.4)
- Summary: ML
- Company: Linkedin
- Title: Software Engineer
- Duration: (2012.7-2013.3)
- SenChg: 0
- ComChg: YES
- Duration: 4
- SenChg: 7
- ComChg: YES
- Duration: 21
- SenChg: 7
- ComChg: YES
- Duration: 46
- SenChg: 2
- ComChg: YES
- Duration: 10
Sequence of Compositions
• Similarity measure for composition node
– Position features: company, industry, title, seniority, job
function, summary
• E.g., title: whether the two positions share the same title name.
– Transition features: same company, same industry, title
change, seniority change, year durations
• E.g., same company: whether both of the two transitions happen
within the same company or not.
Recency and Duration
• Incentivize matching more recent positions
• Incentivize matching longer duration positions
Similar Profiles score as a prior
• Sparse positions
• Bagging
Outline
• Motivation
• Problem Statement
• Approach
• Evaluation
 Future Works
Offline Evaluation: NDCG
• get the testing data from Similar Profiles Recommender System on
LinkedIn Recruiter.
• we employ InMail sending and receiving to infer this ground truth.
Online Evaluation: A/B Test
• When giving more weight to the SimCareers score, we get a
larger increase in both Profile Views and InMails sent.
• Capturing temporal information of career trajectory is
crucial in computing profile similarity for hiring.
Rerank Top K Similar
Profiles Recommendations
Outline
• Motivation
• Problem Statement
• Approach
• Evaluation
 Future Works
Future Works
• Skip VS. merge
– Lateral move, promotion
– Unaligned position does not hurt.
Thanks!

Mais conteúdo relacionado

Semelhante a SCP_lnkd_Presentation_kdd2014.pptx

Course Description Considering that an organization’s peopl
Course Description   Considering that an organization’s peoplCourse Description   Considering that an organization’s peopl
Course Description Considering that an organization’s peopl
CruzIbarra161
 
Workshop Resume Writing Tips
Workshop Resume Writing TipsWorkshop Resume Writing Tips
Workshop Resume Writing Tips
ggcareerservices
 
Evaluating LLM Models for Production Systems Methods and Practices -
Evaluating LLM Models for Production Systems Methods and Practices -Evaluating LLM Models for Production Systems Methods and Practices -
Evaluating LLM Models for Production Systems Methods and Practices -
alopatenko
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge Graph
Trey Grainger
 

Semelhante a SCP_lnkd_Presentation_kdd2014.pptx (20)

Designing and Building a Graph Database Application - Ian Robinson (Neo Techn...
Designing and Building a Graph Database Application - Ian Robinson (Neo Techn...Designing and Building a Graph Database Application - Ian Robinson (Neo Techn...
Designing and Building a Graph Database Application - Ian Robinson (Neo Techn...
 
Use of Contextualized Attention Metadata for Ranking and Recommending Learnin...
Use of Contextualized Attention Metadata for Ranking and Recommending Learnin...Use of Contextualized Attention Metadata for Ranking and Recommending Learnin...
Use of Contextualized Attention Metadata for Ranking and Recommending Learnin...
 
IRE_Group52
IRE_Group52IRE_Group52
IRE_Group52
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
 
Improving Search in Workday Products using Natural Language Processing
Improving Search in Workday Products using Natural Language ProcessingImproving Search in Workday Products using Natural Language Processing
Improving Search in Workday Products using Natural Language Processing
 
Course Description Considering that an organization’s peopl
Course Description   Considering that an organization’s peoplCourse Description   Considering that an organization’s peopl
Course Description Considering that an organization’s peopl
 
Resume Parser with Natural Language Processing
Resume Parser with Natural Language ProcessingResume Parser with Natural Language Processing
Resume Parser with Natural Language Processing
 
Workshop Resume Writing Tips
Workshop Resume Writing TipsWorkshop Resume Writing Tips
Workshop Resume Writing Tips
 
Shall we search? Lviv.
Shall we search? Lviv. Shall we search? Lviv.
Shall we search? Lviv.
 
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn SearchStructure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
 
Machine Learning Teams - Full Stack Deep Learning
Machine Learning Teams - Full Stack Deep LearningMachine Learning Teams - Full Stack Deep Learning
Machine Learning Teams - Full Stack Deep Learning
 
Data modeling with neo4j tutorial
Data modeling with neo4j tutorialData modeling with neo4j tutorial
Data modeling with neo4j tutorial
 
From Academic Papers To Production : A Learning To Rank Story
From Academic Papers To Production : A Learning To Rank StoryFrom Academic Papers To Production : A Learning To Rank Story
From Academic Papers To Production : A Learning To Rank Story
 
Resume and the Future of Internet Recruiting
Resume and the Future of Internet RecruitingResume and the Future of Internet Recruiting
Resume and the Future of Internet Recruiting
 
Evaluating LLM Models for Production Systems Methods and Practices -
Evaluating LLM Models for Production Systems Methods and Practices -Evaluating LLM Models for Production Systems Methods and Practices -
Evaluating LLM Models for Production Systems Methods and Practices -
 
Interleaving, Evaluation to Self-learning Search @904Labs
Interleaving, Evaluation to Self-learning Search @904LabsInterleaving, Evaluation to Self-learning Search @904Labs
Interleaving, Evaluation to Self-learning Search @904Labs
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge Graph
 
Recruiting for Drupal #Hiring
Recruiting for Drupal #HiringRecruiting for Drupal #Hiring
Recruiting for Drupal #Hiring
 
A Recruiter's Checklist (Strings Attached)
A Recruiter's Checklist (Strings Attached)A Recruiter's Checklist (Strings Attached)
A Recruiter's Checklist (Strings Attached)
 

Último

Último (10)

The Influence and Evolution of Mogul Press in Contemporary Public Relations.docx
The Influence and Evolution of Mogul Press in Contemporary Public Relations.docxThe Influence and Evolution of Mogul Press in Contemporary Public Relations.docx
The Influence and Evolution of Mogul Press in Contemporary Public Relations.docx
 
Databricks Machine Learning Associate Exam Dumps 2024.pdf
Databricks Machine Learning Associate Exam Dumps 2024.pdfDatabricks Machine Learning Associate Exam Dumps 2024.pdf
Databricks Machine Learning Associate Exam Dumps 2024.pdf
 
Breathing in New Life_ Part 3 05 22 2024.pptx
Breathing in New Life_ Part 3 05 22 2024.pptxBreathing in New Life_ Part 3 05 22 2024.pptx
Breathing in New Life_ Part 3 05 22 2024.pptx
 
2024-05-15-Surat Meetup-Hyperautomation.pptx
2024-05-15-Surat Meetup-Hyperautomation.pptx2024-05-15-Surat Meetup-Hyperautomation.pptx
2024-05-15-Surat Meetup-Hyperautomation.pptx
 
Understanding Poverty: A Community Questionnaire
Understanding Poverty: A Community QuestionnaireUnderstanding Poverty: A Community Questionnaire
Understanding Poverty: A Community Questionnaire
 
DAY 0 8 A Revelation 05-19-2024 PPT.pptx
DAY 0 8 A Revelation 05-19-2024 PPT.pptxDAY 0 8 A Revelation 05-19-2024 PPT.pptx
DAY 0 8 A Revelation 05-19-2024 PPT.pptx
 
ACM CHT Best Inspection Practices Kinben Innovation MIC Slideshare.pdf
ACM CHT Best Inspection Practices Kinben Innovation MIC Slideshare.pdfACM CHT Best Inspection Practices Kinben Innovation MIC Slideshare.pdf
ACM CHT Best Inspection Practices Kinben Innovation MIC Slideshare.pdf
 
Deciding The Topic of our Magazine.pptx.
Deciding The Topic of our Magazine.pptx.Deciding The Topic of our Magazine.pptx.
Deciding The Topic of our Magazine.pptx.
 
Microsoft Fabric Analytics Engineer (DP-600) Exam Dumps 2024.pdf
Microsoft Fabric Analytics Engineer (DP-600) Exam Dumps 2024.pdfMicrosoft Fabric Analytics Engineer (DP-600) Exam Dumps 2024.pdf
Microsoft Fabric Analytics Engineer (DP-600) Exam Dumps 2024.pdf
 
ServiceNow CIS-Discovery Exam Dumps 2024
ServiceNow CIS-Discovery Exam Dumps 2024ServiceNow CIS-Discovery Exam Dumps 2024
ServiceNow CIS-Discovery Exam Dumps 2024
 

SCP_lnkd_Presentation_kdd2014.pptx

  • 1. Modeling Professional Similarity by Mining Professional Career Trajectories Zang Li @ ACMKDD 2014, New_York_City
  • 2. Outline • Motivation • Problem Statement • Approach • Evaluation  Future Works
  • 3. Similar People How you rank for profile views People You May Hire Lead Recommendations Similar Profiles Recommender Recruiter Sales Navigator Subscription Linkedin Profiles
  • 4. • SimilarProfiles: – Source profile -> return K target profiles with similar skills, titles, companies… – keyword-based text matching • Temporal information in a Linkedin profile
  • 5.
  • 6. Rerank Top K SimProfiles: Career trajectory Similarity
  • 7. Outline • Motivation • Problem Statement • Approach • Evaluation  Future Works
  • 8. Title : Software Engineer, Research Engineer, Research Assistant Specialty : Machine Learning, Data Analysis, Hadoop, Networks Company: Cisco, Linkedin, Penn State Summary: Software Engineer, Research Engineer, Machine Learning, Data Analysis, Networks, Research Assistant How to model a profile with career trajectory? - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: Networks - Company: Penn State - Title: Research Assistant - Duration: (2006.9-2010.6) Keywords Profile Model Sequence Profile Model
  • 9. How to match two profiles ? Title : Software Engineer, RA Specialty: ML, Networks… Company: Cisco, Linkedin, Penn State Summary: Software Engineer ML, Networks, … Title : Software Engineer, Ph.D. Specialty: ML, DM, Mobile… Company: Yahoo, Linkedin, Intel, Dartmouth Summary: Software Engineer ML, DM, Mobile … - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: ML - Company: Linkedin - Title: Software Engineer - Duration: (2012.7-2013.3) - Summary: Mobile - Company: Intel - Title: Software Engineer - Duration: (2010.5-2012.3) - Summary: Hadoop, DM - Company: Yahoo - Title: Research Scientist - Duration: (2008.8-2010.4) Similar Profiles Similar Career Paths ?
  • 10. Outline • Motivation • Problem Statement • Approach • Evaluation  Future Works
  • 11. Approach: Sequence Alignment • Match and align positions from two career sequences according to the similarity between them. 1. What is position-level similarity function? 2. How to identify optimal match of pairs of positions?
  • 12. Position Similarity Function • Measure for position similarity – Build a Logistic regression model – Feature interaction • company, industry, title, seniority, job function, summary • E.g., title: whether the two positions share the same title name – Feature weight learning • Label: InMail via Similar Profiles – Use recruiter InMail to infer position similarity label. – Assumption: If recruiters send inMails to a target user from SimilarProfiles results of the source user, then the current positions (between source and target) are similar. – Amongst the rest of the recommended profiles, all the ones that were ranked higher but were not reached out by any recruiter are regarded as negative profile-pairs
  • 13. Sequence Alignment PathSim[1:i, 1:j] = PathSim[1:i-1, 1:j-1] + PosSim[i, j] PathSim[1:i-1, 1:j] – penalty PathSim[1:i, 1:j-1] - penalty max Match Pos i and Pos j Skip Pos j Skip Pos i
  • 14. Sequence of Positions • Profile 1 - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: Networks - Company: Penn State - Title: Research Assistant - Duration: (2006.9-2010.6) • Profile 2 - Summary: ML - Company: Linkedin - Title: Software Engineer - Duration: (2012.7-2013.3) - Summary: Mobile - Company: Intel - Title: Software Engineer - Duration: (2010.5-2012.3) - Summary: Hadoop, DM - Company: Yahoo - Title: Research Scientist - Duration: (2008.8-2010.4) - Summary: Sensor - Company: Dartmouth - Title: Ph.D. Student - Duration: (2002.9-2008.7)
  • 15. Sequence of Compositions • Profile 1 - Summary: Data Analysis - Company: Cisco - Title: Research Engineer - Duration: (2010.7-2011.4) - Summary: Networks - Company: Penn State - Title: Research Assistant - Duration: (2006.9-2010.6) - Summary: ML, Hadoop - Company: Linkedin - Title: Software Engineer - Duration: (2011.5-2013.3) • Profile 2 - Summary: Mobile - Company: Intel - Title: Software Engineer - Duration: (2010.5-2012.3) - Summary: Hadoop, DM - Company: Yahoo - Title: Research Scientist - Duration: (2008.8-2010.4) - Summary: ML - Company: Linkedin - Title: Software Engineer - Duration: (2012.7-2013.3) - SenChg: 0 - ComChg: YES - Duration: 4 - SenChg: 7 - ComChg: YES - Duration: 21 - SenChg: 7 - ComChg: YES - Duration: 46 - SenChg: 2 - ComChg: YES - Duration: 10
  • 16. Sequence of Compositions • Similarity measure for composition node – Position features: company, industry, title, seniority, job function, summary • E.g., title: whether the two positions share the same title name. – Transition features: same company, same industry, title change, seniority change, year durations • E.g., same company: whether both of the two transitions happen within the same company or not.
  • 17. Recency and Duration • Incentivize matching more recent positions • Incentivize matching longer duration positions
  • 18. Similar Profiles score as a prior • Sparse positions • Bagging
  • 19. Outline • Motivation • Problem Statement • Approach • Evaluation  Future Works
  • 20. Offline Evaluation: NDCG • get the testing data from Similar Profiles Recommender System on LinkedIn Recruiter. • we employ InMail sending and receiving to infer this ground truth.
  • 21. Online Evaluation: A/B Test • When giving more weight to the SimCareers score, we get a larger increase in both Profile Views and InMails sent. • Capturing temporal information of career trajectory is crucial in computing profile similarity for hiring.
  • 22. Rerank Top K Similar Profiles Recommendations
  • 23. Outline • Motivation • Problem Statement • Approach • Evaluation  Future Works
  • 24. Future Works • Skip VS. merge – Lateral move, promotion – Unaligned position does not hurt.