SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
Introduction Method Results Summary
Contextualization of Topics
browsing through terms, authors, journals and cluster allocations
Rob Koopman1 Shenghui Wang1 Andrea Scharnhorst2
1OCLC Research 2DANS-KNAW
ISSI 2015
Introduction Method Results Summary
Introduction
What are essence and boundary of a scientific field?
Different ways to find clusters in scientific literature based on
connectivity in terms of authorship, citations, language
similarity, etc.
Ambiguous nature in science
Introduction Method Results Summary
Ariadne: interactive context explorer
Ariadne is an interactive interface which allows users to
explore the context of entities such as authors, journals,
topical terms, etc.
It builds on semantic indexing statistically computed from a
large scale bibliographic corpus
It was originally implemented to explore 1M topical terms, 3M
authors, 35K journals and 700+ Dewey decimal classes
associated with 65M articles.
Introduction Method Results Summary
Research questions
Q1: How does the Ariadne algorithm work on a much smaller,
field specific dataset?
Q2: Can we use Ariadne to label the clusters produce by the
different methods?
Q3: Can we use Ariadne to compare different clustering
solutions?
Introduction Method Results Summary
LittleAriadne
LittleAriadne: context explorer over Astrophysics data
Offline: generates a semantic representation for each entity
Online: finds the most related entities and using
multidimensional scaling to display
Introduction Method Results Summary
LittleAriadne
An example article
Article ID ISI:000276828000006
Title On the Mass Transfer Rate in SS Cyg
Abstract The mass transfer rate in SS Cyg at quiescence, estimated
from the observed luminosity of the hot spot, is log M-tr
= 16.8 +/- 0.3. This is safely below the critical mass
transfer rates of log M-crit = 18.1 (corresponding to log
T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to
the “revised” value of log T-crit(0) = 3.65). The mass
transfer rate during outbursts is strongly enhanced
Author [author:smak j]
ISSN [issn:0001-5237]
Subject [subject:accretion, accretion disks] [subject:cataclysmic
variables] [subject:disc instability model] [subject:dwarf novae]
[subject:novae, cataclysmic variables] [subject:outbursts]
[subject:parameters] [subject:stars] [subject:stars dwarf novae]
[subject:stars individual ss cyg] [subject:state] [subject:
superoutbursts]
Introduction Method Results Summary
LittleAriadne
An example article
Article ID ISI:000276828000006
Title On the Mass Transfer Rate in SS Cyg
Abstract The mass transfer rate in SS Cyg at quiescence, estimated
from the observed luminosity of the hot spot, is log M-tr
= 16.8 +/- 0.3. This is safely below the critical mass
transfer rates of log M-crit = 18.1 (corresponding to log
T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to
the “revised” value of log T-crit(0) = 3.65). The mass
transfer rate during outbursts is strongly enhanced
Author [author:smak j]
ISSN [issn:0001-5237]
Subject [subject:accretion, accretion disks] [subject:cataclysmic
variables] [subject:disc instability model] [subject:dwarf novae]
[subject:novae, cataclysmic variables] [subject:outbursts]
[subject:parameters] [subject:stars] [subject:stars dwarf novae]
[subject:stars individual ss cyg] [subject:state] [subject:
superoutbursts]
Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15]
[cluster:d 51] [cluster:e 17] [cluster:f 1]
Introduction Method Results Summary
LittleAriadne
Six different clustering solutions
x Source y=#Cluster #Cluster in LittleAriadne
a cwts 1.8 23 23
b UMSI 23 23
c oclc 20 20 20
d hu 139 48
e sts 5664 229
f ECOOM 15 15
Introduction Method Results Summary
LittleAriadne
Entities in the Astrophysis dataset
There are in total 90,343 entities associated with 111,616
astrophysics articles
59 journals
27,027 author names (no disambiguation applied)
39,577 topical terms
23,322 subjects (extracted from ”Author Keywords” and
”Keywords Plus”)
358 cluster labels (source + cluster id)
Introduction Method Results Summary
LittleAriadne
Build semantic representation
Basic assumptions
Entities can be represented by its context
Entities which share more context are more likely to be related
Context is the textual environment where an entity occurs
Introduction Method Results Summary
LittleAriadne
An example article
Article ID ISI:000276828000006
Title On the Mass Transfer Rate in SS Cyg
Abstract The mass transfer rate in SS Cyg at quiescence, estimated
from the observed luminosity of the hot spot, is log M-tr
= 16.8 +/- 0.3. This is safely below the critical mass
transfer rates of log M-crit = 18.1 (corresponding to log
T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to
the “revised” value of log T-crit(0) = 3.65). The mass
transfer rate during outbursts is strongly enhanced
Author [author:smak j]
ISSN [issn:0001-5237]
Subject [subject:accretion, accretion disks] [subject:cataclysmic
variables] [subject:disc instability model] [subject:dwarf novae]
[subject:novae, cataclysmic variables] [subject:outbursts]
[subject:parameters] [subject:stars] [subject:stars dwarf novae]
[subject:stars individual ss cyg] [subject:state] [subject:
superoutbursts]
Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15]
[cluster:d 51] [cluster:e 17] [cluster:f 1]
Introduction Method Results Summary
LittleAriadne
Dimension reduction using Random Projection
mass
transfer
rate
[subject:outburst]
[subject:sstars]
[subject:parameters]
[author:smak j]
[cluster: a19]
[issn:0001-5237]
Introduction Method Results Summary
LittleAriadne
Dimension reduction using Random Projection
mass
transfer
rate
[subject:outburst]
[subject:sstars]
[subject:parameters]
[author:smak j]
[cluster: a19]
[issn:0001-5237]
Introduction Method Results Summary
LittleAriadne
From semantic representation to visualisation and more
Each entity has its semantic representation
Cosine similarity between entities can be computed very fast,
based on which the 2D visualisation is implemented
Introduction Method Results Summary
LittleAriadne
From semantic representation to visualisation and more
Each entity has its semantic representation
Cosine similarity between entities can be computed very fast,
based on which the 2D visualisation is implemented
For each article, we collected the semantic representation of
all the entities in which it involves, and take an average as its
semantic representation
We applied a standard K-means clustering method to cluster
these articles based on their semantic representations
Introduction Method Results Summary
Experiment 1: Exploring context
Experiment 1: Exploring context
Now we can explore
Let’s start with stars
An overview of all journals
Introduction Method Results Summary
Experiment 1: Exploring context
Contextual view of stars
Introduction Method Results Summary
Experiment 2: Labelling clusters
Experiment 2: Labelling clusters
What is cluster a 2?
Introduction Method Results Summary
Experiment 2: Labelling clusters
Experiment 2: Labelling clusters
Introduction Method Results Summary
Experiment 2: Labelling clusters
Experiment 2: Labelling clusters
Cluster ID Top 9 most related topical terms
a 2 ”cosmology” ”dark energy” ”density perturbations”
”cosmologies” ”planck” ”cosmological” ”spatial
curvature” ”inflationary” ”inflation”
b 2 ”cosmology” ”cosmological constant” ”cosmologies”
”cosmological” ”universes” ”dark energy” ”quadratic”
”tensor” ”planck”
c 17 ”power spectrum” ”cosmological parameters” ”cmb”
”last scattering” ”anisotropies” ”microwave background”
”power spectra” ”planck” ”cosmic microwave”
d 28 ”density perturbations” ”inflationary” ”inflation”
”dark energy” ”scale invariant” ”spatial curvature”
”cosmological perturbations” ”inflationary models”
”cosmologies”
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
Experiment 3: Comparing clustering solutions
Cluster labels are treated as entities
Let’s compare
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
Highly similar clustering solutions
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
Partially agreeing clustering solutions
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
An overview of all clustering solutions
Introduction Method Results Summary
Summary
Summary
We present a method and an interface that allows visual
exploration through the contexts of entities
We can provide the most related topical terms to clusters
although expert knowledge is needed to transform them into
real labels/topics
LittleAriadne provides a visual way of comparing different
clustering solutions
Our na¨ıve way of clustering is worth exploring further
Introduction Method Results Summary
Future extensions
Future extensions
Add more types of entities, such as citations, publishers,
conferences, etc, to provide richer context
Add direct links to articles to answer information retrieval
needs
Study context sensitivity
compare ”young” and ”young”
Introduction Method Results Summary
Thank you
Thank you
http://thoth.pica.nl/astro/relate
Rob Koopman (rob.koopman@oclc.org)
Shenghui Wang (shenghui.wang@oclc.org)
Andrea Scharnhorst (andrea.scharnhorst@dans.knaw.nl)

Mais conteúdo relacionado

Mais procurados

Paolo Creminelli "Dark Energy after GW170817"
Paolo Creminelli "Dark Energy after GW170817"Paolo Creminelli "Dark Energy after GW170817"
Paolo Creminelli "Dark Energy after GW170817"SEENET-MTP
 
Prediction of just suspended speed for mixed slurries at
Prediction of just suspended speed for mixed slurries atPrediction of just suspended speed for mixed slurries at
Prediction of just suspended speed for mixed slurries atfabiola_9
 
DOI: 10.1007/s10773-009-0027-9
DOI: 10.1007/s10773-009-0027-9DOI: 10.1007/s10773-009-0027-9
DOI: 10.1007/s10773-009-0027-91neviv0
 
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...SEENET-MTP
 
Torsion and gravity seminar
Torsion and gravity seminarTorsion and gravity seminar
Torsion and gravity seminarHalo Anwar
 
Origin of Universe (Twin)
Origin of Universe (Twin)Origin of Universe (Twin)
Origin of Universe (Twin)omsai_999
 
"Warm tachyon matter" - N. Bilic
"Warm tachyon matter" - N. Bilic"Warm tachyon matter" - N. Bilic
"Warm tachyon matter" - N. BilicSEENET-MTP
 
Universe (Twin)
Universe (Twin)Universe (Twin)
Universe (Twin)omsai_999
 
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...Steven Duplij (Stepan Douplii)
 

Mais procurados (14)

Go2411811184
Go2411811184Go2411811184
Go2411811184
 
Paolo Creminelli "Dark Energy after GW170817"
Paolo Creminelli "Dark Energy after GW170817"Paolo Creminelli "Dark Energy after GW170817"
Paolo Creminelli "Dark Energy after GW170817"
 
Prediction of just suspended speed for mixed slurries at
Prediction of just suspended speed for mixed slurries atPrediction of just suspended speed for mixed slurries at
Prediction of just suspended speed for mixed slurries at
 
DOI: 10.1007/s10773-009-0027-9
DOI: 10.1007/s10773-009-0027-9DOI: 10.1007/s10773-009-0027-9
DOI: 10.1007/s10773-009-0027-9
 
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
 
MHD Newtonian and non-Newtonian Nano Fluid Flow Passing On A Magnetic Sphere ...
MHD Newtonian and non-Newtonian Nano Fluid Flow Passing On A Magnetic Sphere ...MHD Newtonian and non-Newtonian Nano Fluid Flow Passing On A Magnetic Sphere ...
MHD Newtonian and non-Newtonian Nano Fluid Flow Passing On A Magnetic Sphere ...
 
Torsion and gravity seminar
Torsion and gravity seminarTorsion and gravity seminar
Torsion and gravity seminar
 
Origin of Universe (Twin)
Origin of Universe (Twin)Origin of Universe (Twin)
Origin of Universe (Twin)
 
"Warm tachyon matter" - N. Bilic
"Warm tachyon matter" - N. Bilic"Warm tachyon matter" - N. Bilic
"Warm tachyon matter" - N. Bilic
 
Universe (Twin)
Universe (Twin)Universe (Twin)
Universe (Twin)
 
Kk graviton redo.july5,2012
Kk graviton redo.july5,2012Kk graviton redo.july5,2012
Kk graviton redo.july5,2012
 
Climate Extremes Workshop - Decompositions of Dependence for High-Dimensiona...
Climate Extremes Workshop -  Decompositions of Dependence for High-Dimensiona...Climate Extremes Workshop -  Decompositions of Dependence for High-Dimensiona...
Climate Extremes Workshop - Decompositions of Dependence for High-Dimensiona...
 
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
 
DL_FinalProposal
DL_FinalProposalDL_FinalProposal
DL_FinalProposal
 

Semelhante a Contextualization of topics - browsing through terms, authors, journals and cluster allocations

Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...JeremyHeyl
 
EP Thesis Defence 2016
EP Thesis Defence 2016EP Thesis Defence 2016
EP Thesis Defence 2016Elia Previtera
 
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...Amit Bhattacharjee
 
Multi wavelenth observations and surveys of galaxy clusters
Multi wavelenth observations and surveys of galaxy clustersMulti wavelenth observations and surveys of galaxy clusters
Multi wavelenth observations and surveys of galaxy clustersJoana Santos
 
UNF Undergrad Physics
UNF Undergrad PhysicsUNF Undergrad Physics
UNF Undergrad PhysicsNick Kypreos
 
Simulations of Strong Lensing
Simulations of Strong LensingSimulations of Strong Lensing
Simulations of Strong LensingNan Li
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...KAMAL CHOUDHARY
 
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...antjjournal
 
Comparative study of results obtained by analysis of structures using ANSYS, ...
Comparative study of results obtained by analysis of structures using ANSYS, ...Comparative study of results obtained by analysis of structures using ANSYS, ...
Comparative study of results obtained by analysis of structures using ANSYS, ...IOSR Journals
 
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...James D.B. Wang, PhD
 
Introduction to Hadron Structure from Lattice QCD
Introduction to Hadron Structure from Lattice QCDIntroduction to Hadron Structure from Lattice QCD
Introduction to Hadron Structure from Lattice QCDChristos Kallidonis
 
The distribution and_annihilation_of_dark_matter_around_black_holes
The distribution and_annihilation_of_dark_matter_around_black_holesThe distribution and_annihilation_of_dark_matter_around_black_holes
The distribution and_annihilation_of_dark_matter_around_black_holesSérgio Sacani
 
Group Cohomology of the Poincare Group and Invariant States
Group Cohomology of the Poincare Group and Invariant States Group Cohomology of the Poincare Group and Invariant States
Group Cohomology of the Poincare Group and Invariant States James Moffat
 
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"SEENET-MTP
 
Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...Alexander Decker
 

Semelhante a Contextualization of topics - browsing through terms, authors, journals and cluster allocations (20)

Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
 
EP Thesis Defence 2016
EP Thesis Defence 2016EP Thesis Defence 2016
EP Thesis Defence 2016
 
Ijetr042201
Ijetr042201Ijetr042201
Ijetr042201
 
Nucleating Nematic Droplets
Nucleating Nematic DropletsNucleating Nematic Droplets
Nucleating Nematic Droplets
 
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
 
My Prize Winning Physics Poster from 2006
My Prize Winning Physics Poster from 2006My Prize Winning Physics Poster from 2006
My Prize Winning Physics Poster from 2006
 
Multi wavelenth observations and surveys of galaxy clusters
Multi wavelenth observations and surveys of galaxy clustersMulti wavelenth observations and surveys of galaxy clusters
Multi wavelenth observations and surveys of galaxy clusters
 
Stellar hydro
Stellar hydroStellar hydro
Stellar hydro
 
UNF Undergrad Physics
UNF Undergrad PhysicsUNF Undergrad Physics
UNF Undergrad Physics
 
Simulations of Strong Lensing
Simulations of Strong LensingSimulations of Strong Lensing
Simulations of Strong Lensing
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
 
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
 
Comparative study of results obtained by analysis of structures using ANSYS, ...
Comparative study of results obtained by analysis of structures using ANSYS, ...Comparative study of results obtained by analysis of structures using ANSYS, ...
Comparative study of results obtained by analysis of structures using ANSYS, ...
 
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
 
Introduction to Hadron Structure from Lattice QCD
Introduction to Hadron Structure from Lattice QCDIntroduction to Hadron Structure from Lattice QCD
Introduction to Hadron Structure from Lattice QCD
 
The distribution and_annihilation_of_dark_matter_around_black_holes
The distribution and_annihilation_of_dark_matter_around_black_holesThe distribution and_annihilation_of_dark_matter_around_black_holes
The distribution and_annihilation_of_dark_matter_around_black_holes
 
Group Cohomology of the Poincare Group and Invariant States
Group Cohomology of the Poincare Group and Invariant States Group Cohomology of the Poincare Group and Invariant States
Group Cohomology of the Poincare Group and Invariant States
 
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
 
International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)
 
Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...
 

Mais de Shenghui Wang

Non-parametric Subject Prediction
Non-parametric Subject PredictionNon-parametric Subject Prediction
Non-parametric Subject PredictionShenghui Wang
 
Our journey with semantic embedding
Our journey with semantic embeddingOur journey with semantic embedding
Our journey with semantic embeddingShenghui Wang
 
Linking entities via semantic indexing
Linking entities via semantic indexingLinking entities via semantic indexing
Linking entities via semantic indexingShenghui Wang
 
Semantic indexing for KOS
Semantic indexing for KOSSemantic indexing for KOS
Semantic indexing for KOSShenghui Wang
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...Shenghui Wang
 
Learning Concept Mappings from Instance Similarity
Learning Concept Mappings from Instance SimilarityLearning Concept Mappings from Instance Similarity
Learning Concept Mappings from Instance SimilarityShenghui Wang
 
Measuring the dynamic bi-directional influence between content and social ne...
Measuring the dynamic bi-directional influence between content and social ne...Measuring the dynamic bi-directional influence between content and social ne...
Measuring the dynamic bi-directional influence between content and social ne...Shenghui Wang
 
Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Shenghui Wang
 
What is concept dirft and how to measure it?
What is concept dirft and how to measure it?What is concept dirft and how to measure it?
What is concept dirft and how to measure it?Shenghui Wang
 
Study concept drift in political ontologies
Study concept drift in political ontologiesStudy concept drift in political ontologies
Study concept drift in political ontologiesShenghui Wang
 

Mais de Shenghui Wang (13)

Non-parametric Subject Prediction
Non-parametric Subject PredictionNon-parametric Subject Prediction
Non-parametric Subject Prediction
 
Our journey with semantic embedding
Our journey with semantic embeddingOur journey with semantic embedding
Our journey with semantic embedding
 
Linking entities via semantic indexing
Linking entities via semantic indexingLinking entities via semantic indexing
Linking entities via semantic indexing
 
Semantic indexing for KOS
Semantic indexing for KOSSemantic indexing for KOS
Semantic indexing for KOS
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...
 
Learning Concept Mappings from Instance Similarity
Learning Concept Mappings from Instance SimilarityLearning Concept Mappings from Instance Similarity
Learning Concept Mappings from Instance Similarity
 
Measuring the dynamic bi-directional influence between content and social ne...
Measuring the dynamic bi-directional influence between content and social ne...Measuring the dynamic bi-directional influence between content and social ne...
Measuring the dynamic bi-directional influence between content and social ne...
 
Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning
 
What is concept dirft and how to measure it?
What is concept dirft and how to measure it?What is concept dirft and how to measure it?
What is concept dirft and how to measure it?
 
ICA Slides
ICA SlidesICA Slides
ICA Slides
 
ECCS 2010
ECCS 2010ECCS 2010
ECCS 2010
 
Study concept drift in political ontologies
Study concept drift in political ontologiesStudy concept drift in political ontologies
Study concept drift in political ontologies
 

Último

Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 

Último (20)

Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 

Contextualization of topics - browsing through terms, authors, journals and cluster allocations

  • 1. Introduction Method Results Summary Contextualization of Topics browsing through terms, authors, journals and cluster allocations Rob Koopman1 Shenghui Wang1 Andrea Scharnhorst2 1OCLC Research 2DANS-KNAW ISSI 2015
  • 2. Introduction Method Results Summary Introduction What are essence and boundary of a scientific field? Different ways to find clusters in scientific literature based on connectivity in terms of authorship, citations, language similarity, etc. Ambiguous nature in science
  • 3. Introduction Method Results Summary Ariadne: interactive context explorer Ariadne is an interactive interface which allows users to explore the context of entities such as authors, journals, topical terms, etc. It builds on semantic indexing statistically computed from a large scale bibliographic corpus It was originally implemented to explore 1M topical terms, 3M authors, 35K journals and 700+ Dewey decimal classes associated with 65M articles.
  • 4. Introduction Method Results Summary Research questions Q1: How does the Ariadne algorithm work on a much smaller, field specific dataset? Q2: Can we use Ariadne to label the clusters produce by the different methods? Q3: Can we use Ariadne to compare different clustering solutions?
  • 5. Introduction Method Results Summary LittleAriadne LittleAriadne: context explorer over Astrophysics data Offline: generates a semantic representation for each entity Online: finds the most related entities and using multidimensional scaling to display
  • 6. Introduction Method Results Summary LittleAriadne An example article Article ID ISI:000276828000006 Title On the Mass Transfer Rate in SS Cyg Abstract The mass transfer rate in SS Cyg at quiescence, estimated from the observed luminosity of the hot spot, is log M-tr = 16.8 +/- 0.3. This is safely below the critical mass transfer rates of log M-crit = 18.1 (corresponding to log T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to the “revised” value of log T-crit(0) = 3.65). The mass transfer rate during outbursts is strongly enhanced Author [author:smak j] ISSN [issn:0001-5237] Subject [subject:accretion, accretion disks] [subject:cataclysmic variables] [subject:disc instability model] [subject:dwarf novae] [subject:novae, cataclysmic variables] [subject:outbursts] [subject:parameters] [subject:stars] [subject:stars dwarf novae] [subject:stars individual ss cyg] [subject:state] [subject: superoutbursts]
  • 7. Introduction Method Results Summary LittleAriadne An example article Article ID ISI:000276828000006 Title On the Mass Transfer Rate in SS Cyg Abstract The mass transfer rate in SS Cyg at quiescence, estimated from the observed luminosity of the hot spot, is log M-tr = 16.8 +/- 0.3. This is safely below the critical mass transfer rates of log M-crit = 18.1 (corresponding to log T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to the “revised” value of log T-crit(0) = 3.65). The mass transfer rate during outbursts is strongly enhanced Author [author:smak j] ISSN [issn:0001-5237] Subject [subject:accretion, accretion disks] [subject:cataclysmic variables] [subject:disc instability model] [subject:dwarf novae] [subject:novae, cataclysmic variables] [subject:outbursts] [subject:parameters] [subject:stars] [subject:stars dwarf novae] [subject:stars individual ss cyg] [subject:state] [subject: superoutbursts] Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15] [cluster:d 51] [cluster:e 17] [cluster:f 1]
  • 8. Introduction Method Results Summary LittleAriadne Six different clustering solutions x Source y=#Cluster #Cluster in LittleAriadne a cwts 1.8 23 23 b UMSI 23 23 c oclc 20 20 20 d hu 139 48 e sts 5664 229 f ECOOM 15 15
  • 9. Introduction Method Results Summary LittleAriadne Entities in the Astrophysis dataset There are in total 90,343 entities associated with 111,616 astrophysics articles 59 journals 27,027 author names (no disambiguation applied) 39,577 topical terms 23,322 subjects (extracted from ”Author Keywords” and ”Keywords Plus”) 358 cluster labels (source + cluster id)
  • 10. Introduction Method Results Summary LittleAriadne Build semantic representation Basic assumptions Entities can be represented by its context Entities which share more context are more likely to be related Context is the textual environment where an entity occurs
  • 11. Introduction Method Results Summary LittleAriadne An example article Article ID ISI:000276828000006 Title On the Mass Transfer Rate in SS Cyg Abstract The mass transfer rate in SS Cyg at quiescence, estimated from the observed luminosity of the hot spot, is log M-tr = 16.8 +/- 0.3. This is safely below the critical mass transfer rates of log M-crit = 18.1 (corresponding to log T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to the “revised” value of log T-crit(0) = 3.65). The mass transfer rate during outbursts is strongly enhanced Author [author:smak j] ISSN [issn:0001-5237] Subject [subject:accretion, accretion disks] [subject:cataclysmic variables] [subject:disc instability model] [subject:dwarf novae] [subject:novae, cataclysmic variables] [subject:outbursts] [subject:parameters] [subject:stars] [subject:stars dwarf novae] [subject:stars individual ss cyg] [subject:state] [subject: superoutbursts] Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15] [cluster:d 51] [cluster:e 17] [cluster:f 1]
  • 12. Introduction Method Results Summary LittleAriadne Dimension reduction using Random Projection mass transfer rate [subject:outburst] [subject:sstars] [subject:parameters] [author:smak j] [cluster: a19] [issn:0001-5237]
  • 13. Introduction Method Results Summary LittleAriadne Dimension reduction using Random Projection mass transfer rate [subject:outburst] [subject:sstars] [subject:parameters] [author:smak j] [cluster: a19] [issn:0001-5237]
  • 14. Introduction Method Results Summary LittleAriadne From semantic representation to visualisation and more Each entity has its semantic representation Cosine similarity between entities can be computed very fast, based on which the 2D visualisation is implemented
  • 15. Introduction Method Results Summary LittleAriadne From semantic representation to visualisation and more Each entity has its semantic representation Cosine similarity between entities can be computed very fast, based on which the 2D visualisation is implemented For each article, we collected the semantic representation of all the entities in which it involves, and take an average as its semantic representation We applied a standard K-means clustering method to cluster these articles based on their semantic representations
  • 16. Introduction Method Results Summary Experiment 1: Exploring context Experiment 1: Exploring context Now we can explore Let’s start with stars An overview of all journals
  • 17. Introduction Method Results Summary Experiment 1: Exploring context Contextual view of stars
  • 18. Introduction Method Results Summary Experiment 2: Labelling clusters Experiment 2: Labelling clusters What is cluster a 2?
  • 19. Introduction Method Results Summary Experiment 2: Labelling clusters Experiment 2: Labelling clusters
  • 20. Introduction Method Results Summary Experiment 2: Labelling clusters Experiment 2: Labelling clusters Cluster ID Top 9 most related topical terms a 2 ”cosmology” ”dark energy” ”density perturbations” ”cosmologies” ”planck” ”cosmological” ”spatial curvature” ”inflationary” ”inflation” b 2 ”cosmology” ”cosmological constant” ”cosmologies” ”cosmological” ”universes” ”dark energy” ”quadratic” ”tensor” ”planck” c 17 ”power spectrum” ”cosmological parameters” ”cmb” ”last scattering” ”anisotropies” ”microwave background” ”power spectra” ”planck” ”cosmic microwave” d 28 ”density perturbations” ”inflationary” ”inflation” ”dark energy” ”scale invariant” ”spatial curvature” ”cosmological perturbations” ”inflationary models” ”cosmologies”
  • 21. Introduction Method Results Summary Experiment 3: Comparing clustering solutions Experiment 3: Comparing clustering solutions Cluster labels are treated as entities Let’s compare
  • 22. Introduction Method Results Summary Experiment 3: Comparing clustering solutions Highly similar clustering solutions
  • 23. Introduction Method Results Summary Experiment 3: Comparing clustering solutions Partially agreeing clustering solutions
  • 24. Introduction Method Results Summary Experiment 3: Comparing clustering solutions An overview of all clustering solutions
  • 25. Introduction Method Results Summary Summary Summary We present a method and an interface that allows visual exploration through the contexts of entities We can provide the most related topical terms to clusters although expert knowledge is needed to transform them into real labels/topics LittleAriadne provides a visual way of comparing different clustering solutions Our na¨ıve way of clustering is worth exploring further
  • 26. Introduction Method Results Summary Future extensions Future extensions Add more types of entities, such as citations, publishers, conferences, etc, to provide richer context Add direct links to articles to answer information retrieval needs Study context sensitivity compare ”young” and ”young”
  • 27. Introduction Method Results Summary Thank you Thank you http://thoth.pica.nl/astro/relate Rob Koopman (rob.koopman@oclc.org) Shenghui Wang (shenghui.wang@oclc.org) Andrea Scharnhorst (andrea.scharnhorst@dans.knaw.nl)