Department of Information Engineering
University of Padua, Italy
Gianmaria Silvello

@giansilv
Reproducibility for IR Evaluation
slideReproducibility for IR EvaluationG. Silvello
IR Evaluation Initiatives
Evaluation in IR is often conducted in large, shared,
international campaigns
FIRE
IR Evaluation Initiatives
[Figure: the workflow of an evaluation campaign. Stages: Preparation of Documents, Creation of Topics, Experiment Submission, Creation of Pools, Relevance Assessment, Performance Measures, Statistical Analyses, Scientific Production. Roles attached to the stages: Organizer, Assessor, Participant, Visitor. Outcome progression: Data, Information, Knowledge, Wisdom.]
We have shared experimental collections and we perform statistical validation.
But are we done?
Multiple targets for reproducibility:
- experimental collections
- system runs
- meta-evaluation studies
The Format Babel
This situation hampers:
- automatic management
- interpretability
- reproducibility
- ease of (re-)use
- take-up by newcomers
<topic number="6" type="ambiguous">
  <query>
    kcs
  </query>
  <description>
    Find information on the Kansas City Southern railroad.
  </description>
  <subtopic number="1" type="nav">
    Find the homepage for the Kansas City Southern railroad.
  </subtopic>
  <subtopic number="2" type="inf">
    I'm looking for a job with the Kansas City Southern railroad.
  </subtopic>
  <subtopic number="3" type="nav">
    Find the homepage for Kanawha County Schools in West Virginia.
  </subtopic>
  <subtopic number="4" type="nav">
    Find the homepage for the Knox County School system in Tennessee.
  </subtopic>
  <subtopic number="5" type="inf">
    Find information on KCS Energy, Inc., and their merger with Petrohawk Energy Corporation.
  </subtopic>
</topic>

<session num="1" starttime="08:59:47.258675">
  <topic>
    <title>
      peacecorp
    </title>
    <desc>
      Find information about the peace corp
    </desc>
    <narr>
      When was it started and by whom? What services does it provide and where does it
    </narr>
  </topic>
  <interaction num="1" starttime="09:00:04.155323">
    <query>
      peace corp
    </query>
    <results>
      <result rank="1">
        <url>
          http://www.peacecorps.gov/
        </url>
        <clueweb09id>
          clueweb09-en0011-60-08003
        </clueweb09id>
        <title>
          Peace Corps
        </title>
        <snippet>
          Fighting hunger, disease, poverty, and lack of opportunity.
        </snippet>
      </result>
      ...
    </results>
    <clicked>
      <click num="1" starttime="09:00:09.943356" endtime="09:01:13.434255">
        <rank>
        [...]

<top>

<num> Number: 303
<title> Hubble Telescope Achievements

<desc> Description:
Identify positive accomplishments of the Hubble telescope since it
was launched in 1991.

<narr> Narrative:
Documents are relevant that show the Hubble telescope has produced
new data, better quality data than previously available, data that
has increased human knowledge of the universe, or data that has led
to disproving previously existing theories or hypotheses.  Documents
limited to the shortcomings of the telescope would be irrelevant.
Details of repairs or modifications to the telescope without
reference to positive achievements would not be relevant.

</top>
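The topic examples above use different syntaxes for the same kind of object. As a rough illustration of what mapping them onto one shared representation could look like, here is a minimal Python sketch. The `Topic` class and both parser functions are hypothetical names invented for this example; they are not part of any campaign tooling, and the sketch only covers the XML topic with subtopics and the classic `<top>` block.

```python
import re
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class Topic:
    """Hypothetical common representation of a topic (not a standard)."""
    topic_id: str
    title: str
    description: str
    subtopics: list = field(default_factory=list)


def parse_xml_topic(xml_text: str) -> Topic:
    """Parse a well-formed <topic> element with <query>, <description>, <subtopic>."""
    root = ET.fromstring(xml_text)
    return Topic(
        topic_id=root.get("number", ""),
        title=root.findtext("query", default="").strip(),
        description=root.findtext("description", default="").strip(),
        subtopics=[(s.text or "").strip() for s in root.findall("subtopic")],
    )


def parse_classic_topic(text: str) -> Topic:
    """Parse a <top> block whose field tags are opened but never closed."""

    def grab(tag: str, label: str) -> str:
        # Capture the text after "<tag> label" up to the next tag or end of input.
        match = re.search(rf"<{tag}>\s*{label}\s*(.*?)\s*(?=<|\Z)", text, re.S)
        # Collapse internal line breaks into single spaces.
        return " ".join(match.group(1).split()) if match else ""

    return Topic(
        topic_id=grab("num", "Number:"),
        title=grab("title", ""),
        description=grab("desc", "Description:"),
    )
```

Both functions return the same `Topic` shape, so downstream code does not need to know which file format a topic came from.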
303 0 APW19980609.1531 2
303 0 APW19980610.1778 1
303 0 APW19980715.1061 2
303 0 APW19980910.1078 0

1 0 clueweb09-en0120-13-20479 0
1 1 clueweb09-en0120-13-20479 0
1 2 clueweb09-en0120-13-20479 0

101 0 clueweb09-en0047-33-20039 1
101 0 clueweb09-en0004-66-09322 2
101 0 clueweb09-en0033-30-08382 0
101 0 clueweb09-en0000-45-05740 -2
101 0 clueweb09-en0020-92-11795 1

20002 0 clueweb09-en0006-85-33170 1 1 10.5
20004 0 clueweb09-en0005-28-20976 1 1 10.5
20006 0 clueweb09-en0010-07-21538 1 1 10.5

Relevance judgment formats shown: ad-hoc, diversity, ad-hoc with grades, relevance feedback
We need:
- to agree on a common data model which allows for extension
- to provide the basic experimental data with proper metadata (descriptive, administrative, copyright, ...)
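To make the idea of a common data model concrete, the following Python sketch reads two of the four-column judgment layouts into one record type. It is only an illustration: the `Judgment` class and the layout names `"adhoc"` and `"diversity"` are assumptions made for this example, and the longer relevance feedback lines are deliberately left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Judgment:
    """Hypothetical common record for one relevance judgment."""
    topic_id: str
    doc_id: str
    relevance: int
    subtopic: Optional[str] = None


def parse_judgment(line: str, layout: str) -> Judgment:
    """Parse one whitespace-separated judgment line.

    Layout names are invented for this sketch:
      "adhoc"     -> topic, unused column, document, relevance
      "diversity" -> topic, subtopic, document, relevance
    """
    if layout not in ("adhoc", "diversity"):
        raise ValueError(f"unknown layout: {layout!r}")
    fields = line.split()
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}: {line!r}")
    topic_id, second, doc_id, relevance = fields
    # Only the diversity layout gives the second column a meaning.
    subtopic = second if layout == "diversity" else None
    return Judgment(topic_id, doc_id, int(relevance), subtopic)
```

Keeping the layout as an explicit argument is a design choice of the sketch: the four-column files look identical, so the meaning of the second column cannot be guessed from the line alone.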
Referenceability and Traceability
- Explanations of experimental data are usually reported in scientific papers that do not provide direct links to the data
- The data may be referred to in many different ways within the same paper (experiment id, system version, participant id, …)
- It is often difficult to know exactly which data have been used in a paper and to get access to them
- It is even more difficult to know exactly which data cleaning and processing operations were performed
[Ferro, 2016]
We need:
- to have the possibility of citing experimental data in our papers like any other reference, and to link the data with the claims in the papers
- to make our papers actionable and executable by providing access to the mentioned experimental data
[Ferro, 2016]
The DIRECT Experience
[Figure: areas shown in the DIRECT diagram: Resource Management, Metadata, Experimental Collection, Evaluation Activity, Experiment, Measurement, Bibliographical, Visual Analytics.]
http://direct.dei.unipd.it/
http://lod-direct.dei.unipd.it/
[Agosti et al., 2012]
LOD DIRECT
[Figure: example RDF graph from LOD DIRECT. Labelled nodes: Jussi Karlgren; the contribution CLEF2012wn-RepLab-KarlgrenEtAl2012 with ims:title "Profiling Reputation of Corporate Entities in Semantic Space"; the concepts Reputation Management and Information Retrieval; profiling_kthgavagai_1; RepLab 2012; CLEF 2012; and a Measure (Effectiveness, Accuracy) with ims:score 0.77. Link nodes (ims:has-source, ims:has-target, ims:relation) carry ims:score / ims:backward-score pairs: 0.46 / 0.84 (relation is-expert-in), 0.53 / 0.87 (relation feature), 0.42 / 0.23. owl:sameAs targets: dbpedia.org/resource/Reputation_management, dbpedia.org/resource/Information_retrieval, dblp.l3s.de/d2r/resource/publications/conf/clef/KarlgrenSOEH12, dblp.l3s.de/d2r/resource/authors/Jussi_Karlgren. Other properties shown: swrc:has-author, ims:refersTo, ims:submittedTo, ims:isPartOf, ims:evaluates, ims:isEvaluatedBy, ims:assignedTo, ims:measuredBy.]
[Silvello et al., 2016]
@prefix dc: <http://purl.org/dc/terms/> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix aktors: <http://www.aktors.org/ontology/portal#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix ims: <http://ims.dei.unipd.it/data/rdf/> .
@prefix bibo: <http://purl.org/ontology/bibo/> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix swrc: <http://swrc.ontoware.org/ontology#> .

<http://lod-direct.dei.unipd.it/user/Fredrik+Olsson;http://ims.dei.unipd.it/author/>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
    ims:identifier "Fredrik+Olsson" .

<http://lod-direct.dei.unipd.it/user/Fredrik+Espinoza;http://ims.dei.unipd.it/author/>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
    ims:identifier "Fredrik+Espinoza" .

<http://lod-direct.dei.unipd.it/user/Magnus+Sahlgren;http://ims.dei.unipd.it/author/>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
    ims:identifier "Magnus+Sahlgren" .

<http://lod-direct.dei.unipd.it/user/Jussi+Karlgren;http://ims.dei.unipd.it/author/>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
    ims:identifier "Jussi+Karlgren" .

<http://lod-direct.dei.unipd.it/user/Ola+Hamfors;http://ims.dei.unipd.it/author/>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
    ims:identifier "Ola+Hamfors" .

<http://lod-direct.dei.unipd.it/contribution/CLEF2012wn-RepLab-KarlgrenEt2012b>
    ims:contribution-type <http://lod-direct.dei.unipd.it/concept/Publication;http://www.aktors.org/ontology/portal%23> ;
    ims:copyrighted "false" ;
    ims:created "2013-05-19T17:01:05.644+02:00" ;
    ims:file-metadata _:b0 ;
    ims:last-modified "2013-05-19T17:01:05.644+02:00" ;
    ims:link "http://www.clef-initiative.eu/documents/71612/155385/CLEF2012wn-RepLab-KarlgrenEt2012b.pdf" ;
    ims:owner <http://lod-direct.dei.unipd.it/user/root;http://ims.dei.unipd.it/> ;
    ims:title "Profiling Reputation of Corporate Entities in Semantic Space" ;
    swrc:has-author <http://lod-direct.dei.unipd.it/user/Magnus+Sahlgren;http://ims.dei.unipd.it/author/> ,
        <http://lod-direct.dei.unipd.it/user/Jussi+Karlgren;http://ims.dei.unipd.it/author/> ,
        <http://lod-direct.dei.unipd.it/user/Fredrik+Espinoza;http://ims.dei.unipd.it/author/> ,
        <http://lod-direct.dei.unipd.it/user/Fredrik+Olsson;http://ims.dei.unipd.it/author/> ,
        <http://lod-direct.dei.unipd.it/user/Ola+Hamfors;http://ims.dei.unipd.it/author/> .

<http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/>
    ims:file-metadata _:b0 ;
    ims:identifier "http://ims.dei.unipd.it/" ;
    ims:prefix "e6fe2c43" .

<http://lod-direct.dei.unipd.it/user/root;http://ims.dei.unipd.it/>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/> ;
    ims:identifier "root" .

<http://lod-direct.dei.unipd.it/namespace/http://www.aktors.org/ontology/portal%23>
    ims:file-metadata _:b0 ;
    ims:identifier "http://www.aktors.org/ontology/portal%23" ;
    ims:prefix "37675fe1" .

_:b0 dc:created "2015-11-06T15:55:20.052+01:00" ;
    dc:creator "LOD DIRECT (Distributed Information Retrieval Evaluation Campaign Tool) - Version 3.10" ;
    dc:rights "Copyright (c) 2006-2015 - Information Management Systems (IMS) Research Group (http://ims.dei.unipd.it/) - Department of Information Engineering (http://www.dei.unipd.it/) - University of Padua (http://www.unipd.it/)" .

<http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/>
    ims:file-metadata _:b0 ;
    ims:identifier "http://ims.dei.unipd.it/author/" ;
    ims:prefix "9c5e2261" .

<http://lod-direct.dei.unipd.it/concept/Publication;http://www.aktors.org/ontology/portal%23>
    ims:file-metadata _:b0 ;
    ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://www.aktors.org/ontology/portal%23> ;
    ims:identifier "Publication" .

Sections of the listing: prefixes, authors, contribution, metadata
http://lod-direct.dei.unipd.it/contribution/CLEF2012wn-RepLab-KarlgrenEt2012b/
[Silvello et al., 2016]
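The resource URIs in the listing appear to follow one pattern: a resource kind, an identifier, a semicolon, and the namespace URI with `#` percent-encoded. The Python sketch below reproduces that pattern with the standard library. The pattern is inferred from the listing only, not taken from LOD DIRECT documentation, and `resource_uri` is a hypothetical helper name.

```python
from urllib.parse import quote, quote_plus

# Base address as it appears in the listing.
BASE = "http://lod-direct.dei.unipd.it"


def resource_uri(kind: str, identifier: str, namespace: str) -> str:
    """Build a URI of the form BASE/kind/identifier;namespace.

    Assumption: spaces in the identifier become "+", and the namespace
    keeps ":" and "/" while other reserved characters (such as "#")
    are percent-encoded.
    """
    return f"{BASE}/{kind}/{quote_plus(identifier)};{quote(namespace, safe=':/')}"


author = resource_uri("user", "Jussi Karlgren", "http://ims.dei.unipd.it/author/")
concept = resource_uri("concept", "Publication", "http://www.aktors.org/ontology/portal#")
```

Under these assumptions, `author` and `concept` match the corresponding subject URIs in the listing.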
Towards a Support for Run Reproducibility
Actionable Papers
<a href="http://direct.dei.unipd.it/user/UPV">UPV</a>
<a href="http://direct.dei.unipd.it/experiment/EXP_UKB_WN100">EXP_UKB_WN100</a>
<a href="http://direct.dei.unipd.it/estimate/017c333a-4b7c-4267-926d-f15fe3554efd">51.61%</a>
<img src="http://direct.dei.unipd.it/visualization/017c333a-4b7c-4267-926d-f15fe3554efd/snapshot/177bcef2-00a0-4f59-b781-f285610f1c6f"/>
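The links on this slide all wrap a value mentioned in a paper (a participant, an experiment, an estimate) in an anchor that points at the corresponding DIRECT resource. A minimal Python sketch of generating such markup follows. The URL layout is copied from the slide; the function name `actionable_link` and its parameters are assumptions made for illustration, not an interface offered by DIRECT.

```python
from html import escape
from urllib.parse import quote

# Base address as it appears in the slide examples.
DIRECT_BASE = "http://direct.dei.unipd.it"


def actionable_link(kind: str, identifier: str, label: str) -> str:
    """Return an HTML anchor that links a value in a paper to its resource."""
    url = f"{DIRECT_BASE}/{quote(kind)}/{quote(identifier)}"
    # Escape both the attribute value and the visible label for safe HTML.
    return f'<a href="{escape(url, quote=True)}">{escape(label)}</a>'


link = actionable_link("experiment", "EXP_UKB_WN100", "EXP_UKB_WN100")
```

The same helper could wrap a reported score, for example `actionable_link("estimate", "017c333a-4b7c-4267-926d-f15fe3554efd", "51.61%")`.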
Reproducibility is tied to data citation
Being able to uniquely identify data (e.g., DOI, URI) is fundamental, but it is not enough
- We need to:
  - automatically generate pertinent, consistent and complete human- and machine-readable citation snippets
  - define tools that make data citation easy: click, generate, copy and paste
  - develop citation systems that require little or no effort from data creators/curators and little or no modification to the data being cited
  - make data citations persistent
[Silvello&Ferro, 2016]
Data Citation is a Computational Problem
The four main computational issues of data citation [Buneman, Davidson, Frew, 2016]:
- Identity
- Completeness
- Fixity
- Validity
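Of the four issues, fixity is the easiest to illustrate in a few lines. One common way to check that cited data have not changed is to store a cryptographic digest alongside the identifier. The Python sketch below shows that idea only; it is an assumption of this example, not the approach described by the cited paper, and the function names and the identifier in the usage line are hypothetical.

```python
import hashlib


def cite_with_digest(identifier: str, data: bytes) -> dict:
    """Record an identifier together with a SHA-256 digest of the cited bytes."""
    return {"identifier": identifier, "sha256": hashlib.sha256(data).hexdigest()}


def is_unchanged(citation: dict, data: bytes) -> bool:
    """Check whether the bytes retrieved now match the digest stored at citation time."""
    return hashlib.sha256(data).hexdigest() == citation["sha256"]


# Hypothetical identifier and a one-line data sample.
citation = cite_with_digest("example:judgments-303", b"303 0 APW19980609.1531 2\n")
```

A digest only detects change. Keeping the cited version retrievable is a separate concern that this sketch does not address.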
Towards a General Data Citation System
The identity+completeness issues
To identify and generate a citation for a single resource
Rule-based system for hierarchical data [Buneman&Silvello, 2010]
<iuphar>
  <name>IUPHAR-DB </name>
  <citation>Rule0</citation>
  [...]
  <gpcr>
    <name>G protein-coupled receptors</name>
    <citation>Rule1</citation>
    [...]
    <family>
      <id>29</id>
      <name>Glucagon receptor family</name>
      <citation>Rule2</citation>
      <receptor>
        <id>247</id>
        <name>GHRH</name>
        [...]
        <agonists>
          <ligand>
            [...]
          </ligand>
        </agonists>
        [...]
      </receptor>
      [...]
    </family>
    [...]
  </gpcr>
  <ionchannels>
    [...]
  </ionchannels>
</iuphar>
Rules:
- The first rule interpreted by the system: iuphar[name=$.d,url=$.u, version=$.v]
- The second rule interpreted by the system: iuphar[]/gpcr[name=$.n]
- The third rule interpreted by the system: iuphar[]/gpcr[]/family[name=$.f,id=$.i]/contributors[]/contributor[name=$?c]
Instantiation of the variables:
{database=$d, version=$v, contributors=$c, db-family=$n, family=$f, idFamily=$i}
The rules are recursively processed by the system and then transformed into a conjunction of XPaths. The interpretation of the XPaths generates the citation.
The citation that gets generated (example):
{ database=IUPHAR-DB: the IUPHAR database || url=http://www.iuphar-db.org/ || version=15 || dbFamily=G protein-coupled receptors || family=Glucagon receptor family || idFamily=29 || contributor={Laurence J. Miller;;Daniel J. Drucker;;[...];;Rebecca Hills;;}}
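The idea of attaching rules to levels of a hierarchy and collecting their values into one citation can be sketched with Python's standard XML library. This is a simplified stand-in, not the rule language of the cited system: each rule here is just a path plus a mapping from citation keys to child elements, and the sample document is trimmed and adds a `<version>` element that the slide does not show.

```python
import xml.etree.ElementTree as ET

# Trimmed sample; the <version> element is an assumption of this sketch.
SAMPLE = """<iuphar>
  <name>IUPHAR-DB</name>
  <version>15</version>
  <gpcr>
    <name>G protein-coupled receptors</name>
    <family>
      <id>29</id>
      <name>Glucagon receptor family</name>
    </family>
  </gpcr>
</iuphar>"""

# Each rule: (path from the root, {citation key: child element to read}).
RULES = [
    (".", {"database": "name", "version": "version"}),
    ("gpcr", {"dbFamily": "name"}),
    ("gpcr/family", {"family": "name", "idFamily": "id"}),
]


def generate_citation(root: ET.Element, rules: list) -> dict:
    """Walk the rules from the root downwards and collect the bound values."""
    citation = {}
    for path, bindings in rules:
        node = root if path == "." else root.find(path)
        if node is None:
            continue  # the rule does not apply to this document
        for key, child in bindings.items():
            value = node.findtext(child)
            if value is not None:
                citation[key] = value.strip()
    return citation


citation = generate_citation(ET.fromstring(SAMPLE), RULES)
```

The resulting dictionary plays the role of the machine-readable citation; a real system would also need to select which family or receptor is being cited, which this sketch skips by always taking the first match.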
Towards a General Data Citation System
The identity+completeness issues
To identify and generate a citation for a single resource
Learning to cite framework for hierarchical data [Silvello, 2016]
[Figure: the learning to cite framework, with six numbered steps. Training data: a collection of XML files with human-readable citations. Components: Learner, Citation Model, Citation System. Test data: an XML file and a citation XPath. Output reference: a machine-readable citation and a human-readable citation.]
Towards a General Data Citation System
The identity+completeness issues (+ fixity)
To identify and generate a citation for a single resource
View+rule based system for RDF datasets [Alawini, Chen, Davidson & Silvello, in preparation]
[Figure: an RDF graph with nodes e1 to e10 connected by properties px, py and pz. Labels: resource to be cited: e1; check type; VSW(e1); citation query parametrized by e1; CSW(e1,s,v,d,t,o,u); Citation Function. Components: RDF Citation Model, eagle-i id, Citation Formatter, eagle-i triple store, eagle-iV versioning system. Outputs: machine-readable citation (JSON), human-readable citation.]
Final citation:
{eagle-id: "eagle-id: e1",
 name: "Significance Tester",
 developers: {"Grant, G.", "Lazar, M.l", "Manduchi, E."},
 url: "http://www.cbil.upenn.edu/STAR/"}
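The last step named on this slide, a formatter that turns one citation into a machine-readable (JSON) and a human-readable form, can be illustrated as follows. The field values are taken from the final citation on the slide; the two function names and the layout of the human-readable string are assumptions of this sketch, not the format produced by the eagle-i system.

```python
import json

# Values copied from the final citation shown on the slide.
EXAMPLE = {
    "eagle-id": "e1",
    "name": "Significance Tester",
    "developers": ["Grant, G.", "Lazar, M.l", "Manduchi, E."],
    "url": "http://www.cbil.upenn.edu/STAR/",
}


def machine_readable(citation: dict) -> str:
    """Serialize the citation as JSON with a stable key order."""
    return json.dumps(citation, sort_keys=True)


def human_readable(citation: dict) -> str:
    """Render the citation as one line of text (layout invented for this sketch)."""
    developers = "; ".join(citation["developers"])
    return (
        f'{citation["name"]}, developed by {developers} '
        f'({citation["url"]}, id {citation["eagle-id"]})'
    )
```

Deriving both forms from the same dictionary keeps them consistent with each other, which is the point of having a single citation function upstream of the formatter.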
Towards a General Data Citation System
The identity+completeness issues
To identify and generate a citation for multiple resources
Named graphs for RDF subsets [Silvello 2015]
[Figure: an RDF graph with nodes ex:systemA, ex:expA, ex:CLEF2009 and ex:measureA, properties ex:produce, ex:measure, ex:submitted-to, ex:name ("precision") and ex:value (0.70). Each triple is named ex:n1 to ex:n5, and the names are connected by schema:is-related-to.]

Original cited LOD subset
Subject       Property          Object         Name
ex:systemA    ex:produce        ex:expA        ex:n1
ex:expA       ex:measure        ex:measureA    ex:n2
ex:expA       ex:submitted-to   ex:CLEF2009    ex:n3
ex:measureA   ex:name           "precision"    ex:n4
ex:measureA   ex:value          "0.7"          ex:n5

Machine-readable citation meta-graph
Subject   Property                Object   Name
ex:n1     schema:is-related-to    ex:n2    ex:cit-sysA-CLEF2009
ex:n1     schema:is-related-to    ex:n3    ex:cit-sysA-CLEF2009
ex:n2     schema:is-related-to    ex:n4    ex:cit-sysA-CLEF2009
ex:n2     schema:is-related-to    ex:n5    ex:cit-sysA-CLEF2009

Copyright © 2015 Gianmaria Silvello
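The two tables can be reproduced with a few lines of Python. In this sketch each named triple is a tuple, and a meta-graph edge is emitted whenever the object of one triple is the subject of another. That linking rule is an inference from the example tables, not a statement of the published methodology, and the names `Quad` and `meta_graph` are hypothetical.

```python
from typing import NamedTuple


class Quad(NamedTuple):
    subject: str
    prop: str
    obj: str
    name: str


# The cited subset, copied from the first table.
CITED = [
    Quad("ex:systemA", "ex:produce", "ex:expA", "ex:n1"),
    Quad("ex:expA", "ex:measure", "ex:measureA", "ex:n2"),
    Quad("ex:expA", "ex:submitted-to", "ex:CLEF2009", "ex:n3"),
    Quad("ex:measureA", "ex:name", '"precision"', "ex:n4"),
    Quad("ex:measureA", "ex:value", '"0.7"', "ex:n5"),
]


def meta_graph(quads: list, citation_name: str) -> list:
    """Link the names of triples that share a node (object of one, subject of the other)."""
    meta = []
    for first in quads:
        for second in quads:
            if first.obj == second.subject:
                meta.append(
                    Quad(first.name, "schema:is-related-to", second.name, citation_name)
                )
    return meta


META = meta_graph(CITED, "ex:cit-sysA-CLEF2009")
```

For this input the function yields the four rows of the meta-graph table, in the same order.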
Towards a General Data Citation System
The identity+completeness issues
To identify and generate a citation for multiple resources
View-based model for relational databases [Davidson, Deutch, Milo, Silvello, 2017]
[Figure: the view-based citation model, with six numbered steps. Labels: Database D; Query Q; Database Views V; Specification Language; Citation Policies; Query Rewriting Function; rewritings q1, q2, ..., qn; Preference Model; Set of best rewritings; Citation Views; Citation Queries CQ (c1, c2, ..., cm); Citation Function; Aggregation Function; Citation C; Citation Checking Mechanism.]
Conclusions
- Reproducibility is a fundamental topic for science
- Information retrieval evaluation is a challenging domain
- Data Citation is a complex and open problem
  - new models of citations
  - computational solutions
  - intrinsically related to reproducibility
References
[Agosti et al., 2012] Agosti, M., Di Buccio, E., Ferro, N., Masiero, I., Peruzzo, S., and Silvello, G. (2012).
DIRECTions: Design and Specification of an IR Evaluation Infrastructure. In Proceedings of the Third International
Conference of the CLEF Initiative (CLEF 2012). LNCS 7488, Springer, Heidelberg, Germany.
[Buneman et al., 2016] Buneman, P., Davidson, S. B., and Frew, J. (2016). Why data citation is a computational
problem. Communications of the ACM (CACM), 59(9):50–57.
[Buneman and Silvello, 2010] Buneman, P. and Silvello, G. (2010). A Rule-Based Citation System for Structured
and Evolving Datasets. IEEE Data Eng. Bull., 33(3):33–41.
[Davidson et al., 2017] Davidson, S. B., Deutch, D., Milo, T., and Silvello, G. (2017). A Model for Fine-Grained
Data Citation. In 8th Biennial Conference on Innovative Data Systems Research (CIDR 2017).
[Ferro, 2016] Ferro, N. (2016). Reproducibility Challenges in Information Retrieval Evaluation. ACM Journal of Data
and Information Quality (JDIQ), to appear.
[Silvello, 2015] Silvello, G. (2015). A Methodology for Citing Linked Open Data Subsets. D-Lib Magazine, 21(1/2).
[Silvello, 2016] Silvello, G. (2016). Learning to Cite Framework: How to Automatically Construct Citations for
Hierarchical Data. Journal of the American Society for Information Science and Technology (JASIST), in print:1–28.
[Silvello et al., 2016] Silvello, G., Bordea, G., Ferro, N., Buitelaar, P., and Bogers, T. (2016). Semantic
Representation and Enrichment of Information Retrieval Experimental Data. International Journal on Digital Libraries
(IJDL), in press:1–28.
[Silvello and Ferro, 2016] Silvello, G. and Ferro, N. (2016). "Data Citation is Coming". Introduction to the special
issue on data citation. Bulletin of IEEE Technical Committee on Digital Libraries, Special Issue on Data Citation,
12(1):1–5.
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

Reproducibility for IR evaluation

  • 1. Reproducibility for IR Evaluation. Gianmaria Silvello (@giansilv), Department of Information Engineering, University of Padua, Italy
  • 2. Reproducibility for IR Evaluation, G. Silvello. IR Evaluation Initiatives: Evaluation in IR is often conducted in large, shared, international campaigns (e.g., FIRE).
  • 3. IR Evaluation Initiatives. [Figure: evaluation campaign workflow. Roles: Organizer, Assessor, Participant, Visitor. Steps: Preparation of Documents, Creation of Topics, Experiment Submission, Creation of Pools, Relevance Assessment, Performance Measures, Scientific Production, Statistical Analyses. Scale: Data, Information, Knowledge, Wisdom.]
  • 4. IR Evaluation Initiatives. [Figure: evaluation campaign workflow, repeated.] We have shared experimental collections and we perform statistical validation. But are we done?
  • 5. IR Evaluation Initiatives. [Figure: evaluation campaign workflow, repeated.] Multiple targets for reproducibility: experimental collections, system runs, meta-evaluation studies.
  • 6. The Format Babele. This situation hampers:
       - automatic management
       - interpretability
       - reproducibility
       - ease of (re-)use
       - take-up by newcomers
     Three differently formatted topic files are shown on the slide.
     First format:
       <topic number="6" type="ambiguous">
       <query>
       kcs
       </query>
       <description>
       Find information on the Kansas City Southern railroad.
       </description>
       <subtopic number="1" type="nav">
       Find the homepage for the Kansas City Southern railroad.
       </subtopic>
       <subtopic number="2" type="inf">
       I'm looking for a job with the Kansas City Southern railroad.
       </subtopic>
       <subtopic number="3" type="nav">
       Find the homepage for Kanawha County Schools in West Virginia.
       </subtopic>
       <subtopic number="4" type="nav">
       Find the homepage for the Knox County School system in Tennessee.
       </subtopic>
       <subtopic number="5" type="inf">
       Find information on KCS Energy, Inc., and their merger with Petrohawk Energy Corporation.
       </subtopic>
       </topic>
     Second format (the excerpt is cut off on the slide):
       <session num="1" starttime="08:59:47.258675">
       <topic>
       <title>
       peacecorp
       </title>
       <desc>
       Find information about the peace corp
       </desc>
       <narr>
       When was it started and by whom? What services does it provide and where does it
       </narr>
       </topic>
       <interaction num="1" starttime="09:00:04.155323">
       <query>
       peace corp
       </query>
       <results>
       <result rank="1">
       <url>
       http://www.peacecorps.gov/
       </url>
       <clueweb09id>
       clueweb09-en0011-60-08003
       </clueweb09id>
       <title>
       Peace Corps
       </title>
       <snippet>
       Fighting hunger, disease, poverty, and lack of opportunity.
       </snippet>
       </result>
       ...
       </results>
       <clicked>
       <click num="1" starttime="09:00:09.943356" endtime="09:01:13.434255">
       <rank>
     Third format:
       <top>

       <num> Number: 303
       <title> Hubble Telescope Achievements

       <desc> Description:
       Identify positive accomplishments of the Hubble telescope since it
       was launched in 1991.

       <narr> Narrative:
       Documents are relevant that show the Hubble telescope has produced
       new data, better quality data than previously available, data that
       has increased human knowledge of the universe, or data that has led
       to disproving previously existing theories or hypotheses.  Documents
       limited to the shortcomings of the telescope would be irrelevant.
       Details of repairs or modifications to the telescope without
       reference to positive achievements would not be relevant.

       </top>
  • 7. The Format Babele. This situation hampers:
       - automatic management
       - interpretability
       - reproducibility
       - ease of (re-)use
       - take-up by newcomers
     Relevance judgment files in four layouts are shown on the slide.
     ad-hoc:
       303 0 APW19980609.1531 2
       303 0 APW19980610.1778 1
       303 0 APW19980715.1061 2
       303 0 APW19980910.1078 0
     diversity:
       1 0 clueweb09-en0120-13-20479 0
       1 1 clueweb09-en0120-13-20479 0
       1 2 clueweb09-en0120-13-20479 0
     ad-hoc with grades:
       101 0 clueweb09-en0047-33-20039 1
       101 0 clueweb09-en0004-66-09322 2
       101 0 clueweb09-en0033-30-08382 0
       101 0 clueweb09-en0000-45-05740 -2
       101 0 clueweb09-en0020-92-11795 1
     relevance feedback:
       20002 0 clueweb09-en0006-85-33170 1 1 10.5
       20004 0 clueweb09-en0005-28-20976 1 1 10.5
       20006 0 clueweb09-en0010-07-21538 1 1 10.5
  • 8. The Format Babele (continued). We need:
       - to agree on a common data model which allows for extension
       - to provide the basic experimental data with proper metadata (descriptive, administrative, copyright, ...)
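
A minimal sketch of what a common data model for relevance judgments could look like, assuming whitespace-separated judgment files like the samples on the previous slide. The `Judgment` record, the layout names and the `parse_qrels_line` helper are hypothetical illustrations, not part of DIRECT or of any evaluation campaign's tooling. Track-specific trailing columns are kept verbatim in `extra` rather than interpreted.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Judgment:
    """One relevance judgment in a hypothetical common model."""
    topic: str
    doc: str
    grade: int
    subtopic: Optional[str] = None  # only used by diversity judgments
    extra: Tuple[str, ...] = ()     # uninterpreted trailing columns


def parse_qrels_line(line: str, layout: str) -> Judgment:
    """Map one line of a judgment file onto the common record.

    layout "adhoc":     topic, iteration, document, grade [, extra...]
    layout "diversity": topic, subtopic,  document, grade
    """
    parts = line.split()
    if len(parts) < 4:
        raise ValueError(f"expected at least 4 columns, got {len(parts)}")
    if layout == "adhoc":
        topic, _iteration, doc, grade = parts[:4]
        return Judgment(topic, doc, int(grade), extra=tuple(parts[4:]))
    if layout == "diversity":
        topic, subtopic, doc, grade = parts[:4]
        return Judgment(topic, doc, int(grade), subtopic=subtopic)
    raise ValueError(f"unknown layout: {layout!r}")


if __name__ == "__main__":
    print(parse_qrels_line("303 0 APW19980609.1531 2", "adhoc"))
    print(parse_qrels_line("1 2 clueweb09-en0120-13-20479 0", "diversity"))
```

Once every layout maps onto one record type, descriptive or administrative metadata can be attached in a single place instead of once per file format.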
  • 9. Referenceability and Traceability [Ferro, 2016]
       - Explanations of experimental data are usually reported in scientific papers that do not provide direct links to the data
       - The data may be referred to in many different ways within the same paper (experiment id, system version, participant id, …)
       - It is often difficult to know exactly which data have been used in a paper and to get access to them
       - It is even more difficult to know exactly which data cleaning and processing operations were performed
  • 10. Referenceability and Traceability (continued) [Ferro, 2016]. We need:
       - to be able to cite experimental data in our papers like any other reference, and to link the data with the claims in the papers
       - to make our papers actionable and executable, providing access to the mentioned experimental data
  • 11. The DIRECT Experience [Agosti et al., 2012]. [Figure: DIRECT areas: Bibliographical, Experiment, Visual Analytics, Evaluation Activity, Experimental Collection, Resource Management, Measurement, Metadata.] http://direct.dei.unipd.it/ http://lod-direct.dei.unipd.it/
  • 12. LOD DIRECT [Silvello et al., 2016]. [Figure: RDF graph around the contribution CLEF2012wn-RepLab-KarlgrenEtAl2012 (ims:title "Profiling Reputation of Corporate Entities in Semantic Space"), its author Jussi Karlgren (swrc:has-author), the concepts Reputation Management and Information Retrieval, the experiment profiling_kthgavagai_1, RepLab 2012, CLEF 2012, and an Accuracy (Effectiveness) measure with ims:score 0.77. Link nodes carry ims:relation (is-expert-in, feature), ims:has-source, ims:has-target, ims:score and ims:backward-score (0.46/0.84, 0.53/0.87, 0.42/0.23). owl:sameAs edges point to DBpedia (Reputation_management, Information_retrieval) and DBLP resources. Further properties shown: ims:refersTo, ims:submittedTo, ims:isPartOf, ims:evaluates, ims:isEvaluatedBy, ims:assignedTo, ims:measuredBy.]
  • 13. LOD DIRECT [Silvello et al., 2016]. RDF (Turtle) served at http://lod-direct.dei.unipd.it/contribution/CLEF2012wn-RepLab-KarlgrenEt2012b/ with the parts labelled on the slide as prefixes, authors, contribution and metadata:

     @prefix dc: <http://purl.org/dc/terms/> .
     @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
     @prefix aktors: <http://www.aktors.org/ontology/portal#> .
     @prefix foaf: <http://xmlns.com/foaf/0.1/> .
     @prefix ims: <http://ims.dei.unipd.it/data/rdf/> .
     @prefix bibo: <http://purl.org/ontology/bibo/> .
     @prefix owl: <http://www.w3.org/2002/07/owl#> .
     @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
     @prefix swrc: <http://swrc.ontoware.org/ontology#> .

     <http://lod-direct.dei.unipd.it/user/Fredrik+Olsson;http://ims.dei.unipd.it/author/>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
         ims:identifier "Fredrik+Olsson" .

     <http://lod-direct.dei.unipd.it/user/Fredrik+Espinoza;http://ims.dei.unipd.it/author/>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
         ims:identifier "Fredrik+Espinoza" .

     <http://lod-direct.dei.unipd.it/user/Magnus+Sahlgren;http://ims.dei.unipd.it/author/>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
         ims:identifier "Magnus+Sahlgren" .

     <http://lod-direct.dei.unipd.it/user/Jussi+Karlgren;http://ims.dei.unipd.it/author/>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
         ims:identifier "Jussi+Karlgren" .

     <http://lod-direct.dei.unipd.it/user/Ola+Hamfors;http://ims.dei.unipd.it/author/>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/> ;
         ims:identifier "Ola+Hamfors" .

     <http://lod-direct.dei.unipd.it/contribution/CLEF2012wn-RepLab-KarlgrenEt2012b>
         ims:contribution-type <http://lod-direct.dei.unipd.it/concept/Publication;http://www.aktors.org/ontology/portal%23> ;
         ims:copyrighted "false" ;
         ims:created "2013-05-19T17:01:05.644+02:00" ;
         ims:file-metadata _:b0 ;
         ims:last-modified "2013-05-19T17:01:05.644+02:00" ;
         ims:link "http://www.clef-initiative.eu/documents/71612/155385/CLEF2012wn-RepLab-KarlgrenEt2012b.pdf" ;
         ims:owner <http://lod-direct.dei.unipd.it/user/root;http://ims.dei.unipd.it/> ;
         ims:title "Profiling Reputation of Corporate Entities in Semantic Space " ;
         swrc:has-author <http://lod-direct.dei.unipd.it/user/Magnus+Sahlgren;http://ims.dei.unipd.it/author/> ,
             <http://lod-direct.dei.unipd.it/user/Jussi+Karlgren;http://ims.dei.unipd.it/author/> ,
             <http://lod-direct.dei.unipd.it/user/Fredrik+Espinoza;http://ims.dei.unipd.it/author/> ,
             <http://lod-direct.dei.unipd.it/user/Fredrik+Olsson;http://ims.dei.unipd.it/author/> ,
             <http://lod-direct.dei.unipd.it/user/Ola+Hamfors;http://ims.dei.unipd.it/author/> .

     <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/>
         ims:file-metadata _:b0 ;
         ims:identifier "http://ims.dei.unipd.it/" ;
         ims:prefix "e6fe2c43" .

     <http://lod-direct.dei.unipd.it/user/root;http://ims.dei.unipd.it/>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/> ;
         ims:identifier "root" .

     <http://lod-direct.dei.unipd.it/namespace/http://www.aktors.org/ontology/portal%23>
         ims:file-metadata _:b0 ;
         ims:identifier "http://www.aktors.org/ontology/portal%23" ;
         ims:prefix "37675fe1" .

     _:b0
         dc:created "2015-11-06T15:55:20.052+01:00" ;
         dc:creator "LOD DIRECT (Distributed Information Retrieval Evaluation Campaign Tool) - Version 3.10" ;
         dc:rights "Copyright (c) 2006-2015 - Information Management Systems (IMS) Research Group (http://ims.dei.unipd.it/) - Department of Information Engineering (http://www.dei.unipd.it/) - University of Padua (http://www.unipd.it/)" .

     <http://lod-direct.dei.unipd.it/namespace/http://ims.dei.unipd.it/author/>
         ims:file-metadata _:b0 ;
         ims:identifier "http://ims.dei.unipd.it/author/" ;
         ims:prefix "9c5e2261" .

     <http://lod-direct.dei.unipd.it/concept/Publication;http://www.aktors.org/ontology/portal%23>
         ims:file-metadata _:b0 ;
         ims:has-namespace <http://lod-direct.dei.unipd.it/namespace/http://www.aktors.org/ontology/portal%23> ;
         ims:identifier "Publication" .
  • 14-15. Towards a Support for Run Reproducibility. [Figures only; no text on these slides.]
  • 16. Actionable Papers: <a href="http://direct.dei.unipd.it/user/UPV">UPV</a>
  • 17. Actionable Papers: <a href="http://direct.dei.unipd.it/experiment/EXP_UKB_WN100">EXP_UKB_WN100</a>
  • 18. Actionable Papers: <a href="http://direct.dei.unipd.it/estimate/017c333a-4b7c-4267-926d-f15fe3554efd">51.61%</a>
  • 19. Actionable Papers: <img src="http://direct.dei.unipd.it/visualization/017c333a-4b7c-4267-926d-f15fe3554efd/snapshot/177bcef2-00a0-4f59-b781-f285610f1c6f"/>
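
The links on the Actionable Papers slides follow a visible pattern: base URL, resource kind, identifier. As a rough sketch, a paper-authoring tool could generate such anchors as follows. The `actionable_link` helper is hypothetical, and the URL layout is inferred from the slide examples only, not from any documented DIRECT API.

```python
from html import escape
from urllib.parse import quote

# Base URL as it appears on the slides.
DIRECT_BASE = "http://direct.dei.unipd.it"


def actionable_link(kind: str, identifier: str, label: str) -> str:
    """Build an HTML anchor that points a mention in a paper at its data.

    kind is a resource type seen on the slides ("user", "experiment",
    "estimate"); identifier is the resource id; label is the visible text.
    """
    url = f"{DIRECT_BASE}/{quote(kind)}/{quote(identifier)}"
    return f'<a href="{escape(url, quote=True)}">{escape(label)}</a>'


if __name__ == "__main__":
    print(actionable_link("experiment", "EXP_UKB_WN100", "EXP_UKB_WN100"))
    print(actionable_link(
        "estimate", "017c333a-4b7c-4267-926d-f15fe3554efd", "51.61%"))
```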
  • 20. Reproducibility is tied to data citation [Silvello and Ferro, 2016]. Being able to uniquely identify data (e.g., DOI, URI) is fundamental, but it is not enough. We need to:
       - automatically generate pertinent, consistent and complete human- and machine-readable citation snippets
       - define tools that make data citation easy: click, generate, copy and paste
       - develop citation systems which require little or no effort from data creators/curators and little or no modification to the actual data being cited
       - make data citations persistent
  • 21. Data Citation is a Computational Problem [Buneman et al., 2016]. The four main computational issues of data citation:
       - Identity
       - Completeness
       - Fixity
       - Validity
  • 22. Towards a General Data Citation System [Buneman and Silvello, 2010]. The identity + completeness issues: to identify and generate a citation for a single resource. Rule-based system for hierarchical data.
     Sample data:
       <iuphar>
         <name>IUPHAR-DB </name>
         <citation>Rule0</citation>
         [...]
         <gpcr>
           <name>G protein-coupled receptors</name>
           <citation>Rule1</citation>
           [...]
           <family>
             <id>29</id>
             <name>Glucagon receptor family</name>
             <citation>Rule2</citation>
             <receptor>
               <id>247</id>
               <name>GHRH</name>
               [...]
               <agonists>
                 <ligand> [...] </ligand>
               </agonists>
               [...]
             </receptor>
             [...]
           </family>
           [...]
         </gpcr>
         <ionchannels> [...] </ionchannels>
       </iuphar>
     Rules (annotated on the slide as the first, second and third rule interpreted by the system, followed by the instantiation of the variables):
       iuphar[name=$.d,url=$.u, version=$.v]
       iuphar[]/gpcr[name=$.n]
       iuphar[]/gpcr[]/family[name=$.f,id=$.i]
         /contributors[]/contributor[name=$?c]
       {database=$d, version=$v, contributors=$c, db-family=$n, family=$f, idFamily=$i}
     The rules are recursively processed by the system and then transformed into a conjunction of XPaths. The interpretation of the XPaths generates the citation.
     The citation that gets generated (example):
       { database=IUPHAR-DB: the IUPHAR database || url=http://www.iuphar-db.org/ || version=15 || dbFamily=G protein-coupled receptors || family=Glucagon receptor family || idFamily=29 || contributor= {Laurence J. Miller;;Daniel J. Drucker;;[...];;Rebecca Hills;;}}
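
The idea of turning citation rules into path queries over hierarchical data can be sketched with Python's standard `xml.etree.ElementTree`. This is only an illustration, not the rule language or engine of [Buneman and Silvello, 2010]: the `RULES` table, the `cite_family` helper, and the `<version>` and `<contributors>` elements in the sample are assumptions added so the example is self-contained.

```python
import xml.etree.ElementTree as ET

# Hypothetical, cut-down sample modelled on the slide's IUPHAR-DB excerpt.
SAMPLE = """
<iuphar>
  <name>IUPHAR-DB</name>
  <version>15</version>
  <gpcr>
    <name>G protein-coupled receptors</name>
    <family>
      <id>29</id>
      <name>Glucagon receptor family</name>
      <contributors>
        <contributor><name>Laurence J. Miller</name></contributor>
        <contributor><name>Daniel J. Drucker</name></contributor>
      </contributors>
    </family>
  </gpcr>
</iuphar>
"""

# Each rule: (citation key, path template relative to the root, multi-valued?)
FAMILY = "gpcr/family[id='{family_id}']"
RULES = [
    ("database", "name", False),
    ("version", "version", False),
    ("dbFamily", "gpcr/name", False),
    ("family", FAMILY + "/name", False),
    ("contributor", FAMILY + "/contributors/contributor/name", True),
]


def cite_family(root: ET.Element, family_id: str) -> dict:
    """Evaluate every rule as a path query and collect the citation fields."""
    citation = {"idFamily": family_id}
    for key, template, many in RULES:
        path = template.format(family_id=family_id)
        values = [(el.text or "").strip() for el in root.findall(path)]
        if many:
            citation[key] = values
        else:
            citation[key] = values[0] if values else None
    return citation


if __name__ == "__main__":
    for key, value in cite_family(ET.fromstring(SAMPLE), "29").items():
        print(f"{key} = {value}")
```

Keeping the rules as data, separate from the evaluation code, mirrors the slide's point that the citation is produced by interpreting path expressions rather than by hand-written logic for each resource.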
  • 23. Towards a General Data Citation System [Silvello, 2016]. The identity + completeness issues: to identify and generate a citation for a single resource. Learning to cite framework for hierarchical data. [Figure: six-step pipeline. Training data (a collection of XML files with human-readable citations) feeds a Learner that produces a Citation Model; the Citation System applies the model to test data (an XML file and a citation XPath) and outputs a machine-readable and a human-readable citation reference.]
  • 24. Towards a General Data Citation System [Alawini, Chen, Davidson & Silvello, in preparation]. The identity + completeness issues (+ fixity): to identify and generate a citation for a single resource. View + rule based system for RDF datasets. [Figure: RDF graph with nodes e1 to e10 and edges labelled px, py, pz; for the resource to be cited (e1) the system checks its type, selects the view VSW(e1), runs the citation query parametrized by e1, CSW(e1,s,v,d,t,o,u), and applies the citation function. Components: eagle-i triple store, eagle-iV versioning system, RDF Citation Model, eagle-i id, Citation Formatter producing a machine-readable citation (JSON) and a human-readable citation.]
     Final citation:
       {eagle-id: "eagle-id: e1", name: "Significance Tester", developers: {"Grant, G.", "Lazar, M.l", "Manduchi, E."}, url: "http://www.cbil.upenn.edu/STAR/"}
  • 25. Towards a General Data Citation System [Silvello, 2015]. The identity + completeness issues: to identify and generate a citation for multiple resources. Named graphs for RDF subsets. [Figure: graph in which ex:systemA ex:produce ex:expA, ex:expA ex:submitted-to ex:CLEF2009, ex:expA ex:measure ex:measureA, and ex:measureA has ex:name "precision" and ex:value 0.70; each triple is placed in a named graph n1 to n5, and the named graphs are connected by schema:is-related-to. Copyright © 2015 Gianmaria Silvello.]
     Original cited LOD subset:
       Subject      Property         Object        Name
       ex:systemA   ex:produce       ex:expA       ex:n1
       ex:expA      ex:measure       ex:measureA   ex:n2
       ex:expA      ex:submitted-to  ex:CLEF2009   ex:n3
       ex:measureA  ex:name          "precision"   ex:n4
       ex:measureA  ex:value         "0.7"         ex:n5
     Machine-readable citation meta-graph:
       Subject  Property              Object  Name
       ex:n1    schema:is-related-to  ex:n2   ex:cit-sysA-CLEF2009
       ex:n1    schema:is-related-to  ex:n3   ex:cit-sysA-CLEF2009
       ex:n2    schema:is-related-to  ex:n4   ex:cit-sysA-CLEF2009
       ex:n2    schema:is-related-to  ex:n5   ex:cit-sysA-CLEF2009
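
To make the named-graph mechanism concrete, the two tables on this slide can be treated as lists of quads (subject, property, object, graph name). The sketch below, with hypothetical helper names `graphs_in_citation` and `resolve_citation`, shows how a citation identifier could be resolved back to the cited triples. It uses plain tuples rather than an RDF library, so it illustrates the idea only and is not the implementation of [Silvello, 2015].

```python
# Quads copied from the slide: (subject, property, object, graph name).
CITED_SUBSET = [
    ("ex:systemA", "ex:produce", "ex:expA", "ex:n1"),
    ("ex:expA", "ex:measure", "ex:measureA", "ex:n2"),
    ("ex:expA", "ex:submitted-to", "ex:CLEF2009", "ex:n3"),
    ("ex:measureA", "ex:name", "precision", "ex:n4"),
    ("ex:measureA", "ex:value", "0.7", "ex:n5"),
]

META_GRAPH = [
    ("ex:n1", "schema:is-related-to", "ex:n2", "ex:cit-sysA-CLEF2009"),
    ("ex:n1", "schema:is-related-to", "ex:n3", "ex:cit-sysA-CLEF2009"),
    ("ex:n2", "schema:is-related-to", "ex:n4", "ex:cit-sysA-CLEF2009"),
    ("ex:n2", "schema:is-related-to", "ex:n5", "ex:cit-sysA-CLEF2009"),
]


def graphs_in_citation(meta_graph, citation):
    """Return the names of all named graphs mentioned by one citation."""
    names = set()
    for subject, _prop, obj, graph in meta_graph:
        if graph == citation:
            names.update((subject, obj))
    return names


def resolve_citation(quads, meta_graph, citation):
    """Return the triples of every named graph the citation refers to."""
    wanted = graphs_in_citation(meta_graph, citation)
    return [(s, p, o) for s, p, o, g in quads if g in wanted]


if __name__ == "__main__":
    for triple in resolve_citation(CITED_SUBSET, META_GRAPH,
                                   "ex:cit-sysA-CLEF2009"):
        print(triple)
```

Because the citation only names graphs, the cited data itself needs no modification, which matches the low-effort requirement stated earlier in the talk.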
  • 26. Towards a General Data Citation System [Davidson et al., 2017]. The identity + completeness issues: to identify and generate a citation for multiple resources. View-based model for relational databases. [Figure: six-component pipeline. A Specification Language defines Citation Views (Database Views V, Citation Queries CQ, Citation Function) over Database D; a Query Rewriting Function rewrites Query Q into rewritings q1, q2, ..., qn; a Preference Model under the Citation Policies selects the set of best rewritings; the resulting citations c1, c2, ..., cm are combined by an Aggregation Function into Citation C; a Citation Checking Mechanism completes the pipeline.]
  • 27. Conclusions
       - Reproducibility is a fundamental topic for science
       - Information retrieval evaluation is a challenging domain
       - Data citation is a complex and open problem:
           - new models of citations
           - computational solutions
           - intrinsically related to reproducibility
  • 28. References
     [Agosti et al., 2012] Agosti, M., Di Buccio, E., Ferro, N., Masiero, I., Peruzzo, S., and Silvello, G. (2012). DIRECTions: Design and Specification of an IR Evaluation Infrastructure. In Proceedings of the Third International Conference of the CLEF Initiative (CLEF 2012). LNCS 7488, Springer, Heidelberg, Germany.
     [Buneman et al., 2016] Buneman, P., Davidson, S. B., and Frew, J. (2016). Why data citation is a computational problem. Communications of the ACM (CACM), 59(9):50–57.
     [Buneman and Silvello, 2010] Buneman, P. and Silvello, G. (2010). A Rule-Based Citation System for Structured and Evolving Datasets. IEEE Data Eng. Bull., 33(3):33–41.
     [Davidson et al., 2017] Davidson, S. B., Deutch, D., Milo, T., and Silvello, G. (2017). A Model for Fine-Grained Data Citation. In 8th Biennial Conference on Innovative Data Systems Research (CIDR 2017).
     [Ferro, 2016] Ferro, N. (2016). Reproducibility Challenges in Information Retrieval Evaluation. ACM Journal of Data and Information Quality (JDIQ), to appear.
     [Silvello, 2015] Silvello, G. (2015). A Methodology for Citing Linked Open Data Subsets. D-Lib Magazine, 21(1/2).
     [Silvello, 2016] Silvello, G. (2016). Learning to Cite Framework: How to Automatically Construct Citations for Hierarchical Data. Journal of the American Society for Information Science and Technology (JASIST), in print:1–28.
     [Silvello et al., 2016] Silvello, G., Bordea, G., Ferro, N., Buitelaar, P., and Bogers, T. (2016). Semantic Representation and Enrichment of Information Retrieval Experimental Data. International Journal on Digital Libraries (IJDL), in press:1–28.
     [Silvello and Ferro, 2016] Silvello, G. and Ferro, N. (2016). "Data Citation is Coming". Introduction to the special issue on data citation. Bulletin of IEEE Technical Committee on Digital Libraries, Special Issue on Data Citation, 12(1):1–5.