2. A short history of Wikipedia
A website that anyone can edit
The largest reference work on the internet
A multi-language online encyclopedia
3. A short history of Wikipedia
A website that anyone can edit
The largest reference work on the internet
A multi-language online encyclopedia
4. A short history of Wikipedia
A website that anyone can edit
The largest reference work on the internet
A multi-language online encyclopedia
5. Wikipedia: unintended outcomes
accelerate the dissemination of scholarship
support open scientific research
enable distributed fact-checking and curation of scientific knowledge
6. accelerate the dissemination of scholarship
support open scientific research
enable distributed fact-checking and curation of scientific knowledge
7. Wikipedia: unintended outcomes
accelerate the dissemination of scholarship
support open scientific research
enable distributed fact-checking and curation of scientific knowledge
8. Outline
1. Wikipedia as the front matter to all research
2. A new kind of open knowledge
3. Wikidata: Collaboratively curated linked open data
4. WikiCite: Building the sum of all human citations
5. Applications
6. Concluding remarks
10. “Wikipedia is not the bottom
layer of authority, nor the top,
but in fact the highest layer
without formal vetting. In this
unique role, it serves as an
ideal bridge between the
validated and unvalidated
Web.”
Casper Grathwohl
Chronicle of Higher Education
http://chronicle.com/article/article-content/125899/
11. Top sources of DOI resolutions
http://crosstech.crossref.org/2014/02/many-metrics-such-data-wow.html
http://blog.crossref.org/2016/05/https-and-wikipedia.html
12. The world’s most accessed online medical resource?
Heilman and West (2015) doi.org/10.2196/jmir.4069
13. Most visited resource on Ebola in West Africa
Heilman (2016) http://tinyurl.com/jfuyduv
Most used internet site in Liberia,
Sierra Leone and Guinea for
Ebola during 2014 outbreak
Greater than CNN, CDC and WHO
21. Wikidata
Free knowledge base that anyone can edit
Launched in 2012
Integrated with Wikipedia and other sister
projects
Statistics (Aug 2016)
Nearly 20M items
Over 100M statements
29. Expert curation of scientific open data
Benjamin Good (2016) Opportunities and challenges
presented by Wikidata in the context of
biocuration
http://tinyurl.com/hk9qrmz
30. Expert curation of scientific open data
Gene Wiki: WIkidata SPARQL examples
https://bitbucket.org/sulab/wikidatasparqlexamples/overview
Get a list of all diseases treated by Metformin
Get all the gene ontology evidence codes used in Wikidata
Get all known drug-drug interactions for Methadone via its CHEMBL id
31. WikiCite
Building the sum of all human citations
Randall Munroe, Wikipedian protester http://tinyurl.com/p3rodlb [CC BY]
36. Linking is a small act of generosity that sends people away from
your site to some other that you think shows the world in a way
worth considering. [...]
[Sources] that are not generous with linking [...] are a stopping
point in the ecology of information. That’s the operational
definition of authority: The last place you visit when you’re
looking for an answer. If you are satisfied with the answer, you
stop your pursuit of it. Take the links out and you think you look
like more of an authority.
D. Weinberger (2012) Linking is a public good
http://www.hyperorg.com/blogger/2012/02/26/2b2k-linking-is-a-public-good/
43. The molecular origins of insulin go at least as far back as
the simplest unicellular [[eukaryotes]].<ref
name='LeRoith'>{{cite journal | vauthors = LeRoith D, Shiloach
J, Heffron R, Rubinovitz C, Tanenbaum R, Roth J | title =
Insulin-related material in microbes: similarities and
differences from mammalian insulins | journal = Can. J.
Biochem. Cell Biol. | volume = 63 | issue = 8 | pages = 839–49
| year = 1985 | pmid = 3933801 | doi = 10.1139/o85-106
}}</ref> Apart from animals, insulin-like proteins are also
known to exist in Fungi and Protista kingdoms.
References in Wikipedia
44.
45. Wikicite: goals
Lay the foundations for building a repository of all Wikimedia
citations and source metadata as structured data
Design data models and technology to improve the coverage,
quality, standards-compliance and machine-readability of
citations and source metadata in Wikimedia projects
https://meta.wikimedia.org/wiki/WikiCite_2016
46. Wikidata as the solution
Vision
Technology
Community
Scale
Licensing
Independence
57. Most cited authors in the research corpus on Zika
SPARQL: http://tinyurl.com/jb8da68
58. Semi-automated recommendation of missing statements or sources for unsourced
statements
https://www.wikidata.org/wiki/Wikidata:Primary_sources_tool
https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Validation_via_References
59. Tools for crowdsourcing entity matching / disambiguation
http://www.generalist.org.uk/blog/2014/wikidata-identifiers-and-the-odnb-where-next/
http://www.generalist.org.uk/blog/2014/wikidata-and-identifiers-part-2-the-matching-process/
60. all statements citing a New York Times article
the most popular scholarly journals used as citations for statements in any item that
is a subclass of economics
all statements citing the works of Joseph Stiglitz
all statements citing journal articles by physicists from Oxford University
all statements citing a journal article that was retracted
all statements citing a source that cites a journal article that was retracted
New opportunities for linked open knowledge curation and discovery
https://meta.wikimedia.org/wiki/WikiCite_2016/Report/Group_5
62. Liberate public domain bibliographic and citation data
Support new forms of open curation and distributed fact-checking
Accelerate open scientific research
Verifiable, Linked Open Knowledge
That Anyone can Edit
64. Thank you
Acknowledgments
Daniel Mietchen, Jonathan Dugan, Lydia Pintscher, Cameron Neylon, James Hare, James Heilman,
Magnus Manske, the Gene Wiki team (especially Andra Waagmeester and Benjamin Good), the
University of Chicago Knowledge Lab, all WikiCite 2016 participants and Wikidata Source Metadata
project contributors.
Additional image credits
Printing press, M. Wirth https://thenounproject.com/term/printing/11880/ [CC BY]
Cocitation network for openfMRI papers, F. Å. Nielsen https://twitter.com/fnielsen/status/752860630932156416
dario@wikimedia.org • @readermeter • @Wikidata • @WikiCite • @WikiResearch