“Hot Topics: The DuraSpace Community Webinar Series, "Series Five: VIVO: Research Discovery and Networking.” Webinar #: 6-11-13 VIVO Technical Deep Dive, June 11, 2013
Presented by: Brian Lowe, Semantic Applications Programmer, Cornell University and Jim Blake, VIVO Release Manager, Cornell University.
The "Direct" Services - DSpaceDirect and ArchivesDirect
6-11-13 VIVO Technical Deep Dive Presentation Slides
1. June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Hot Topics: The DuraSpace
Community Webinar Series
Series Five:
“VIVO: Research Discovery &
Networking ”
Curated by Dean Krafft
2. June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Webinar 3:
VIVO Technical Deep Dive
Presented by:
Brian Lowe, Semantic Applications Programmer, Cornell
Jim Blake, VIVO Release Manager, Cornell
3. VIVO “behind the curtain”
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
13. “Context node” pattern
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Person
Investigator Role
has role
role in
Grant
14. “Context node” pattern
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Person
has role
role in
Grant
Date/Time
Interval
Date/Time Value
Date/Time Value Date/Time Value
“2012”“2008”
Investigator Role
15. “List views”
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
16. “List views”
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
Triple
Store
SPARQL
queries
display
template
17. Triple stores
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
Triple
Store
SPARQL
queries
display
template
• Dump in whatever you
want (or can)
• Extend the ontology;
VIVO application adapts
• Others take what they
want, ignore the rest
• Reasoners add triples
automatically
18. Reasoning
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
19. Reasoning
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
20. Beyond the triple store:
SOLR index
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
21. Beyond the triple store:
SOLR index
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
22. Linked Data
LinkSailor by Ian Davis
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
June 11, 2013
23. Linked Data
Follow your nose
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
June 11, 2013
24. Linked Data
Follow your nose
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
June 11, 2013
25. Linked Data applications
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
June 11, 2013
26. Linked Data applications
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
Ponce
VIVO
WashU
VIVO
IU
VIVO
Cornell
Ithaca
VIVO
Weill
Cornel
l
VIVO
eagle-I
Research
resources
Other
VIVOs
Digita
l
Vita
RDF
Iowa
Loki
RDF
vivo
search.
org
UF
VIVO
Scripps
VIVO
Solr
search
index
Alter-
nate
Solr
indexLinked Open
Data
June 11, 2013
Harvard
Profiles
RDF
27. Data integration
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
• Not just transforming one document
into another
• Turning strings into things
• Making connections
• Building a graph
– Inside VIVO and linking out to the
Linked Data cloud
June 11, 2013
28. Data integration
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
• Disambiguating / deduping
• Dealing with dirty or incomplete data
June 11, 2013
29. Data integration tools
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
• Lots of options
• Triples just get dumped into triple store
in the end
• RDF can include additional source data
not needed for VIVO display
June 11, 2013
30. Data integration Tools
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
• VIVO Harvester
– developed at University of Florida
– convert from XML, CSV, relational
databases
– extensible architecture
– fetch, translate, score and match
June 11, 2013
31. Data integration tools
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
June 11, 2013
32. Data integration tools
Hot Topics: DuraSpace Community Webinar Series
Eat Smart NY
SPARQL
queries
display
template
June 11, 2013
34. From the outside
• Supports browsing and searching with
faceted results
• Usually populated by ingest
• Supports authorized editing of data
• Supports Linked Open Data requests
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
35. VIVO is open source
• OSI BSD license
– Moving to Apache 2 license
• Built on open source components
– Jena
– Solr
– jQuery
– FreeMarker
• Source code at
https://github.com/vivo-project
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
36. VIVO plays well with others
• Host operating system
– Linux, Windows
• Java web container
– Tomcat, GlassFish, WebLogic
• Database layer
– MySQL, Oracle
• Browsers
– Firefox, Chrome, Safari
– Internet Explorer 7, 8, 9
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
37. The customary 3-layer
architecture?
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
Presentation:
HTML, CSS, JavaScript, AJAX, jQuery
FreeMarker template engine
Business logic:
Java Web application
Data:
A semantic triple store or SPARQL endpoint
(frequently layered over a relational data base)
38. A typical installation
Hot Topics: DuraSpace Community Webinar Series
Firewall
Apache
HTTP
server
Tomcat
VIVO
Solr
MySQL
June 11, 2013
40. VIVO is intended for customization
• Themes and styles
• Displays and editing
• Static and dynamic pages
• Extensions to the ontology
• OpenSocial gadgets
• Backing technologies
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
41. VIVO is intended for customization
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
42. VIVO is intended for customization
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
43. VIVO is intended for customization
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
44. VIVO is intended for customization
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
45. VIVO is a customization
• Vitro is a general-
purpose web-based
ontology and instance
editor with
customizable browsing
• VIVO is a thin layer
around Vitro
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
VIVO
Vitro
46. Release 1.6 directions
• Ontology normalization
– Integrated Semantic Framework
– Separating data content from display
context
• Full featured Web Service
– Get data, modify data
– Integrated authentication and indexing
• Internationalization
– Support for multiple languages
• Performance improvements
Hot Topics: DuraSpace Community Webinar SeriesJune 11, 2013
48. Who are we?
• People who are developing VIVO
• People who are installing VIVO
• People who are customizing VIVO
• People who are re-purposing VIVO
• People who are curious about VIVO
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
49. What does the community provide?
• Mutual assistance on installations
• Answers to questions about VIVO
• Bug reports
• Shared code or configurations
• Documentation
• Ideas on how to improve VIVO
• Use cases and requirements
• Pre-release testing
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
50. The front door – vivoweb.org
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
51. vivoweb.org
• Blog with info about
– latest events
– latest releases
• Links to the other resources
• “Contact Us” form
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
52. The conversation: open calls
• Implementation and Development call
– Every Thursday at 1:00 p.m., Eastern time
– Full info and agendas on the wiki: search “Weekly
Implementation Calls”
• Ontology call
– Alternate Wednesdays at 11:00 a.m., Eastern time
– Full info and agendas on the wiki: search
“Ontology Calls”
• WebEx conferencing provides simultaneous
audio, chat and screen-sharing
• Everyone is welcome
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
53. The conversation: mailing lists
• Issues relating to VIVO implementation
– https://lists.sourceforge.net/lists/listinfo/vivo-
imp-issues
• Discussion of VIVO development
– https://lists.sourceforge.net/lists/listinfo/vivo-
dev-all
• Discussion of the VIVO ontology
– https://lists.sourceforge.net/lists/listinfo/vivo-
ontology
• Announcements of VIVO release candidates
– https://lists.sourceforge.net/lists/listinfo/vivo-
release-test
• Anyone can join – archives are available for
searching
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
54. The gatherings:
The VIVO Conference
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
57. Community Contributions
• Ontology
• Design
• Code
• Testing
• Documentation
• Outreach
• “Ecosystem”
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
58. Find out more
• Our web site:
– http://vivoweb.org
• The VIVO book:
– VIVO: A Semantic Approach to
Scholarly Networking and Discovery
• The 2013 VIVO Conference
– August 14-16, St. Louis, MO
June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
59. June 11, 2013 Hot Topics: DuraSpace Community Webinar Series
Questions?
Brian Lowe
Cornell University
bjl23@cornell.edu
Jim Blake
Cornell University
jeb228@cornell.edu