Exploring the Future Potential of AI-Enabled Smartphone Processors
7-23-12 DuraSpace & VIVO Sponsor Webinar Slides
1. DuraSpace and VIVO
DuraSpace Sponsors Webinar
July 23, 2012
Michele Kimpton - DuraSpace
Jonathan Markow - DuraSpace
Dean Krafft – Cornell/VIVO
Jon Corson-Rikert – Cornell/VIVO
2. Using the Webinar
Platform
• 2-way audio for all
participants is muted
• We will utilize the Chat
Window for the Q&A
portion or you may use
it if you are having
technical difficulties
• You may type your
question here & hit
‘enter’
3. DuraSpace and VIVO
DuraSpace Sponsors Webinar
July 23, 2012
Michele Kimpton - DuraSpace
Jonathan Markow - DuraSpace
Dean Krafft – Cornell/VIVO
Jon Corson-Rikert – Cornell/VIVO
4. Agenda
• The DuraSpace Incubator
• The VIVO Project
• DuraSpace and VIVO – Current Status
and Next Steps
• Questions
5. The DuraSpace Incubator
• Uses the Apache and Jasig models as a
starting point
• Projects are mentored by DuraSpace
• Projects work towards best practices,
governance, sustainability
• End of process is status as officially
recognized project
6. DuraSpace Umbrella Services
• Legal • Webinars
• Licensing • Education
• Governance • Peer Support
• Developer resources • Conferences
• Mentoring • Community
• Technology expertise development
• International exposure • Service providers
• Contributors • Admin support
• Recognition • Grants
• Marketing and • Advocacy
Communications • Strategic planning
7. Advantages to DuraSpace and our
Community
• Broader community support
• New technologies of interest to
DuraSpace community (e.g., VIVO:
semantic web, linked open data)
• Technology synergies (e.g., Fedora)
• Attractiveness to corporate sponsors
• Wider base of developers
• Management and overhead cost
efficiencies
8. DuraSpace Expectations
• Projects must meet incubation criteria
vis-à-vis best practices, licensing, broad
committer support, etc.
• Project community must fund
DuraSpace resources
• Project works with DuraSpace to
develop sustainability plan
9. Agenda
• The DuraSpace Incubator
• The VIVO Project
• DuraSpace and VIVO – Current Status
and Next Steps
• Questions
10. VIVO: An Open Source Tool for
Describing and Linking Researchers
and Research
Dean B. Krafft (presenter) and
Jon Corson-Rikert
Cornell University Library
DuraSpace Sponsors Webinar
July 23, 2012
11. University of Florida Indiana University
VIVO Collaboration
Mike Conlon (VIVO and UF PI) Katy Borner (IU PI)
Beth Auten Kavitha Chandrasekar
Michael Barbieri Bin Chen
Chris Barnes Shanshan Chen
Kaitlin Blackburn Ryan Cobine
Cecilia Botero Jeni Coffey Cornell University
Kerry Britt Suresh Deivasigamani Dean Krafft (Cornell PI)
Washington University School of
Erin Brooks Ying Ding Manolo Bevia Medicine in St. Louis
Amy Buhler Russell Duhon Jim Blake Rakesh Nagarajan (WUSTL PI)
Ellie Bushhousen Jon Dunn Nick Cappadona Kristi L. Holmes
Linda Butson Poornima Gopinath Brian Caruso Caerie Houchins
Chris Case Julie Hardesty Jon Corson-Rikert George Joseph
Christine Cogar Brian Keese Elly Cramer Sunita B. Koul
Valrie Davis Namrata Lele Medha Devare Leslie D. McIntosh
Mary Edwards Micah Linnemeier Elizabeth Hines
Nita Ferree Nianli Ma Huda Khan Weill Cornell Medical College
Rolando Garcia-Milan Robert H. McDonald Depak Konidena Curtis Cole (Weill PI)
George Hack Asik Pradhan Gongaju Brian Lowe Paul Albert
Chris Haines Mark Price Joseph McEnerney Victor Brodsky
Sara Henning Michael Stamper Holly Mistlebauer Mark Bronnimann
Rae Jesano Yuyin Sun Stella Mitchell Adam Cheriff
Margeaux Johnson Chintan Tank Anup Sawant Oscar Cruz
Meghan Latorre Alan Walsh Christopher Westling Dan Dickinson
Yang Li Brian Wheeler Tim Worrall Richard Hu
Jennifer Lyon Feng Wu Rebecca Younes Chris Huang
Paula Markes Angela Zoss Itay Klaz
Hannah Norton Kenneth Lee
James Pence The Scripps Research Ponce School of Medicine Peter Michelini
Narayan Raum
Institute Richard J. Noel, Jr. (Ponce PI) Grace Migliorisi
Nicholas Rejack John Ruffing
Gerald Joyce (Scripps PI) Ricardo Espada Colon
Alexander Rockwell Jason Specland
Catherine Dunn Damaris Torres Cruz
Sara Russell Gonzalez Tru Tran
Sam Katkov Michael Vega Negrón
Nancy Schaefer Vinay Varughese
Brant Kelley
Dale Scheppler Virgil Wong
Paula King
Nicholas Skaggs
Angela Murrell
Matthew Tedder
Barbara Noble
Michele R. Tennant This project is funded by the National Institutes of Health, U24 RR029822
Cary Thomas
Alicia Turner "VIVO: Enabling National Networking of Scientists”
Michaeleen Trimarchi
Stephen Williams
12. What is VIVO?
• A semantic-web-based researcher and
research discovery tool
– People plus much more
• Institution-wide, publicly-visible information
– For external as well as internal audiences
• An open, shared platform for connecting
scholars, communities, and campuses using
Linked Open Data
13. What does VIVO do?
• Integrates multiple sources of data
– Systems of record
– Faculty activity reporting
– External sources (e.g., Scopus, PubMed, NIH
RePORTER)
• Provides a review and editing interface
– Single sign-on for self-editing or by proxy
• Provides integrated, filterable feeds to other
websites
14. A brief VIVO history
2003-2005 First development for the life sciences
at Cornell, as a relational database
2006-2008 Expansion to all disciplines at Cornell,
and conversion to Semantic Web
2009-2012 National Institutes of Health-sponsored
VIVO: Enabling the National
Networking of Scientists project
transforms VIVO to a multi-institutional
open source platform
2012+ Transitioning VIVO to DuraSpace for
open community development
15. What does VIVO model?
• People, but also organizations, grants, programs,
projects, publications, events, facilities, and
research resources
• Relationships among the above
– Meaningful connections among people and activities
– Bidirectional
– Context and navigation from one point of interest to
another
• Links to URIs outside VIVO
– Concepts
– People, places, organizations, events
17. What is a Semantic Web application?
• Provides data readable by machines, not just text
for humans
• Provides self-describing data via shared ontologies
– Defined types
– Defined relationships
• Provides search & query augmented by relationships
• Does simple reasoning to categorize and find
associations
– Teaching faculty = any faculty member teaching a course
– All researchers involved with any gene associated with
breast cancer (through research project, publication,
etc.)
18. The VIVO ontology
• Defines types
– Having individual instances (e.g., persons)
• Defines relationships
– Expressed as statements about individuals
• Statements (“triples”) can describe or connect
– Data property statements describe individuals
– Object property statements connect individuals
19. Example relationships for a researcher
Mining the record: Historical evidence for…
has author academic staff research area for
author of
taught by in research area
research area
academic staff crop management author of
Susan Riha
teaches research area for Andrew McDonald
has author
headed by
CSS 4830 NYS WRI
head of
faculty appointment in
features faculty members
person Earth and Atmospheric
featured in Sciences
Cornell’s supercomputers crunch weather data to help farmers manage chemicals
21. What is Linked Open Data (LOD)?
• Data
– Structured information, not just documents with text
– A common, simple format
• Open
– Available, visible, mine-able
– Anyone can post, consume, and reuse
• Linked
– Directly by reference
– Indirectly through common references and inference
23. Linked data indexing for search
Scripps
Scripps
UF
WashU
WashU VIVO
VIVO
VIVO eagle-I
IU Research
VIVO resources Harvard
Ponce Profiles
VIVO RDF
Cornell Other
Ithaca VIVOs
VIVO
Weill
Weill Solr
Cornell
Cornell search Iowa
VIVO Loki
index
Alter- RDF
vivo nate
search Solr
.org
index
Digital
Vita
RDF
Linked Open Data
24.
25. Value for institutions
• Common data substrate
– Public, granular and direct
– Discovery via external and internal search engines
– Available for reuse at many levels
• Distributed curation
– E.g., affiliations beyond what HR system tracks
– Data coordination across functional silos
– Feeding changes back to systems of record
– Direct linking across campuses
• Data that is visible gets fixed
26. Enter once, use many times
• Provides normalized public data to a range of
campus applications
– Parameterized, filtered queries
• By affiliation, org or person id, research area, geography
– Search results
• Easily consumable data
– XML, HTML, JSON
– Import module for Drupal
– Widgets (Duke, UCSF)
27.
28.
29.
30. Partnerships – research resources
• CTSAconnect
– OHSU, Harvard, Cornell, Florida, Buffalo & Stony
Brook
– eagle-i sister NIH project – Harvard, OHSU, 7 others
• Facilities, services, techniques, protocols, skills,
and research outputs beyond publications
– Extended ways to represent expertise
– Improve attribution for data and other
contributions to science
31. Partnerships – ORCID
• ORCID – Open Researcher and Contributor ID
– Create an identifier for all authors. Attribute
works to authors through ORCID identifier
• ORCID and VIVO
– VIVO provides assertion of ORCIDs for people
through institutional identity management.
ORCID is an attribute in a VIVO profile.
– Anticipating batch submission of basic researcher
registrations by universities
http://orcid.org
32. Partnerships – research data
• VIVO/ANDS consortium in Australia
– Link research data with researcher profiles and
publications
– Harvest to national registry
• DataStaR local data registry tool
– Add-on to VIVO
– Complement to other library data-related services
– Institute for Museum and Library Services (IMLS)
grant
33. Partnerships – information standards
• euroCRIS and the CERIF research data standard
– Linked Open Data task force slated to map VIVO
and CERIF ontologies during 2012
• CASRAI (http://casrai.org)
– University, corporate, and government partnership
– Building a common data dictionary for research
information
– Coordinating with euroCRIS
34. Partnerships - APA
• The American Psychological Association is using
VIVO as one component of a trusted identity
framework for online submission and review of
scholarly publications
• VIVO’s integration with institutional identity
systems through the InCommon Federation
provides a platform for representing and
exchanging information about authors and their
works
• http://vivo.apa.org, http://publishtrust.org
35. Partnerships - Sakai
• Cambridge University is testing VIVO as a
profiling component of the new Sakai Open
Academic Environment product
• VIVO’s design permits integration into the
Sakai interface without major modification to
either platform
• NYU and other sites may expand the testing
pending grant approval
36. Institutions Adopting VIVO
• American Psychological Assn. • Univ. of Melbourne
• Brown University • New York University
• Univ. of Cambridge • Northeastern University
• Univ. of Colorado, Boulder • Notre Dame University
• Cornell University • Univ. of Pennsylvania
• Duke University • Penn State
• Eindhoven Univ. of Technology • Scripps Research Institute
• Univ. of Florida • SUNY, Stony Brook
• Griffiths University • U.S. Dept. of Agriculture
• Indiana University • Univ. of Virginia
• Johns Hopkins University • Univ. of Washington
• MIT • Washington Univ., St Louis
• Weill Cornell Medical College
37. VIVO Conference Corporate Sponsors
Sponsor 2010 2011 2012
AAAS (Science) X
CrossRef Silver
Elsevier X Silver Gold
IMO Interface Terminology X Gold
Microsoft Research Silver
Nature Jobs Silver
NETE Silver
PLOS X
ProQuest Gold
Springer Gold
Symplectic Gold Platinum
Refworks | COS X Gold
Thomson Reuters X Gold Silver
Wellspring Worldwide Silver
38. VIVO Service Providers
• Symplectic Elements – research management tools
supporting repositories and VIVO
– Preliminary data reviews from the Duke VIVO team are positive
• Wellspring Worldwide’s Flintbox.com
– Integrating faculty expertise and licensable technologies
– Membership free to universities and research institutes
– Announced plans to create VIVO-based researcher profiles for
all member organizations
• JK Software in Australia
– Custom forms and other interface work
• American Psychological Assocation
– Presenting at VIVO 2012 on licensing the APA research
thesaurus as a service integrated with VIVO
39. Development beyond the VIVO project
• Duke – VIVO widgets
• Indiana – Query builder interface for HUBzero
• Pittsburgh – Digital Vita Documents
• Stony Brook – UMLS terminology web service
• Weill Cornell – Google Refine integration
• Nebraska – BEPress publication importer
• UCSF – Open Social container and RDF gadgets
• USC – Karma Information Integration Tool
40.
41. Agenda
• The DuraSpace Incubator
• The VIVO Project
• DuraSpace and VIVO – Planning and
Next Steps
• Questions
42. VIVO Planning
• Roles and Responsibilities
• Governance
• Working groups
• Advisory bodies
• Work objectives and plan
• Costs
• Revenue streams
43. Roles and Responsibilities
• Five part-time VIVO staff positions
identified for operational roles
• DuraSpace effort
– Strategic planning
– Business development
– Marketing and communications
– Community development
– Technology best practices
– Admin support
45. Costs
• Existing DuraSpace staff effort (0.85 FTE
aggregated) – funded by VIVO
• Dedicated VIVO staff (0.95 FTE
aggregated) - Administered by
DuraSpace, sourced and funded by VIVO
• VIVO “in-kind” staff (0.75 FTE) hosted at
contributing institutions
• Other costs (travel, marketing, event
coordination, sponsorship campaign)
46. Revenues
• Grants and contracts
• Hosted services
• Corporate sponsorships
• Registered service providers
• Conference/event sponsorships
• Community sponsorship
• Founding contributors
47. Next Steps
• Joint announcement on intent
• VIVO raises funds to support its startup
period
• Initial fund-raising success will trigger
the start of incubation for VIVO.
As you can see, The VIVO project itself is a rather large, geographically dispersed team. 7 institutions Project areas: development, implementation, ontology, and outreach