Scratchpad 2, Virtual Research Environment: Project Update. Vince Smith, Natural History Museum, London. EOL Content Summit, Panama, 17-20 Jan. 2012
The problem
Scratchpads (http://scratchpads.eu)
Categories of Scratchpads Taxa ( Classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic & morphometric datasets, keys, phylogenies ) Projects Conservation Regions Societies
Scratchpad usage (2007-2011): Sites 326; Users 6279; Active Users 5139 (273 w / 759 m); Pages 424,972. Chart of Sites and Users, labelled ViBRANT and SP 2.
Scratchpad 2 Overview: Justification for SP2; Release Timeline. Goal: “a scholarly communication system that is intertwined with the pursuit of natural history, rather than its after-thought or annex”
Scratchpad 2 backend enhancements. Still to come…
Scratchpad 2 frontend enhancements. Still to come…
Scratchpad 2 theming: SP1, SP2. Garland. Idiosyncratic colours & layouts
Scratchpad 2 flexibility
Scratchpad 2 editing & administration: SP1, SP2. Complex & not intuitive
Scratchpad 2 data management
Scratchpad 2 guided workflows: SP1, SP2. Not intuitive. Steps 1 2 3 4 5
Scratchpad 2 faceted search
Scratchpad 2 multimedia gallery
Scratchpad 2 taxon pages: SP1, SP2
Scratchpad 2 taxon pages
Scratchpad 2 mapping Three map types supported in SP1 User defined  TDWG regions (up to Level 4) GBIF Maps User defined Point localities  via DwC records
Scratchpad 2 mapping
Scratchpad 2 “publication” options: Taxon descriptions. 1. Define the publication 2. Enter metadata 3. Select taxa & content 4. Organise manuscript 5. Submit to journal
Scratchpad 2 “publication” options: Datasets
Scratchpad 2 “publication” options: Data items
EOL Issues – discussion points: API, LifeDesks, Wikipedia

Scratchpad 2, Virtual Research Environment: Project Update


Editor's Notes

  1. Good afternoon everybody. I’m going to give a very quick update on the Scratchpad project, which is being developed under the auspices of an EU-funded project called ViBRANT and a UK Research Council project called e-monocot. The Scratchpads belong to a class of software called Virtual Research Environments, which enable real-time collaboration and dissemination of information and are creating new opportunities to manage data. The Scratchpads project has been running for approximately 5 years, and in the last year we have been redeveloping the software to improve the functionality and sustainability of the system. In this talk I’m going to quickly go over some of these new developments.
  2. So I think most of you will be familiar with the fundamental problem the Scratchpads are trying to address. Much of the science we are doing, especially in the field of taxonomy and biodiversity studies, needs to happen on a global scale. It requires global standards for software to talk to each other, global workflows to make that process of interaction seamless, and the cooperation of major projects working toward shared goals. However, the practice of taxonomy happens locally. It’s carried out by local groups of scientists, using local infrastructures, and usually with local funders. So the challenge is to link these local activities up to these global projects. And this is fundamentally what the Scratchpad project is about.
  3. The Scratchpad project originally started on a shoestring budget under the EU funded EDIT project, and this continued under the ViBRANT initiative, plus some other projects, whose funding is currently scheduled to finish at the end of 2013. As many of you will know, the Scratchpads are hosted websites for taxonomists. These websites can be customised by the people who own and manage the content on them. They typically focus on a particular taxon, the flora and fauna of a region, or in some cases the activities of a particular society. The sites act as a research & publication platform for their users, and they support the taxonomic workflow through a series of modules. The entire project is currently maintained and developed by just two software developers, who support an entire ecosystem of about 300 user communities.
  4. The types of information present within Scratchpads are very variable, but the majority of sites are principally about taxa. These sites hold structured information for taxa on a range of data types including classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic & morphometric datasets, keys and phylogenies. Very occasionally you will find a Scratchpad that has a little bit of all this information for a group, but more often these sites will have a specific focus. For example, many taxon sites concentrate on the compilation of bibliographic data, because it’s easy to share and benefits all parties. In addition to taxa there are many sites with very different agendas. Some focus on conservation issues like the drafting of Red List threat assessments. Others focus on specific science projects or the flora and fauna of a particular region. Others concern specific societies. The construction of these Scratchpads is very much a bottom-up endeavor, and we don’t seek to alter or influence this decision in any way, except through the structure and functionality we create in the sites.
  5. At present there are about 320 different Scratchpad sites, and in total we have about 6,000 registered users, of which about 5,000 are active users. In other words these active users have logged in and done something on the site. In fact I’ve started to stop paying attention to these total numbers. More interesting is the monthly and weekly use. So for example last week we had 273 users log in, and over the month that figure was 759. This monthly figure was across about 85 different Scratchpads, and gives an idea of the breadth of use and “stickiness” of the sites. The number of users on any one site ranges from 1 to over 1000, with an average of 15 per site and the most common number being just 1. We don’t collect precise numbers but broadly these fall into professional scientists and informed amateurs. We do have plans for some citizen science applications for the Scratchpads but this will come later. What is interesting is that the number of users shot up with the start of the ViBRANT project, and after Scratchpads 2 has bedded in during 2012, it’s likely this momentum will be maintained.
  6. So the overall goal of the Scratchpad project remains the same with the release of Scratchpad 2. We aim to create a scholarly communication system that is intertwined with the pursuit of taxonomy and systematics. In other words we want the Scratchpads to be embedded into the day-to-day work practices of the user community. Using the system should be seen as something that makes researchers’ lives easier, rather than being something that is just an add-on or optional extra to their activities. And this is the real justification for Scratchpads 2. The original version of the Scratchpads was put together on an ad hoc basis with a tiny budget. To use a British expression it is rather a kludge. New funding has allowed us to step back from this and reconsider how we change the technical delivery of the Scratchpads to make them more scalable and sustainable, as well as adding front end enhancements to improve the functionality and ease of use. The fact that the underlying content management system was undergoing a version upgrade from 6 to 7 also gave us a good reason to do this now. In terms of release timeline, at present all the technical development is done and we are in the process of completing the theming (in other words the presentation and views). The current plan, and I’m pretty confident this is accurate, is to have a complete version by the 3rd Feb, which we will release on the Sandbox for testing, comment and bug fixes. By early March we will push Scratchpads 2 to all new users. In other words new site requests will get Scratchpads 2 from this point on. By April we will offer an opt-in switchover for existing users. And by mid 2012 we will essentially force SP1 users to switch over, but preserve the option to stick with SP1 for some users. I’m certain some users will not want to switch because of the specific ways they are using their sites, or because the new theming breaks their sense of ownership of the site.
  7. So what are the improvements in SP2? As I said previously they consist of backend enhancements to make the system more sustainable, and front end enhancements to make the system more functional. I’m not going to go through the backend enhancements in detail, but in short they comprise: the Aegir hosting environment (automated site management & migration); themed project profiles (e.g. GBIF NPT, COMBER & potentially LifeDesks); Git code management (distributed source code version control); Scratchpad-wide search (Apache Lucene, dedicated VM); distributed physical hosting (not just at the NHM); and data services (Darwin Core Archives & Extensions). It’s worth noting that several backend improvements are still to come after the SP2 release.
  8. This is the list of front end enhancements, and rather than go through the list I have a slide on most of these which I will go through quickly.
  9. Probably the major difference people will see when it comes to SP2 is the new theming and presentation. The original was rather idiosyncratic, and left the user to customize the color scheme and layout. This resulted in some rather unprofessional looking sites, but did reinforce the user’s sense of ownership of the site and its content. It is perhaps for this reason that we are likely to have a problem moving some users to SP2, because they are so wedded to their original theme. The new theme is much more consistent, with less clutter and clearer navigation, and looks more professional. The goal was to preserve the flexibility we offered with SP1 but offer a more consistent set of navigational structures across the whole site.
  10. So for example users see these little settings wheels where they can easily configure content and the display. Likewise we have predefined swatches that enable users to pick a series of complementary colours for their site, rather than the hodgepodge that we currently have under SP1.
  11. Another major improvement in SP2 is a much easier editing and administration interface. All editing now takes place in overlays that sit on top of the display, providing a logical link between editing and how it’s presented. Likewise the interface to these is much more consistent, removing a lot of the clutter that was previously present.
  12. One of the biggest improvements in SP2 is a tabular data management environment. Almost all categories of content can be displayed in a spreadsheet-like environment that supports one-click Excel import from templates that are dynamically created by the Scratchpad to reflect the field structure of the content type. A key feature of this is the ability to instantly filter the display on any of the fields, and export this content. Even better is the ability to download the data, update it offline and then reimport it with the corrections. We are expecting this to be heavily used and to significantly drive up content within the Scratchpads.
  13. One of the biggest problems in SP1 was that many activities had to be performed in a particular sequence, and that this sequence wasn’t especially intuitive. As a result many users simply had to know or learn the sequence in order to achieve their goals. This is fixed in SP2 with the workflows module, which allows us to chain activities together, such as the processes involved in site setup, adding new users, or creating a new taxonomy. The result is that the whole system should be much more intuitive to the user. This is one part of SP2 that has not been themed yet so there is nothing too special to see. But the goal is that users can follow a more logical structure to running their site, without having to look at the help resources or contact the core Scratchpad team.
  14. Another big improvement is the incorporation of a faceted search interface into all the content types. For those unfamiliar with faceted search, it’s a bit like shopping on Amazon and having an intelligent list of search categories presented, depending on what you are looking at. So for example when you go to the default bibliography view, users see lists of authors, years and journals through which they can rapidly discover the information they are looking for. As I’ve already mentioned, for content displayed in a grid users can similarly search on any field to discover content too.
  15. This faceted search interface has been incorporated into the media gallery, which has a completely new look and also includes video support. As part of this it is our intention (and I say intention because I don’t know whether this has been done yet) to embed YouTube and Vimeo videos. Likewise users should be able to embed links to specific Flickr images too. As a result we don’t have to host the actual content if users want to place it elsewhere because of the benefits conferred by those other sites.
  16. As part of the redesign we have completely changed our taxon pages. Previously these were presented as a single page that showed a mix of content from the user’s site as well as a wide range of third party content. Frankly the interface for this wasn’t very good. It was very hard to navigate and customize. Also the quality of the third party information was often very poor. As a result, users usually just displayed their own content, and not the third party content, and in fact the taxon pages simply were not very well used. We have tried to address these shortcomings in SP2 by moving all content to tabs, and limiting the sources for third party content. This uses the EOL API and at the moment only displays third party content from EOL, although we plan to expand these sources later this year.
  17. The new interface is designed to encourage easy publishing of structured textual data (at present these are the SPM fields) to EOL. Users can also directly edit this content from the taxon page. As with SP1 the taxon pages support parent-child inheritance, so optionally data on, for example, a species page is available on the genus page too. Frankly the taxon pages are still in the process of being themed (this was only done last week) so these still have a little way to go before they are fully presentable.
  18. With regard to maps, in SP1 users had a rather confusing choice of three map types. They could make use of the GBIF occurrence maps, although interestingly many users chose not to display these because of concerns about data quality. Alternatively they could display maps dynamically constructed from their own specimen records in their site. Or they could construct regional maps identifying the presence or absence of taxa in a particular county, according to the TDWG standard, which we supported to level 4.
  19. Within SP2 the goal is to integrate all these map types, and ensure that even the third party data can be locally edited. This has already been achieved for point and TDWG region data. In addition users can now define flexible polygons, lines or annotations on the map, all with structured metadata. Technically at this stage it is also possible to import GBIF points onto the map along with the metadata, which can then be edited, and the location of points changed (or more likely suppressed on the map). However the danger here is that we are potentially recreating GBIF for certain taxa or regions within the site, and that is not the intention. Also there are major issues displaying very large numbers of points, although it is worth noting that many of these issues have now been fixed. Thus there is still some work to do here before the mapping module is fully realized with all this functionality.
  20. Within SP2 we have made major headway in allowing users to formally publish content from their site to other resources. These publishing options range from full manuscripts containing taxon descriptions, to data sets, or even individual data projects. Essentially these publishing options are the payback for users who engage with the site and have taken the trouble of structuring their data. By doing this the system enables them to very easily reuse this information and gain credit for it via publishing outlets. In the context of formal manuscripts, a prototype system has already been in operation in SP1. This enables users to create a paper for peer review and publication in the journals ZooKeys or PhytoKeys. This is done by entering the basic metadata about the paper, selecting which Scratchpad content forms part of the paper, providing an interface to organise the manuscript, and then the means to submit this as structured XML using TaxPub markup to Pensoft. At present we have had three papers with new species descriptions published this way. As part of the SP2 work, Pensoft are altering the XML structure and we have changed the interface to make this even easier to use.
  21. Shortly after the release of SP2 we will also be providing the option for users to do this for scientific datasets. Many researchers have checklists, ecological, phenotypic, genotypic or morphometric data that on their own don’t justify a traditional scientific paper. However, with sufficient descriptive metadata and a mechanism to offer this information to others in a structured way, these data would justify publication. Parts of the Scratchpad, like the Character project tool, provide the means for users to import or build these datasets very quickly, and in partnership with Pensoft we will provide a mechanism through which they can be formally published at the touch of a button.
  22. This same metaphor extends to smaller amounts of data. For example, it is our intention to support the direct publication of taxon names to ZooBank, once sufficient metadata has been deposited in the taxon editor. Likewise these principles extend to EOL SPM content. Once sufficient content is present, it should activate a button that enables (at the user’s discretion) a mechanism to push data to these third party databases. I should add that this level of granular publication has not been completed yet, although Scratchpads do support (and will continue to support) the publication of species profile information to EOL. However, we hope to add these functions in the near future.
  23. I just wanted to finish up with a few discussion points that particularly relate to the Scratchpads and EOL. These might be best discussed after the LifeDesk presentation, but essentially they are the issues we face either in making greater use of EOL in Scratchpads or in pushing content to EOL.