2. Overview Project goals & drivers History of the project Short walkthrough Overall Architecture The features of the fluor tool Setting up a collection Project Status 12th Sakai Conference – Los Angeles, California – June 14-16 2
3. Project Goals Make researchers share research data in a controlled community Integrate Sakai with the Fedora Content Repository Support searching and browsing Support different access models Open vs. closed 12th Sakai Conference – Los Angeles, California – June 14-16 3
4. Project Drivers UvALibrary: make researchers aware of the importance of sharing data Making research data publically available becomes more and more a requirement than a wish Publishing: Support publications by disseminate the underlying research data Teaching: have students work with actual research data 12th Sakai Conference – Los Angeles, California – June 14-16 4
5. A bit of history And now... 12th Sakai Conference – Los Angeles, California – June 14-16 5
6. Project history: testweeklab ‘Testweeklab’ project (2008) Work with 40 years of privacy sensitive research data Strong security requirements Only metadata (publically) accessible Complicated access procedure for accessing the actual data Very specific metadata schema, search and browse requirements Very specific fields (year, N, type of test, scale) Build as Sakai tool for connecting to a Fedora repository 12th Sakai Conference – Los Angeles, California – June 14-16 6
7. Project history: next steps Findings from testweeklab User Interaction improvements Configurability Support more types of usage Make the tool flexible to support different collections and types of use Make the access model flexible Support personalization 12th Sakai Conference – Los Angeles, California – June 14-16 7
8. A short functional walkthrough And now… 12th Sakai Conference – Los Angeles, California – June 14-16 8
15. Overall architecture: components Fedora: act as a content repository. Generic search: do the updates and transformations to Solr Solr: indexing, provide a search and browse interface Sakai FLUOR tool: create a UI for researchers to work with 12th Sakai Conference – Los Angeles, California – June 14-16 15
16. Fedora Content repository Content managed as data objects Unique identifier: PID Metadata Datastreams Relation between objects Virtual datastreams Versioning, logging Multiple collections Objects handled as XML (FOXML) 12th Sakai Conference – Los Angeles, California – June 14-16 16
17. Fedora generic search Enables browsing and search with Lucene, Solr and Zebra Gets notifications about updates from Fedora and fetches the objects XSLT transforms FOXML into documents for the search engine a Rest and SOAP interface, search and browse based on SRW/SRU. 12th Sakai Conference – Los Angeles, California – June 14-16 17
18. Solr Search engine Build on top of Lucene Easy to deploy and configure Advanced full-text searching and indexing Open interfaces, Rest, JSON, XML Admin interfaces Plugin architecture 12th Sakai Conference – Los Angeles, California – June 14-16 18
19. the fluor tool And now… 12th Sakai Conference – Los Angeles, California – June 14-16 19
20. Fluor tool features Access research data Search and browse Access items in the repository Create favorites Upload new items Added directly depending on security model 12th Sakai Conference – Los Angeles, California – June 14-16 20
21. Data access security model Metadata is always accessible Access on datastreams is limited Open: no restrictions Request based: user needs to create a request, and admin reviews them. Fluor tool features 12th Sakai Conference – Los Angeles, California – June 14-16 21
22. Fluor tool features 12th Sakai Conference – Los Angeles, California – June 14-16 22
23. Fluor tool features The access model is configurable. Open Request based Per object configurable 12th Sakai Conference – Los Angeles, California – June 14-16 23
24. Fluor tool features Support for versioning Enables the download of previous versions. Configurable 12th Sakai Conference – Los Angeles, California – June 14-16 24
25. Fluor tool features Data encryption The ability to encrypt datastreams Backups etc. cause no threat to privacy Configurable on the datastream 12th Sakai Conference – Los Angeles, California – June 14-16 25
26. Setting up a repository 12th Sakai Conference – Los Angeles, California – June 14-16 26
27. Setting up a collection Describe the collection What datastreams are there? What metadata is there? Set up the Fedora repository Configure the Fedora datamodel Set up generic search and Solr Configure the FLUOR tool 12th Sakai Conference – Los Angeles, California – June 14-16 27
28. Setting up a collection 12th Sakai Conference – Los Angeles, California – June 14-16 28
29. The status of the project And now… 12th Sakai Conference – Los Angeles, California – June 14-16 29
30. Project status The project is currently being tested by targeted end-users Common UI bugs and issues Search and index has problems 12th Sakai Conference – Los Angeles, California – June 14-16 30
31. Solrvs. Generic search Generic search 2.2 Browse functionality broken with Solr. Does not use facet browsing, instead access lucene index on file system. Browse not limited to collection, results polluted. Solution: Access Solr directly instead of generic search 12th Sakai Conference – Los Angeles, California – June 14-16 31
32. Any questions? An finally… 12th Sakai Conference – Los Angeles, California – June 14-16 32
33. Thank you! 12th Sakai Conference – Los Angeles, California – June 14-16 33
Notas do Editor
Make researchers aware of the importance of sharing data.Sharing data becomes even more a requirement than a wish.Support publications
Role of the library is to support researchers managing and storing data