Digitised content is often created behind tailored interfaces. How can the world of open data and APIs allow for different interfaces be built over the same content for different audiences
1. Digitised Content in an API World Alastair Dunning, JISC a.dunning AT jisc.ac.uk Resource Discovery Taskforce Meeting London, 20th April 2011 Acronym Count: 8
2. types of content I’m talking about: digitised text, manuscripts, images, film footage, audio archives, newspapers, documents, music (and their metadata). ie. the type of stuff at http://www.jisc-content.ac.uk/
3. such content tends to be locked down behind interfaces. usage is tied to technical infrastructure and interface
4. the trouble with current resources is that they demand certain ways of analysing and representing the resource – and they constitute the creators’ way of seeing the world, not the users’
6. but what would such content look like in an RDTF world, where data and service are separated? What do APIs allow to happen?
7. an API driven world would allow much greater flexibility over analysing a digitised dataset, i.e. different intellectual questions to be asked
8. and also different ways of visualising that digital content Thanks David McCandless! http://www.informationisbeautiful.net/visualizations/
9. and also of different ways of tailoring content for different audiences – different interfaces for schools, undergrads and researchers – all over the same content
10. more importantly, it can help break down the notion of a collection, and the related silos
22. Proceedings of the Old Bailey Online, 1674-1913but it is based on an API architecture which allows for aggregation and cross-search of 11 enriched metadata sets for the resources listed above (note: the aggregators created the enriched metadata and the APIs not the resource provider, and are testing the business model behind this)
23. so not only has an API architecture created a new resource for early modern British history, aggregating many disparate datasets but others can come along and create their own interfaces – including or excluding elements of 11 resources (and adding others) as required within these sources, there is rich metadata about places, areas, streets, names, crimes, genders, ages, occupations – these can be exploited in myriad ways
24. indeed, the team will be incorporating map data and archaeological data from BL and Museum of London to allow for spatial visualisation via geographical data (maps in this case) and mashing of historical data (largely about events and people) with archaeological data (largely about objects) Map - First Series Ordnance Survey, c.1805 from British Library via http://visionofbritain.org.uk/maps
25. and think how this could work when you start bringing datasets and content from different subject areas – economics, anthropology, fashion
26. on a practical note: don’t forget sustainability – the pressure of sustaining dataset and digitised content is relaxed for the collection holder; looking after the interface less important
27. short-term wins content and enthusiasm is out there, although disparate – see The New History Lab article visualisation can produce eye-catching success short bursts of funding can make things happen scholarly labs at KCL, UCL, Sheffield and elsewhere (BL, BUFVC) enthusiasm of GLAM sector (good work at V+A and Sci Museum) opportunties for enriching metadata via crowdsourcing
28. long-term challenges getting people to build and document and sustain APIs; explaing to collection curators how and why to do it (some) publishers suspicious getting people to build interfaces on top of APIs; technical knowledge required to do so quality of metadata; who owns enriched metadata? business models unclear; related licencing interoperability between APIs? citation academic scepticism + misunderstanding {the web changes}