SlideShare uma empresa Scribd logo
1 de 45
Audiovisual archives and digital humanities
                                       Netherlands Institute for Sound and Vision


                                                            Johan Oomen
                                                            Head of R&D (+ researcher VU University)

                                                            Roeland Ordelman
                                                            Policy advisor audiovisual access (+ researcher
                                                            University of Twente)
                                                            Erwin Verbruggen
                                                            Project manager EUscreen




http://www.walkerart.org/calendar/2009/benches-binoculars
                                                            contact: joomen@beeldengeluid.nl


   8 February 2013
                                                                *
                                                                                           #ousa2013
Netherlands Institute
for Sound and Vision
Sound and Vision R&D
Agenda

                         Johan Oomen
 – Open archives for Digital Humanities


         Roeland Ordelman
         - Speech search and Digital Humanities


                     Erwin Verbruggen
                   - EUscreen and DH

                     *
http://jurnsearch.wordpress.com/2013/01/13/digital-humanities-map/
Images for the Future


http://imagesforthefuture.com/en/news/images-
future-90-seconds




   @johanoomen

                       *
It would take over 6 million
years to watch the amount
of video that will cross
global IP networks each
month in 2016.
Every second, 1.2 million
minutes of video content
will cross the network in
2016.



                                 goal:
        ...be the best provider of your content

                      http://www.cisco.com/en/US/solutions/collateral/ns341/ns525/ns537/ns705/ns827
                              white_paper_c11-481360_ns827_Networking_Solutions_White_Paper.htm
Known item search
Explorative search




Bron M., van Gorp J., Nack F., de Rijke M., van Gorp J., de Leeuw S., "A Subjunctive Exploratory Search Interface to Support Media Studies Researchers", SIGIR '12: 35th
                         international ACM SIGIR conference on Research and development in information retrieval,, Portland, Oregon, ACM, pp. 425-434 , August, 2012.
Contextual search




http://zookma.science.uva.nl/linking-ui?session_id=510f98e28f034
Contextual search
Linking
Vocabularies




               Over 20 million
               records and growing.
Archives and DH

1.  Digitisation as driver for change
  •    Towards a cultural commonwealth
  •    Archives as a bridge to CS and DH
2.  Mutual benefit
  •  digging into data ó adding meaning
3.  From pilots to sustainable solutions
  •    Standards (W3C)
  •    In-house production system
  •    Shared infrastructures (i.e. CLARIAH.eu)




                                    *
Audiovisual collections, the
spoken word and user needs of
  scholars in the Humanities
   Observations based on related
     work in The Netherlands
            2005-2012          Roeland Ordelman
                                 @roelandordelman
E-Research E-research

• New and/or rapid ways to gain knowledge
• Digital resources and information technology
• Big data & data mining (social sciences)
• Digital Humanities / E-Humanities
• Digitization, Infra, Tools, Standards
• CLARIN.eu / DARIAH.eu
Emerging focus audiovisual
Emerging focus on on audiovisual

• Multi-modal, multi-semiotic:
  • multiple layers of meaning / interpretation
  • E.g., “quote + intonation + images + discourse”
• New dimensions for scholarly research
• Large investments in digitization:
  • Images for the Future: 200k hours of film, video
    and audio
  • Various digitization projects for scientific
    collections
METADATA
 RULES     ?
Metadata & Annotations
Metadata & annotations

• Annotations:
  • General (document level)
  • Specific (segment level)
• Metadata: typically sparse / document level
• Requirements dependent on research field
• Annotation generation:
  • Manual (Individual, Teams, Crowd)
  • Automatic: (un/lightly) supervised
Monitoring radio transcripts




INGEST SUPERVISION // ARCHIVIST
            SUPPORT:
   Quickly assess quality of ASR
Spoken word search 2005-2012

• Wide range of projects in various domains
  • Radio
    • Daily ingest: selection of programs
    • Woord.nl: public access to radio content
  • Historical video collections with sparse data
  • ``Oral History’’
• Development of an ASR service for
  cultural heritage institutions
1st experiment on ASR for
humanities: access to
personal recordings of Dutch
novelist WF Hermans
Access to interview
collection with camp
survivors World War II
Access to interview collections

FEMINIST MOVEMENT
Alignment of transcripts for indexing

INTERVIEWS ON BOMBARDEMENT
OF ROTTERDAM
Access to Radio interviews
Experiments with various types of access and result
presentation: speaker changes, speaking rate, search
strategies, word clouds
Access to Historical
Speeches:
Alignment & Linking
ACCESS TO
 DISTRIBUTED ORAL
 HISTORY
 COLLECTIONS

•  Infrastructure for
   searching collections
   at various institutes in
   The Netherlands
•  Harvesting of
   Metadata (OAI-PMH)
•  ASR as a service
•  Evaluated with Oral
   Historians
Observations on speech search

• Large variation in ASR performance
• Performance (and decisions on use)
  should be assessed in context of
  application: audiovisual search
• Usefulness in audiovisual search should
  be assessed in context of use scenarios
• Use scenarios require specific
  presentation/visualization requests
Usefulness of results
•  Perception of usefulness
   •  Usefulness in context of search/data exploration
   •  Educate / Expectation management
   •  Guide searching
   •  Show why (errors, confidence, trust-levels, cut-offs)
   •  Focus on research needs
•  Improve on ASR quality
   •  Educate: how to record an interview (Oral History)
   •  Use available textual resources (alignment, vocab optimization)
•  Improve on search application
   •  Visualization
   •  Result presentation
       •  documents versus segments
       •  combination of information sources
       •  cross/within-collection linking
Methodology
  Methodology (1)                          (1)
•  E-research is an intervention in current practices!
•  Promise:
   •  increased efficiency, relevance, novelty
•  Interest of scholars:
   • tools that facilitate or simplify existing practice (RIN
     report, 2011)
•  Co-development ICT-researchers & scholars to adjust
   expectations. Examples:
   • Finding more in less time may not be a goal in itself for
     humanities researchers
   • Deep engagement with primary texts versus results on the
     segment level
Methodology (2)

•  4 stages:
   1.    Preliminary archival search
         •  Browsing as a general interest
         •  Purpose driven (checking details, complementary resources)
         •  Item-oriented (finding first mentioning of something)
         •  Collection-oriented (thematic, source, person, event)
   2.    Content analysis
         •  Visualization, compression, aggregation
         •  (optionally) go back to (1)
   3.    Presentation and dissemination
         •  Enhanced publications (persistent identifiers on segment level)
   4.    Curation
         •  Trusted digital repository
•  (spoken) search scenarios: facilitate these stages
ASR for ASR for
        research         research
• Triple-A: Accessible, Affordable, Accurate
• Individual researchers sending files to ASR?
• Embedded in suite of research tools?
• What about integration in search
  applications?
  • Stagnation due to inadequate local infrastructures
• Variation across collections requires ‘tailor-
  made’ approaches: e.g., speaker adaptation,
  vocabulary adaptation, alignment, collection
  of related resources (information trail)
ASR
        ASR service              service



Upload: via http, ftp, api



Model of use:
 •  Free test bundle (10h)
 •  Various small/medium/large
    bundles
 •  Reduced costs (only
    hardware and maintenance)
 •  Management by CH body
 •  Maintenance by industry
    partner
Dutch Queen
Wilhelmina addressing
the Dutch people from
London during WWII
Exploring Europe’s Television Heritage in
Changing Contexts

 Erwin Verbruggen, R&D
     @erwinverb
Partner overview
Metadata
                         mint.image.ece.ntua.gr/

                    Based on EBUcore
            Mapped to the Europeana Data Model

      MAPPING TOOL                                 ANNOTATION TOOL


Massive uploads                                                  Item and
                                                    Group Level Annotation
Schema Mapping Service
                                                          Connection with
Quality Control                                         EUscreen Thesauri


Europeana Preview Services                    Search and Browsing Services
Euscreen Portal




WWW.EUSCREEN.EU
Storylines
Collaborative design sessions




    Virtual Exhibition Tool
Open access publishing with AV sources




WWW.VIEWJOURNAL.EU
Linked Open Data Pilot




LOD.EUSCREEN.EU
Visualisation demos




DEMO.EUSCREEN.EU
www.euscreen.eu
         facebook.com/euscreen
         twitter.com/euscreen




2/8/13

Mais conteúdo relacionado

Semelhante a Audiovisual archives and digital humanities

Research and Development at Sound and Vision
Research and Development at Sound and Vision Research and Development at Sound and Vision
Research and Development at Sound and Vision Victor de Boer
 
Digital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework ProgrammeDigital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework Programmelocloud
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...TimelessFuture
 
Audiovisual collections, the spoken word and user needs of scholars in the Hu...
Audiovisual collections, the spoken word and user needs of scholars in the Hu...Audiovisual collections, the spoken word and user needs of scholars in the Hu...
Audiovisual collections, the spoken word and user needs of scholars in the Hu...roelandordelman.nl
 
What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...ariadnenetwork
 
Sharing cultural heritage the linked open data way: why you should sign up
Sharing cultural heritage the linked open data way: why you should sign up Sharing cultural heritage the linked open data way: why you should sign up
Sharing cultural heritage the linked open data way: why you should sign up Johan Oomen
 
Developing the PARTHENOS eHumanities and eHeritage Webinar Series
Developing the PARTHENOS eHumanities and eHeritage Webinar SeriesDeveloping the PARTHENOS eHumanities and eHeritage Webinar Series
Developing the PARTHENOS eHumanities and eHeritage Webinar SeriesParthenos
 
R&D at Sound and Vision
R&D at Sound and VisionR&D at Sound and Vision
R&D at Sound and VisionBouke Huurnink
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana
 
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...The European Library
 
Crowdsourcing Descriptions for Nature Recordings
Crowdsourcing Descriptions for Nature RecordingsCrowdsourcing Descriptions for Nature Recordings
Crowdsourcing Descriptions for Nature Recordingsmaartenbrinkerink
 
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...TimelessFuture
 
What's the Point Of Digitisation: Measuring Use and Impact
What's the Point Of Digitisation: Measuring Use and ImpactWhat's the Point Of Digitisation: Measuring Use and Impact
What's the Point Of Digitisation: Measuring Use and ImpactAlastair Dunning
 
Introducing parthenos powerpoint presentation december 2015 updated
Introducing parthenos powerpoint presentation december 2015 updatedIntroducing parthenos powerpoint presentation december 2015 updated
Introducing parthenos powerpoint presentation december 2015 updatedParthenos
 
Research as infrastructure, Digital Humanities Congress, Sheffield 2012
Research as infrastructure, Digital Humanities Congress, Sheffield 2012Research as infrastructure, Digital Humanities Congress, Sheffield 2012
Research as infrastructure, Digital Humanities Congress, Sheffield 2012University of South Australlia
 
PhDO May 20 2011
PhDO May 20 2011PhDO May 20 2011
PhDO May 20 2011Johan Oomen
 
LinkedUp - European Data Forum
LinkedUp - European Data ForumLinkedUp - European Data Forum
LinkedUp - European Data ForumMarieke Guy
 
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...The Research Council of Norway, IKTPLUSS
 
Building Research Environments Online
Building Research Environments OnlineBuilding Research Environments Online
Building Research Environments OnlineDeb Verhoeven
 

Semelhante a Audiovisual archives and digital humanities (20)

Research and Development at Sound and Vision
Research and Development at Sound and Vision Research and Development at Sound and Vision
Research and Development at Sound and Vision
 
Digital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework ProgrammeDigital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework Programme
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
 
Audiovisual collections, the spoken word and user needs of scholars in the Hu...
Audiovisual collections, the spoken word and user needs of scholars in the Hu...Audiovisual collections, the spoken word and user needs of scholars in the Hu...
Audiovisual collections, the spoken word and user needs of scholars in the Hu...
 
What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...
 
Sharing cultural heritage the linked open data way: why you should sign up
Sharing cultural heritage the linked open data way: why you should sign up Sharing cultural heritage the linked open data way: why you should sign up
Sharing cultural heritage the linked open data way: why you should sign up
 
Developing the PARTHENOS eHumanities and eHeritage Webinar Series
Developing the PARTHENOS eHumanities and eHeritage Webinar SeriesDeveloping the PARTHENOS eHumanities and eHeritage Webinar Series
Developing the PARTHENOS eHumanities and eHeritage Webinar Series
 
R&D at Sound and Vision
R&D at Sound and VisionR&D at Sound and Vision
R&D at Sound and Vision
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
 
Kick-off meeting Linkflows project
Kick-off meeting Linkflows projectKick-off meeting Linkflows project
Kick-off meeting Linkflows project
 
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
 
Crowdsourcing Descriptions for Nature Recordings
Crowdsourcing Descriptions for Nature RecordingsCrowdsourcing Descriptions for Nature Recordings
Crowdsourcing Descriptions for Nature Recordings
 
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
 
What's the Point Of Digitisation: Measuring Use and Impact
What's the Point Of Digitisation: Measuring Use and ImpactWhat's the Point Of Digitisation: Measuring Use and Impact
What's the Point Of Digitisation: Measuring Use and Impact
 
Introducing parthenos powerpoint presentation december 2015 updated
Introducing parthenos powerpoint presentation december 2015 updatedIntroducing parthenos powerpoint presentation december 2015 updated
Introducing parthenos powerpoint presentation december 2015 updated
 
Research as infrastructure, Digital Humanities Congress, Sheffield 2012
Research as infrastructure, Digital Humanities Congress, Sheffield 2012Research as infrastructure, Digital Humanities Congress, Sheffield 2012
Research as infrastructure, Digital Humanities Congress, Sheffield 2012
 
PhDO May 20 2011
PhDO May 20 2011PhDO May 20 2011
PhDO May 20 2011
 
LinkedUp - European Data Forum
LinkedUp - European Data ForumLinkedUp - European Data Forum
LinkedUp - European Data Forum
 
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
 
Building Research Environments Online
Building Research Environments OnlineBuilding Research Environments Online
Building Research Environments Online
 

Mais de Johan Oomen

RE:VIVE pitch at the Time Machine conference
RE:VIVE pitch at the Time Machine conferenceRE:VIVE pitch at the Time Machine conference
RE:VIVE pitch at the Time Machine conferenceJohan Oomen
 
Towards Horizon Europe - Europeana Research and Innovation Agenda
Towards Horizon Europe - Europeana Research and Innovation AgendaTowards Horizon Europe - Europeana Research and Innovation Agenda
Towards Horizon Europe - Europeana Research and Innovation AgendaJohan Oomen
 
Open, Smart and Connected access to Audiovisual Collections
Open, Smart and Connected access to Audiovisual CollectionsOpen, Smart and Connected access to Audiovisual Collections
Open, Smart and Connected access to Audiovisual CollectionsJohan Oomen
 
New approaches towards accessing digital audiovisual heritage What will EUscr...
New approaches towards accessing digital audiovisual heritage What will EUscr...New approaches towards accessing digital audiovisual heritage What will EUscr...
New approaches towards accessing digital audiovisual heritage What will EUscr...Johan Oomen
 
SEAPAVAA 2018 Closing panel
SEAPAVAA 2018 Closing panelSEAPAVAA 2018 Closing panel
SEAPAVAA 2018 Closing panelJohan Oomen
 
DIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital HumanitiesDIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital HumanitiesJohan Oomen
 
Preserving Interactive Media - SXSW 2017
Preserving Interactive Media - SXSW 2017Preserving Interactive Media - SXSW 2017
Preserving Interactive Media - SXSW 2017Johan Oomen
 
Over de impact van open en genetwerkt erfgoed
Over de impact van open en genetwerkt erfgoedOver de impact van open en genetwerkt erfgoed
Over de impact van open en genetwerkt erfgoedJohan Oomen
 
CLARIAH kick-off 13 March 2015
CLARIAH kick-off 13 March 2015CLARIAH kick-off 13 March 2015
CLARIAH kick-off 13 March 2015Johan Oomen
 
LinkedTV Europeana tech 2015 ignite talk
LinkedTV Europeana tech 2015 ignite talkLinkedTV Europeana tech 2015 ignite talk
LinkedTV Europeana tech 2015 ignite talkJohan Oomen
 
Kwartaalbijeenkomst december 2015
Kwartaalbijeenkomst december 2015Kwartaalbijeenkomst december 2015
Kwartaalbijeenkomst december 2015Johan Oomen
 
Towards more smart, connected and open audiovisual archives
Towards more smart, connected and open audiovisual archivesTowards more smart, connected and open audiovisual archives
Towards more smart, connected and open audiovisual archivesJohan Oomen
 
Pilod 2014 welkom
Pilod 2014 welkomPilod 2014 welkom
Pilod 2014 welkomJohan Oomen
 
Op weg naar een Nederlandse Erfgoedthesaurus met Linked Open Data
Op weg naar een Nederlandse Erfgoedthesaurus met Linked Open DataOp weg naar een Nederlandse Erfgoedthesaurus met Linked Open Data
Op weg naar een Nederlandse Erfgoedthesaurus met Linked Open DataJohan Oomen
 
The many unexptected joys if being "out there": examples of user participatio...
The many unexptected joys if being "out there": examples of user participatio...The many unexptected joys if being "out there": examples of user participatio...
The many unexptected joys if being "out there": examples of user participatio...Johan Oomen
 
Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'
Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'
Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'Johan Oomen
 
Europeana Sounds kick-off - Workpackage 2 Enrichment and Participation
Europeana Sounds kick-off - Workpackage 2 Enrichment and ParticipationEuropeana Sounds kick-off - Workpackage 2 Enrichment and Participation
Europeana Sounds kick-off - Workpackage 2 Enrichment and ParticipationJohan Oomen
 

Mais de Johan Oomen (20)

RE:VIVE pitch at the Time Machine conference
RE:VIVE pitch at the Time Machine conferenceRE:VIVE pitch at the Time Machine conference
RE:VIVE pitch at the Time Machine conference
 
Towards Horizon Europe - Europeana Research and Innovation Agenda
Towards Horizon Europe - Europeana Research and Innovation AgendaTowards Horizon Europe - Europeana Research and Innovation Agenda
Towards Horizon Europe - Europeana Research and Innovation Agenda
 
DMI slides
DMI slidesDMI slides
DMI slides
 
Open, Smart and Connected access to Audiovisual Collections
Open, Smart and Connected access to Audiovisual CollectionsOpen, Smart and Connected access to Audiovisual Collections
Open, Smart and Connected access to Audiovisual Collections
 
MediaDNA


MediaDNA

MediaDNA


MediaDNA


 
New approaches towards accessing digital audiovisual heritage What will EUscr...
New approaches towards accessing digital audiovisual heritage What will EUscr...New approaches towards accessing digital audiovisual heritage What will EUscr...
New approaches towards accessing digital audiovisual heritage What will EUscr...
 
SEAPAVAA 2018 Closing panel
SEAPAVAA 2018 Closing panelSEAPAVAA 2018 Closing panel
SEAPAVAA 2018 Closing panel
 
DIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital HumanitiesDIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital Humanities
 
Preserving Interactive Media - SXSW 2017
Preserving Interactive Media - SXSW 2017Preserving Interactive Media - SXSW 2017
Preserving Interactive Media - SXSW 2017
 
Over de impact van open en genetwerkt erfgoed
Over de impact van open en genetwerkt erfgoedOver de impact van open en genetwerkt erfgoed
Over de impact van open en genetwerkt erfgoed
 
FIAT-IFTA panel
FIAT-IFTA panelFIAT-IFTA panel
FIAT-IFTA panel
 
CLARIAH kick-off 13 March 2015
CLARIAH kick-off 13 March 2015CLARIAH kick-off 13 March 2015
CLARIAH kick-off 13 March 2015
 
LinkedTV Europeana tech 2015 ignite talk
LinkedTV Europeana tech 2015 ignite talkLinkedTV Europeana tech 2015 ignite talk
LinkedTV Europeana tech 2015 ignite talk
 
Kwartaalbijeenkomst december 2015
Kwartaalbijeenkomst december 2015Kwartaalbijeenkomst december 2015
Kwartaalbijeenkomst december 2015
 
Towards more smart, connected and open audiovisual archives
Towards more smart, connected and open audiovisual archivesTowards more smart, connected and open audiovisual archives
Towards more smart, connected and open audiovisual archives
 
Pilod 2014 welkom
Pilod 2014 welkomPilod 2014 welkom
Pilod 2014 welkom
 
Op weg naar een Nederlandse Erfgoedthesaurus met Linked Open Data
Op weg naar een Nederlandse Erfgoedthesaurus met Linked Open DataOp weg naar een Nederlandse Erfgoedthesaurus met Linked Open Data
Op weg naar een Nederlandse Erfgoedthesaurus met Linked Open Data
 
The many unexptected joys if being "out there": examples of user participatio...
The many unexptected joys if being "out there": examples of user participatio...The many unexptected joys if being "out there": examples of user participatio...
The many unexptected joys if being "out there": examples of user participatio...
 
Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'
Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'
Europeana Awareness year 2 review slides for Workpackage 2 'End-user engagement'
 
Europeana Sounds kick-off - Workpackage 2 Enrichment and Participation
Europeana Sounds kick-off - Workpackage 2 Enrichment and ParticipationEuropeana Sounds kick-off - Workpackage 2 Enrichment and Participation
Europeana Sounds kick-off - Workpackage 2 Enrichment and Participation
 

Último

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 

Último (20)

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 

Audiovisual archives and digital humanities

  • 1. Audiovisual archives and digital humanities Netherlands Institute for Sound and Vision Johan Oomen Head of R&D (+ researcher VU University) Roeland Ordelman Policy advisor audiovisual access (+ researcher University of Twente) Erwin Verbruggen Project manager EUscreen http://www.walkerart.org/calendar/2009/benches-binoculars contact: joomen@beeldengeluid.nl 8 February 2013 * #ousa2013
  • 4. Agenda Johan Oomen – Open archives for Digital Humanities Roeland Ordelman - Speech search and Digital Humanities Erwin Verbruggen - EUscreen and DH *
  • 6. Images for the Future http://imagesforthefuture.com/en/news/images- future-90-seconds @johanoomen *
  • 7. It would take over 6 million years to watch the amount of video that will cross global IP networks each month in 2016. Every second, 1.2 million minutes of video content will cross the network in 2016. goal: ...be the best provider of your content http://www.cisco.com/en/US/solutions/collateral/ns341/ns525/ns537/ns705/ns827 white_paper_c11-481360_ns827_Networking_Solutions_White_Paper.htm
  • 9. Explorative search Bron M., van Gorp J., Nack F., de Rijke M., van Gorp J., de Leeuw S., "A Subjunctive Exploratory Search Interface to Support Media Studies Researchers", SIGIR '12: 35th international ACM SIGIR conference on Research and development in information retrieval,, Portland, Oregon, ACM, pp. 425-434 , August, 2012.
  • 13. Vocabularies Over 20 million records and growing.
  • 14. Archives and DH 1.  Digitisation as driver for change •  Towards a cultural commonwealth •  Archives as a bridge to CS and DH 2.  Mutual benefit •  digging into data ó adding meaning 3.  From pilots to sustainable solutions •  Standards (W3C) •  In-house production system •  Shared infrastructures (i.e. CLARIAH.eu) *
  • 15. Audiovisual collections, the spoken word and user needs of scholars in the Humanities Observations based on related work in The Netherlands 2005-2012 Roeland Ordelman @roelandordelman
  • 16. E-Research E-research • New and/or rapid ways to gain knowledge • Digital resources and information technology • Big data & data mining (social sciences) • Digital Humanities / E-Humanities • Digitization, Infra, Tools, Standards • CLARIN.eu / DARIAH.eu
  • 17. Emerging focus audiovisual Emerging focus on on audiovisual • Multi-modal, multi-semiotic: • multiple layers of meaning / interpretation • E.g., “quote + intonation + images + discourse” • New dimensions for scholarly research • Large investments in digitization: • Images for the Future: 200k hours of film, video and audio • Various digitization projects for scientific collections
  • 19. Metadata & Annotations Metadata & annotations • Annotations: • General (document level) • Specific (segment level) • Metadata: typically sparse / document level • Requirements dependent on research field • Annotation generation: • Manual (Individual, Teams, Crowd) • Automatic: (un/lightly) supervised
  • 20. Monitoring radio transcripts INGEST SUPERVISION // ARCHIVIST SUPPORT: Quickly assess quality of ASR
  • 21. Spoken word search 2005-2012 • Wide range of projects in various domains • Radio • Daily ingest: selection of programs • Woord.nl: public access to radio content • Historical video collections with sparse data • ``Oral History’’ • Development of an ASR service for cultural heritage institutions
  • 22. 1st experiment on ASR for humanities: access to personal recordings of Dutch novelist WF Hermans
  • 23. Access to interview collection with camp survivors World War II
  • 24. Access to interview collections FEMINIST MOVEMENT
  • 25. Alignment of transcripts for indexing INTERVIEWS ON BOMBARDEMENT OF ROTTERDAM
  • 26. Access to Radio interviews Experiments with various types of access and result presentation: speaker changes, speaking rate, search strategies, word clouds
  • 28. ACCESS TO DISTRIBUTED ORAL HISTORY COLLECTIONS •  Infrastructure for searching collections at various institutes in The Netherlands •  Harvesting of Metadata (OAI-PMH) •  ASR as a service •  Evaluated with Oral Historians
  • 29. Observations on speech search • Large variation in ASR performance • Performance (and decisions on use) should be assessed in context of application: audiovisual search • Usefulness in audiovisual search should be assessed in context of use scenarios • Use scenarios require specific presentation/visualization requests
  • 30. Usefulness of results •  Perception of usefulness •  Usefulness in context of search/data exploration •  Educate / Expectation management •  Guide searching •  Show why (errors, confidence, trust-levels, cut-offs) •  Focus on research needs •  Improve on ASR quality •  Educate: how to record an interview (Oral History) •  Use available textual resources (alignment, vocab optimization) •  Improve on search application •  Visualization •  Result presentation •  documents versus segments •  combination of information sources •  cross/within-collection linking
  • 31. Methodology Methodology (1) (1) •  E-research is an intervention in current practices! •  Promise: •  increased efficiency, relevance, novelty •  Interest of scholars: • tools that facilitate or simplify existing practice (RIN report, 2011) •  Co-development ICT-researchers & scholars to adjust expectations. Examples: • Finding more in less time may not be a goal in itself for humanities researchers • Deep engagement with primary texts versus results on the segment level
  • 32. Methodology (2) •  4 stages: 1.  Preliminary archival search •  Browsing as a general interest •  Purpose driven (checking details, complementary resources) •  Item-oriented (finding first mentioning of something) •  Collection-oriented (thematic, source, person, event) 2.  Content analysis •  Visualization, compression, aggregation •  (optionally) go back to (1) 3.  Presentation and dissemination •  Enhanced publications (persistent identifiers on segment level) 4.  Curation •  Trusted digital repository •  (spoken) search scenarios: facilitate these stages
  • 33. ASR for ASR for research research • Triple-A: Accessible, Affordable, Accurate • Individual researchers sending files to ASR? • Embedded in suite of research tools? • What about integration in search applications? • Stagnation due to inadequate local infrastructures • Variation across collections requires ‘tailor- made’ approaches: e.g., speaker adaptation, vocabulary adaptation, alignment, collection of related resources (information trail)
  • 34. ASR ASR service service Upload: via http, ftp, api Model of use: •  Free test bundle (10h) •  Various small/medium/large bundles •  Reduced costs (only hardware and maintenance) •  Management by CH body •  Maintenance by industry partner
  • 35. Dutch Queen Wilhelmina addressing the Dutch people from London during WWII
  • 36. Exploring Europe’s Television Heritage in Changing Contexts Erwin Verbruggen, R&D @erwinverb
  • 38. Metadata mint.image.ece.ntua.gr/ Based on EBUcore Mapped to the Europeana Data Model MAPPING TOOL ANNOTATION TOOL Massive uploads Item and Group Level Annotation Schema Mapping Service Connection with Quality Control EUscreen Thesauri Europeana Preview Services Search and Browsing Services
  • 41. Collaborative design sessions Virtual Exhibition Tool
  • 42. Open access publishing with AV sources WWW.VIEWJOURNAL.EU
  • 43. Linked Open Data Pilot LOD.EUSCREEN.EU
  • 45. www.euscreen.eu facebook.com/euscreen twitter.com/euscreen 2/8/13