Media Suite: Unlocking Archives for Mixed
Media Scholarly Research
Roeland Ordelman - Technical coordinator CLARIAH Media Suite
Netherlands Institute for Sound and Vision / University of Twente
The Netherlands
Media Studies
Focus on both “institutional”
data collections and collections
created by scholars
What data is in the Media Suite V3?
Radio & Television (1.88M items) Newspapers (60M pages)
Film (1129 films) Oral History (2744 interviews)
MULTIMEDIA
What data is in the Media Suite V3?
MIXEDMEDIA
RESEARCH PILOTS
Cross-Medial Analysis of WW2
Eyewitness Testimonies
Cross-media research of public debates
on drugs and regulation
Me and Myself: Tracing first person in
documentary history in AV-collections
Annotating EYE’s Jean Desmet Collection:
Towards Mixed Media Analysis in Digital Media History
Narrativizing Disruption: How exploratory search can support
media researchers to interpret ‘disruptive’ media events as lucid narratives
Remediation in Sports News
clariah.nl/projecten/research-pilots
Media Suite: enabling Mixed Media Scholarly Research
with Multi-media Data in a Sustainable Infrastructure
CLARIAH Centers
Common Lab Research Infrastructure for the Arts and Humanities
SUSTAINABLE
✓ Available after the project
✓ Maintenance and support
✓ Updates and upgrades
Architecture principles
1. Centers are responsible for data quality and for facilitating
access to data
2. Authorized access using a federated authentication
mechanism
3. Data is connected to a shared “workspace” (VRE) for
various forms of analysis …
4. … that provides exports of data in various formats for
using tools outside the closed environment
5. The Media Suite provides the interface on the underlying
architecture
1. Centers are responsible for data quality
and facilitate access to data
REGISTER COLLECTION
HARVEST COLLECTION METADATA
SEARCH COLLECTION
Collection Owner
Media Suite
Scholar
CKAN: web-based
open-source management
system for the storage and
distribution of open data
Open Archives Initiative (OAI)
ISSUE:
Persistent link to
source file
ISSUE:
IPR (e.g., no
subtitles)
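The register, harvest and search flow above can be sketched in code. The slide only names the Open Archives Initiative, so this minimal sketch assumes the harvest step uses the OAI-PMH `ListRecords` verb with Dublin Core (`oai_dc`) metadata. The endpoint `https://example.org/oai`, the set name `oral-history` and the sample record are hypothetical placeholders, not actual Media Suite or DANS values.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None,
                     resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


def parse_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        identifier = rec.findtext("oai:header/oai:identifier", namespaces=NS)
        titles = [t.text for t in rec.iterfind(".//dc:title", NS)]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext(".//oai:resumptionToken", namespaces=NS)
    return records, (token or None)


# Hypothetical response, trimmed to the fields used above.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:interview-1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Hypothetical interview</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, token = parse_records(SAMPLE)
print(records, token)
print(list_records_url("https://example.org/oai", set_spec="oral-history"))
```

A harvester would repeat the request with the returned resumption token until none is returned. The two issues noted on the slide (persistent links and IPR) are not addressed by this sketch.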
Example: DANS registers the Oral History set
“METADATA ARCHEOLOGY”
Manual effort to describe metadata fields
ISSUE:
Resources (manual effort)
Tools for inspection of metadata
2. Authorized access using a federated
authentication mechanism
Secure play-out and viewing
ISSUE:
Not always
available
Federated login
3. Data is connected to a shared
“workspace” (VRE) for analysis
ISSUE:
Currently semi-shared
WORKSPACE
✓ Create virtual personal
mixed media collections
✓ Create projects
✓ Store annotations
✓ Upload personal collections
✓ Advanced data analysis
(Jupyter Notebooks)
✓ Advanced data processing
✓ Export annotations
Data analysis: Jupyter Notebooks or NLP
ISSUE:
Robust pipelines
Write your own (Python)
code to analyze the data
in the Media Suite
ISSUE:
expertise
Example
output
Jupyter
Notebook
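As an illustration of the kind of Python a scholar might write in a notebook, the sketch below counts how often a term occurs per year or per medium, a simple form of distant reading. It does not use any Media Suite API: the `ITEMS` list, its field names and its texts are hypothetical stand-ins for data that would come from a workspace collection.

```python
import re
from collections import Counter

# Hypothetical items; in practice these would come from a user collection.
ITEMS = [
    {"id": "item-1", "year": 1975, "medium": "television",
     "text": "Debate on drugs policy and drugs regulation"},
    {"id": "item-2", "year": 1975, "medium": "newspaper",
     "text": "New regulation announced"},
    {"id": "item-3", "year": 1980, "medium": "radio",
     "text": "Interview about drugs"},
]


def term_hits(items, term, by="year"):
    """Count whole-word, case-insensitive occurrences of term, grouped by a field."""
    pattern = re.compile(rf"\b{re.escape(term)}\b", re.IGNORECASE)
    hits = Counter()
    for item in items:
        hits[item[by]] += len(pattern.findall(item["text"]))
    return dict(sorted(hits.items()))


print(term_hits(ITEMS, "drugs"))               # grouped by year
print(term_hits(ITEMS, "drugs", by="medium"))  # grouped by medium
```

Grouping by `medium` rather than `year` gives a rough cross-media comparison of the same term.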
Auto Metadata Extraction –
Large scale speech recognition
350K hours processed
until now
Poster slam 11:00 – 11:30 tomorrow
4. Provide exports of data for tools outside
Media Suite is just an
interface on the
underlying
infrastructure….
Speech Suite
Media Suite: Unlocking Archives for Mixed Media
Scholarly Research
Co-development
Community
building
User stories!
Short iterations
(sprints) of 2 weeks:
development &
testing
Liaisons part of
development team:
• Information Specialist
• Experienced DH Researcher
Workshops, hackathons,
data-a-thons
Discussing issues with Gitter
Tracking issues with GitHub
SCHOLARLY PRIMITIVES
Unsworth, 2000
Blanke and Hedges, 2013
“Unlock data”
Distant reading
Close reading
1. Discovery & Inspection of data sets hidden in archives
2. Discovery of items in large archival data sets
3. Accessing items (play, view) from restricted data sets
4. Discovery of segments in time-based media
5. Relating and comparing data on the segment level
Distant reading / Close reading
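Steps 4 and 5 (finding segments in time-based media and relating them across items) can be sketched with time-coded transcript segments. This is only an illustration: the `Segment` type, its fields and the sample transcripts are hypothetical, and the sketch does not reflect how the Media Suite indexes speech recognition output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    item_id: str   # identifier of the interview or broadcast
    start: float   # segment start, in seconds
    end: float     # segment end, in seconds
    text: str      # transcript text for this time span


def find_segments(segments, term):
    """Return the segments whose text contains term (case-insensitive)."""
    needle = term.casefold()
    return [s for s in segments if needle in s.text.casefold()]


def format_timecode(seconds):
    """Format a number of seconds as mm:ss."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


# Hypothetical time-coded transcripts from two interviews.
SEGMENTS = [
    Segment("interview-1", 0.0, 12.5, "We moved to Rotterdam in 1939."),
    Segment("interview-1", 12.5, 30.0, "Then the bombing started."),
    Segment("interview-2", 95.0, 110.0, "I remember the bombing of Rotterdam."),
]

for seg in find_segments(SEGMENTS, "rotterdam"):
    print(seg.item_id, format_timecode(seg.start), seg.text)
```

Returning the start time with each hit lets a viewer jump straight to the matching passage, which is what turns item-level search into segment-level close reading.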
Search Oral History in Media Suite
Project
Search
Bookmark
Save
Bookmark
Save
Query
Bookmark view, View Source
Annotation view, View Source, Alignment
ISSUE:
Complex
interface
Private collection Apply enrichment or a “pipeline”
To appear:
Content-based Cross-media
Recommendations
Issues/investments
1. Registered collections: persistent link (data management)
2. Registered collections: rights don’t permit (legal)
3. Metadata archeology: manual resources (funding)
4. Play-out/view: not always available (funding)
5. Shared workspace: semi-shared (infra development)
6. Advanced analysis: expertise scholars (training)
7. Advanced analysis: robust pipelines (benchmarking)
8. Workspace: complex interface (interaction design)
Summary
Main contribution: enabling mixed media scholarly
research for “institutional” multimedia collections
Bringing the Tools to the Data: in progress but already
useful:
✓ Unlocking the data, enabling distant/close reading
✓ Supporting the scholarly primitives
✓ Providing a workspace for saving annotations, creating
collections and options for (advanced) analysis
Research coordination: Julia Noordegraaf @jjnoordegraaf
Technical coordination: Roeland Ordelman @roelandordelman
DEMO & QUESTIONS AT THE BAZAR
mediasuite.clariah.nl

