SlideShare uma empresa Scribd logo
1 de 22
HARD CONTENT, FAB FRONT-END
Archiving websites of the Dutch Public Broadcasters
23-5-2014
Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision
IIPC | 21 May 2014 | BnF, Paris
Nederlands Instituut voor
Beeld en Geluid
Sound and Vision
• 70% of Dutch AV heritage
• > 850,000 hours
• 2M photos
•20,000 objects
• Large paper archives
“The Archive as a Laboratory”
Web archiving since 2008 (LiWA, several pilots) with various objectives
NTR PILOT
(2013-2014)
23-5-2014
WHY:
• Saving websites selected to be taken offline
• Getting insights in user requirements
• Create great front and back-end
• Provide public access
• Shape future plans
WEBSITES
23-5-2014
CRAWLING ISSUES
ACCESS ISSUES
USER REQUIREMENTS, PT. 1
Phase 1: Focus group
USER REQUIREMENTS SUMMARY
• Communication and information
e.g. “As a user, I can suggest a website that should be archived”
• Metadata
e.g. “As a user, I can see the crawl date for each archived URL”
• Searching
e.g. “As a user, I can search full-text through a single archived website”
• Visualisation
e.g. “As a user, I can see side-by-side comparisons of the same URL that was
archived at different moments in time”
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
USER REQUIREMENTS, PT. 2
Phase 2: Usability tests
think-aloud, 60-90 minutes
x 2:
• 37, PostDoc web archive research project
• 58, Multimedia editor at a Dutch public broadcaster
x 3:
• 44, Crawl engineer
• 50, Manager digital projects at a Dutch public broadcaster
• 58, Freelance (archive) researcher & journalist
LESSONS-LEARNED
UI/UX
+ Clean, visual look
- More functionality explanations
COMMUNICATION
+ FAQ contains good info about
web archiving
- Info about status + plans
/ More info about scope and size
of web archive
METADATA
+ Overview of outgoing links
- TMI
/ Creation + last change of
website
SEARCHING
+ Fast!
+ Thumbnail previews
- Search by URL
- More filtering options
- Relevance ranking
VISUALISATION
/ More stats, e.g., % text
- Highlight differences crawls
USERS & USAGE
+ Current groups representative
- No av-streaming big loss for all
/ Add more fine-grained
subgroups
FUTURE WORK WEB ARCHIVES:
CONTEXT COLLECTIONS
“Public broadcaster web archives will help you learn where you come from”
-- Usability test participant
• We need to be more dynamic than the websites we archive
• We can and must achieve public access
• We are moving from pilot to standard practice
• Connect crawls to catalogue
• Increase public broadcaster cooperation
Thanks!
@lottebelice | lbbaltussen@beeldengeluid.nl
@benglabs

Mais conteúdo relacionado

Semelhante a Hard Content, Fab Front-end @ IIPC 2014

Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User ResearchJeremy Horn
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver
 
AGM 2013 Task Force meetings
AGM 2013 Task Force meetingsAGM 2013 Task Force meetings
AGM 2013 Task Force meetingsEuropeana
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Serviceslocloud
 
2009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS88782009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS8878Jonathan Hassell
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...OpenAIRE
 
Whowas: Historical Whois Service
Whowas: Historical Whois ServiceWhowas: Historical Whois Service
Whowas: Historical Whois ServiceAPNIC
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreAndy Powell
 
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1Europeana
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...Krishna-Kumar
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in LibrariesAnupama Saini
 
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...Stefan Buddenbohm
 

Semelhante a Hard Content, Fab Front-end @ IIPC 2014 (20)

Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016
 
Personal learning environment
Personal learning environmentPersonal learning environment
Personal learning environment
 
AtoM, Authenticity, and the Chain of Custody
AtoM, Authenticity, and the Chain of CustodyAtoM, Authenticity, and the Chain of Custody
AtoM, Authenticity, and the Chain of Custody
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User Research
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
AGM 2013 Task Force meetings
AGM 2013 Task Force meetingsAGM 2013 Task Force meetings
AGM 2013 Task Force meetings
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Services
 
2009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS88782009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS8878
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...
 
Whowas: Historical Whois Service
Whowas: Historical Whois ServiceWhowas: Historical Whois Service
Whowas: Historical Whois Service
 
255 shaw
255 shaw255 shaw
255 shaw
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
04_Knutas_DOIT platform as an open educational resource
04_Knutas_DOIT platform as an open educational resource04_Knutas_DOIT platform as an open educational resource
04_Knutas_DOIT platform as an open educational resource
 
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
 
All WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKennaAll WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKenna
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in Libraries
 
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
 

Mais de Lotte Belice Baltussen

Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidDigitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidLotte Belice Baltussen
 
Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Lotte Belice Baltussen
 
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Lotte Belice Baltussen
 
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...Lotte Belice Baltussen
 
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Lotte Belice Baltussen
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataLotte Belice Baltussen
 
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game onLotte Belice Baltussen
 
Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Lotte Belice Baltussen
 
Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Lotte Belice Baltussen
 
Crowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsCrowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsLotte Belice Baltussen
 
Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Lotte Belice Baltussen
 

Mais de Lotte Belice Baltussen (20)

Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidDigitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
 
Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)
 
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
 
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
 
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
 
Open Cultuur Data België eind-event
Open Cultuur Data België eind-eventOpen Cultuur Data België eind-event
Open Cultuur Data België eind-event
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur Data
 
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
 
AVA_net workshop 7 maart 2013
AVA_net workshop 7 maart 2013AVA_net workshop 7 maart 2013
AVA_net workshop 7 maart 2013
 
Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012
 
Open cultuur data - cop gouda gha
Open cultuur data - cop gouda ghaOpen cultuur data - cop gouda gha
Open cultuur data - cop gouda gha
 
Open Cultuur Data - Eth0:2012 Summer
Open Cultuur Data - Eth0:2012 Summer Open Cultuur Data - Eth0:2012 Summer
Open Cultuur Data - Eth0:2012 Summer
 
Workshop DEN Baas over eigen metadata
Workshop DEN Baas over eigen metadataWorkshop DEN Baas over eigen metadata
Workshop DEN Baas over eigen metadata
 
Open Culture Data - PMOD
Open Culture Data - PMODOpen Culture Data - PMOD
Open Culture Data - PMOD
 
Open Cultuur Data competitie 2012
Open Cultuur Data competitie 2012Open Cultuur Data competitie 2012
Open Cultuur Data competitie 2012
 
Open Cultuur Data - hackathon pitches
Open Cultuur Data - hackathon pitchesOpen Cultuur Data - hackathon pitches
Open Cultuur Data - hackathon pitches
 
Open Cultuur Data - KVAN 2012
Open Cultuur Data - KVAN 2012Open Cultuur Data - KVAN 2012
Open Cultuur Data - KVAN 2012
 
Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6
 
Crowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsCrowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collections
 
Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011
 

Último

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 

Último (20)

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 

Hard Content, Fab Front-end @ IIPC 2014

  • 1.
  • 2. HARD CONTENT, FAB FRONT-END Archiving websites of the Dutch Public Broadcasters 23-5-2014 Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision IIPC | 21 May 2014 | BnF, Paris
  • 3. Nederlands Instituut voor Beeld en Geluid Sound and Vision • 70% of Dutch AV heritage • > 850,000 hours • 2M photos •20,000 objects • Large paper archives
  • 4.
  • 5. “The Archive as a Laboratory” Web archiving since 2008 (LiWA, several pilots) with various objectives
  • 6. NTR PILOT (2013-2014) 23-5-2014 WHY: • Saving websites selected to be taken offline • Getting insights in user requirements • Create great front and back-end • Provide public access • Shape future plans
  • 10. USER REQUIREMENTS, PT. 1 Phase 1: Focus group
  • 11.
  • 12.
  • 13. USER REQUIREMENTS SUMMARY • Communication and information e.g. “As a user, I can suggest a website that should be archived” • Metadata e.g. “As a user, I can see the crawl date for each archived URL” • Searching e.g. “As a user, I can search full-text through a single archived website” • Visualisation e.g. “As a user, I can see side-by-side comparisons of the same URL that was archived at different moments in time”
  • 19. USER REQUIREMENTS, PT. 2 Phase 2: Usability tests think-aloud, 60-90 minutes x 2: • 37, PostDoc web archive research project • 58, Multimedia editor at a Dutch public broadcaster x 3: • 44, Crawl engineer • 50, Manager digital projects at a Dutch public broadcaster • 58, Freelance (archive) researcher & journalist
  • 20. LESSONS-LEARNED UI/UX + Clean, visual look - More functionality explanations COMMUNICATION + FAQ contains good info about web archiving - Info about status + plans / More info about scope and size of web archive METADATA + Overview of outgoing links - TMI / Creation + last change of website SEARCHING + Fast! + Thumbnail previews - Search by URL - More filtering options - Relevance ranking VISUALISATION / More stats, e.g., % text - Highlight differences crawls USERS & USAGE + Current groups representative - No av-streaming big loss for all / Add more fine-grained subgroups
  • 21. FUTURE WORK WEB ARCHIVES: CONTEXT COLLECTIONS “Public broadcaster web archives will help you learn where you come from” -- Usability test participant • We need to be more dynamic than the websites we archive • We can and must achieve public access • We are moving from pilot to standard practice • Connect crawls to catalogue • Increase public broadcaster cooperation