W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

•

1 gostou•437 visualizações

This document summarizes a large-scale study of web accessibility that analyzed over 28 million web pages from the Portuguese web. The study found that (1) only 56% of pages passed basic accessibility checkpoints, (2) page complexity and size were correlated with accessibility rates, with smaller pages tending to score better, and (3) warnings around accessibility issues were more prevalent than expected and open to different interpretations. The study highlights ongoing challenges in fully evaluating web accessibility at scale.

Tecnologia

Web Not For All
A Large Scale Study of Web Accessibility

Rui Lopes1, Daniel Gomes2, Luís Carriço1

1 LaSIGE, University of Lisbon
2 FCCN

Context

• The Web is the biggest information source
for Mankind. Decentralised architecture
made it blossom.
• Humans (and computers!) contribute to
information production and consumption,
leading to ~45B Web pages.

Context
• Growth of users contributing and
interacting with the Web leads to significant
diversity of users, including people with
disabilities.
• The openness and decentralisation of the
Web leads to an uncontrolled quality check
of Websites’ usability (and accessibility).

What is the state of accessibility on the Web?

• It is known that Web accessibility adequacy
is often far worse than desired.
• Studies tend to focus on a restricted (small)
set of Web sites.

• Do macroscopic properties of Web
accessibility emerge from analysing at a
large scale?

Experiment
background

• The Portuguese Web Archive initiative
periodically crawls contents from the
Portuguese Web (.pt and others) for future
preservation.
• Services are built on top of crawled
collections: search (end users) & analysis
framework (researchers).

Methodology
data acquisition - obtaining the document collection

• Collect a sufficiently large portion of the
Web, yet representative (e.g., national
Webs)
• Spider traps handled gracefully
• Boostraped with 200,000 Website
addresses from the .pt TLD
• Collected March/May 2008

Methodology
data acquisition - evaluation process

• Implementation of 39 WCAG 1.0
checkpoints yield pass, fail, warn.
(collection previous to WCAG 2.0 TR)

• Overcome computational effort with
Hadoop cluster, streams, caching, etc.

Methodology
data analysis

• Failure rate, 3 criteria:

Results
general

• 28M Web pages were evaluated. (58%)

• 21GB evaluation data collected for analysis.
• 40B HTML elements evaluated. (~1500/page)

• 1.5B elements passed. (56/page, 3.89%)

• 2.9B elements failed. (103/page, 7.15%)

• 36B elements warned. (1291/page, 89%)

Results
rates versus page count distribution

conservative optimistic strict

Results
rates versus page complexity (# HTML elements)

conservative optimistic strict

Discussion
on the results

• Large scale confirms predictions of small
scale studies - the Web is still not for all.
• Smaller Web pages tend to have greater
accessibility quality.
• Nature of warnings is more striking than
expected, completely different
interpretations.
• Automated evaluation is just the
beginning.

Discussion
on the limitations of the experiment

• HTML structure vs. content rhetorics.
(CSS & Javascript can change it all)

• Collecting the Web is hard.
(deep Web - AJAX & forms -, infinite generation, robots.txt, etc.)

• Scaling evaluation & analysis processes is hard.
(evaluation streamability, resource inter-dependencies, billion node graphs, etc.)

Conclusions
• Large scale accessibility evaluation of the
Portuguese Web.
• Re-confirmed studies at the large.
• Educating developers & designers about
warnings is crucial for accessibility success!
• Automated evaluation is just the start.
Always need for expert & users evaluations.

Ongoing Work
we’re still at the tip of the iceberg

• Linking properties (ranking vs. accessibility)

• Evolution of accessibility compliance in
time (different document collections)
• Cross-cuts: gov, e-com, personalisation, etc.

• Developing countries
countries)
(Portuguese speaking African

Ongoing Work
help wanted from community!

• Making available evaluation datasets (e.g.,
Linked Data). Ours and yours!
• Larger document collections.

• Transforming warnings into failures with
machine learning.

Mais conteúdo relacionado

Mais procurados

292 daniel dollar ssp yale_28_may2008Society for Scholarly Publishing

Walk this way: Online content platform migration experiences and collaboration NASIG

What Libraries Still Need from Discovery LayersNational Information Standards Organization (NISO)

Peer Council 2016 Keynote Address with John ChapmanAndrea Coffin

Transforming University Research - Mar 2006Jill Patrick

Lightning talk on MARC records for the Contemporary Composers Web Archive pre...Anna Perricci

'Your Scholarship. Our World. Preserving the Long Tail' by Vicky ReichEDINA, University of Edinburgh

Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...National Information Standards Organization (NISO)

Discovery Service Implementation: What We Wish We Had Known, or Known to AskAndrea Coffin

Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collec...Anna Perricci

Siegman "Creating Accessible Content"National Information Standards Organization (NISO)

PESC-Kirchhoff-ALA Annual 2015 NISO UpdateNational Information Standards Organization (NISO)

Exposing Library Content with the NISO Metasearch XML Gateway ProtocolElectronic Resources & Libraries

2015 NISO Forum: The Future of Library Resource DiscoveryNational Information Standards Organization (NISO)

METRO Conference 2014: How collaboration can save [more of] the web: recent p...Anna Perricci

Mais procurados (16)

292 daniel dollar ssp yale_28_may2008

Walk this way: Online content platform migration experiences and collaboration

What Libraries Still Need from Discovery Layers

Peer Council 2016 Keynote Address with John Chapman

Transforming University Research - Mar 2006

Lightning talk on MARC records for the Contemporary Composers Web Archive pre...

'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich

Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...

Discovery Service Implementation: What We Wish We Had Known, or Known to Ask

Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collec...

Siegman "Creating Accessible Content"

PESC-Kirchhoff-ALA Annual 2015 NISO Update

Exposing Library Content with the NISO Metasearch XML Gateway Protocol

2015 NISO Forum: The Future of Library Resource Discovery

METRO Conference 2014: How collaboration can save [more of] the web: recent p...

Destaque

Some notes on UXRui Lopes

Assistive technologyguest1b791015

Networkingphilco11

On Web Accessibility EnvironmentsRui Lopes

Luottamus digitaalisessa turvallisuudessa yleisöluento jarno limnéll_08032016Jarno Limnéll

Assistive technologyguest1b791015

Mahdollistava turvallisuus Jarno Limnéll Rytminmuutos 13062016Jarno Limnéll

Destaque (7)

Some notes on UX

Assistive technology

Networking

On Web Accessibility Environments

Luottamus digitaalisessa turvallisuudessa yleisöluento jarno limnéll_08032016

Assistive technology

Mahdollistava turvallisuus Jarno Limnéll Rytminmuutos 13062016

Semelhante a W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

Web-Scale Discovery: Post ImplementationRachel Vacek

Spca2014 practical large scale migration guidance v1.0 andries den haanNCCOMMS

Practical large scale migration guidanceAndries den Haan

Web MiningMudit Dholakia

Web miningInnovative Pencils

IRT Unit_4.pptxthenmozhip8

Ir1Tomas Anikevičius

Measuring impactStephen Emmott

Web Archiving – Lessons and PotentialDaniel Gomes

Archiving the French Web: the BnF web archiving workflow. Sara AubryBiblioteca Nacional de España

Scalability andefficiencypresNekoGato

IWMW 2005: Lies, Damn Lies, and Web Statistics (1)IWMW

Introduction to Web Technology by Mahesh SharmaArunima Education Foundation

introduction to web engineering.pdfNaglaaFathy42

Tools. Techniques. Trouble?Testplant

introduction to web engineering.pptxNaglaaFathy42

Subject gateway knowledge organisationAparna Sane

Charleston 2021 - Hit the ground running - Best practices for navigating cont...Matthew Ragucci

Wednesday 6 May: Hand me the data! What you should know as a humanities resea...WARCnet

WELecture01.pptxSheikhBadarUdDinTahi1

Semelhante a W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility (20)

Web-Scale Discovery: Post Implementation

Spca2014 practical large scale migration guidance v1.0 andries den haan

Practical large scale migration guidance

Web Mining

Web mining

IRT Unit_4.pptx

Ir1

Measuring impact

Web Archiving – Lessons and Potential

Archiving the French Web: the BnF web archiving workflow. Sara Aubry

Scalability andefficiencypres

IWMW 2005: Lies, Damn Lies, and Web Statistics (1)

Introduction to Web Technology by Mahesh Sharma

introduction to web engineering.pdf

Tools. Techniques. Trouble?

introduction to web engineering.pptx

Subject gateway knowledge organisation

Charleston 2021 - Hit the ground running - Best practices for navigating cont...

Wednesday 6 May: Hand me the data! What you should know as a humanities resea...

WELecture01.pptx

Último

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea

Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021

AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Understanding the FAA Part 107 License ..Christopher Logan Kennedy

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot

Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea

AI in Action: Real World Use Cases by AnitarajAnitaRaj43

Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37

Exploring Multimodal Embeddings with MilvusZilliz

W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

1. Web Not For All A Large Scale Study of Web Accessibility Rui Lopes1, Daniel Gomes2, Luís Carriço1 1 LaSIGE, University of Lisbon 2 FCCN

2. Context • The Web is the biggest information source for Mankind. Decentralised architecture made it blossom. • Humans (and computers!) contribute to information production and consumption, leading to ~45B Web pages.

3. Context • Growth of users contributing and interacting with the Web leads to significant diversity of users, including people with disabilities. • The openness and decentralisation of the Web leads to an uncontrolled quality check of Websites’ usability (and accessibility).

4. What is the state of accessibility on the Web?

5. • It is known that Web accessibility adequacy is often far worse than desired. • Studies tend to focus on a restricted (small) set of Web sites. • Do macroscopic properties of Web accessibility emerge from analysing at a large scale?

6. Experiment background • The Portuguese Web Archive initiative periodically crawls contents from the Portuguese Web (.pt and others) for future preservation. • Services are built on top of crawled collections: search (end users) & analysis framework (researchers).

7. Methodology data acquisition - obtaining the document collection • Collect a sufficiently large portion of the Web, yet representative (e.g., national Webs) • Spider traps handled gracefully • Boostraped with 200,000 Website addresses from the .pt TLD • Collected March/May 2008

8. Methodology data acquisition - evaluation process • Implementation of 39 WCAG 1.0 checkpoints yield pass, fail, warn. (collection previous to WCAG 2.0 TR) • Overcome computational effort with Hadoop cluster, streams, caching, etc.

9. Methodology data analysis • Failure rate, 3 criteria:

10. Results general • 28M Web pages were evaluated. (58%) • 21GB evaluation data collected for analysis. • 40B HTML elements evaluated. (~1500/page) • 1.5B elements passed. (56/page, 3.89%) • 2.9B elements failed. (103/page, 7.15%) • 36B elements warned. (1291/page, 89%)

11. Results rates versus page count distribution conservative optimistic strict

12. Results rates versus page complexity (# HTML elements) conservative optimistic strict

13. Discussion on the results • Large scale confirms predictions of small scale studies - the Web is still not for all. • Smaller Web pages tend to have greater accessibility quality. • Nature of warnings is more striking than expected, completely different interpretations. • Automated evaluation is just the beginning.

14. Discussion on the limitations of the experiment • HTML structure vs. content rhetorics. (CSS & Javascript can change it all) • Collecting the Web is hard. (deep Web - AJAX & forms -, infinite generation, robots.txt, etc.) • Scaling evaluation & analysis processes is hard. (evaluation streamability, resource inter-dependencies, billion node graphs, etc.)

15. Conclusions • Large scale accessibility evaluation of the Portuguese Web. • Re-confirmed studies at the large. • Educating developers & designers about warnings is crucial for accessibility success! • Automated evaluation is just the start. Always need for expert & users evaluations.

16. Ongoing Work we’re still at the tip of the iceberg • Linking properties (ranking vs. accessibility) • Evolution of accessibility compliance in time (different document collections) • Cross-cuts: gov, e-com, personalisation, etc. • Developing countries countries) (Portuguese speaking African

17. Ongoing Work help wanted from community! • Making available evaluation datasets (e.g., Linked Data). Ours and yours! • Larger document collections. • Transforming warnings into failures with machine learning.

18. Thank you! rlopes@di.fc.ul.pt

W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (16)

Destaque

Destaque (7)

Semelhante a W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

Semelhante a W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility (20)

Último

Último (20)

W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility