Putting the world’s
                 cultural heritage online
                   with crowdsourcing
                                      Frederick Zarndt
                                          sponsored by
                          CCS / Digital Divide Data / DL Consulting




Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia.

Saturday, August 11, 12
Crowds
The Wisdom of Crowds

             In 2004 James Surowiecki published “The Wisdom
             of Crowds: Why the Many Are Smarter Than the
             Few and How Collective Wisdom Shapes Business,
             Economies, Societies and Nations”. In it he asserts

                          a crowd of persons that are diverse,
                          independent, and decentralized usually make
                          better judgements or decisions than single
                          persons


“crowdsourcing”

              was coined by Jeff Howe in “The rise of
              crowdsourcing” published in Wired magazine June
              2006.




A Google advanced search for
                      “crowdsourcing” from 1-Jun-2006, the date
                       of publication of Jeff Howe’s Wired magazine
                         article, to 1-Jun-2007 gives 44,600 hits.
                      A date range of 1-Jun-2011 to 1-Jun-2012 gives
                                     2,680,000 hits.




Crowdsourcing is a process that
               involves outsourcing tasks to a distributed
               group of people. ... the difference between
               crowdsourcing and ordinary outsourcing is
               that a task or problem is outsourced to an
               undefined public rather than a specific
               body, such as paid employees.



Wikipedia contributors, "Crowdsourcing," Wikipedia, The Free Encyclopedia, http://en.wikipedia.org/wiki/Crowdsourcing (accessed June 1, 2012)
Crowdsourcing is a type of participative online activity in
            which an individual, an institution, a non-profit
            organization, or company proposes to a group of individuals
            of varying knowledge, heterogeneity, and number, via a
            flexible open call, the voluntary undertaking of a task. The
            undertaking of the task, of variable complexity and
            modularity, and in which the crowd should participate
            bringing their work, money, knowledge and/or experience,
            always entails mutual benefit. The user will receive the
            satisfaction of a given type of need, be it economic, social
            recognition, self-esteem, or the development of individual
            skills, while the crowdsourcer will obtain and utilize to their
            advantage that what the user has brought to the venture,
            whose form will depend on the type of activity undertaken.



Enrique Estellés-Arolas and Fernando González-Ladrón-de-Guevara. Towards an integrated crowdsourcing definition. Journal of Information Science XX(X). 2012. pp. 1-14.
[Diagram: "crowd*" terms: crowdsourcing, crowdcollaboration, crowdfunding, crowdcasting, crowdvoting, citizen science]
crowdsourcing




                                   Amazon Mechanical Turk was launched Nov 2005
                          Alexa global rank of Amazon Mechanical Turk (13-Jun-2012): 6,022
crowdsourcing




                   Each day 200,000,000 reCAPTCHAs are solved by humans around the world
crowdvoting

                                Iowa Electronic Market was 1st
                                launched in 1995

                                Alexa global traffic rank of Iowa
                                Electronic Market (6-Aug-2012):
                                11,290

                                Alexa US traffic rank of Iowa
                                Electronic Market (6-Aug-2012):
                                3,923




citizen science




                                     Galaxy Zoo was 1st launched July 2007
                          Alexa global traffic rank of Galaxy Zoo (13-Jun-2012): 557,766
crowdfunding




                                      Kickstarter was 1st launched in 2009
                           Alexa global traffic rank of Kickstarter (6-Aug-2012): 752
                     27,528 projects successfully funded with more than USD $254,000,000
crowdcollaboration




Wikipedia

                 •        Began 2001

                 •        Now in 285 languages

                 •        3,900,000+ articles in English, 1,400,000+ in German, 1,250,000+ in
                          French, 1,050,000 in Dutch

                 •        40 Wikipedia languages with more than 100,000 articles

                 •        112 Wikipedia languages with more than 10,000 articles

                 •        400,000,000 unique visitors per month

                 •        85,000 active contributors

                 •        Alexa global traffic rank: #6 in worldwide web traffic




FamilySearch Indexing was 1st launched (beta) 2004
                          Alexa global traffic rank of FamilySearch (13-Jun-2012): 4,419
• Started (beta) 2004
             • More than 780,000 worldwide registered volunteers
               from ~25 countries index records relevant to family
               history
             • Approximately 100,000 active volunteers each month
             • UI in Chinese, English, German, French, Italian,
               Japanese, Korean, Portuguese, and Russian
             • Blind double-key entry with arbitration / reconciliation
             • More than 1,500,088,741 records indexed (July 2012)
             • Accuracy typically > 99.95%
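
The "blind double-key entry with arbitration / reconciliation" step in the list above can be sketched roughly as follows. This is only an illustration of the general technique, not FamilySearch's actual software: the function `reconcile`, the dictionary record layout, and the `prefer_longer` stand-in for a human arbitrator are all hypothetical. Two volunteers key the same record without seeing each other's work; fields that match are accepted, and fields that differ are referred to an arbitrator.

```python
from typing import Callable, Dict

# Hypothetical types for illustration; not FamilySearch's real data model.
Record = Dict[str, str]
Arbitrator = Callable[[str, str, str], str]


def reconcile(key_a: Record, key_b: Record, arbitrate: Arbitrator) -> Record:
    """Merge two independent transcriptions of the same record.

    Fields where both keyers agree are accepted as-is. Fields where they
    differ (or one keyer left the field out) are passed to the arbitrator,
    who returns the value to keep.
    """
    merged: Record = {}
    for field in sorted(set(key_a) | set(key_b)):
        value_a = key_a.get(field, "")
        value_b = key_b.get(field, "")
        if value_a == value_b:
            merged[field] = value_a
        else:
            merged[field] = arbitrate(field, value_a, value_b)
    return merged


def prefer_longer(field: str, value_a: str, value_b: str) -> str:
    """Toy stand-in for a human arbitrator: keep the longer reading."""
    return value_a if len(value_a) >= len(value_b) else value_b


volunteer_1 = {"surname": "Arndt", "year": "1887"}
volunteer_2 = {"surname": "Arnd", "year": "1887"}
result = reconcile(volunteer_1, volunteer_2, prefer_longer)
print(result)
```

In a real project the arbitrator would be a third volunteer looking at the source image; the point of the sketch is only that disagreement between two blind keyings is what triggers review.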
Project Gutenberg was 1st launched Dec 1971
                          Alexa global traffic rank of Project Gutenberg (13-Jun-2012): 5,744
• Started Dec 1971
             • Worldwide volunteers transcribe or proofread OCR’d
               public domain books through Distributed Proofreaders
             • 40,000 books completed (July 2012)
             • Partner / affiliated projects for Australia, Canada,
               Europe, Germany, Luxembourg, Philippines, Runeberg
               (Nordic literature), Russia, Taiwan



National Library of
                               Australia
             • Online since 2008
             • 7,200,000+ pages
             • Top text corrector 1,250,000 lines (June 2012)
             • 2,450,000+ lines corrected each month (average
               for 1st 6 months 2012)
             • 68,908,757 lines corrected as of July 2012, up
               from 42,411,468 lines corrected July 2011.
             • 63,613 total registered users (July 2012)
             • 4,146 active users (June 2012)
California Digital
                   Newspaper Collection
                 • CDNC began digitizing newspapers in 2005 as
                   part of NDNP
                 • Hosted on Veridian beginning 2009
                 • Currently ~500,000 pages
                 • User OCR correction added August 2011
                 • ~395,000 lines of text corrected (July 2012)
                 • Top corrector: 155,000 lines, more than twice the second-ranked corrector

Motivation
Graphic from Kaufmann et al. “More than fun and money. Worker Motivation in Crowdsourcing – A Study on Mechanical Turk.”
Cognitive surplus

               ... people are learning to use their free time for creative
               activities rather than consumptive ones [such as watching
               TV] ...

               ... the total human cognitive effort in creating all of
               Wikipedia in every language is about one hundred million
               hours ...

               ... Americans alone watch two hundred billion hours of TV
               every year, or enough time, if it would be devoted to projects
               similar to Wikipedia, to create about 2000 of them.


Clay Shirky. Cognitive surplus: Creativity and generosity in a connected age. Penguin Press. New York. 2010.
Motivation
                          Genealogists and family historians


                          • National Library of Australia guesses that
                            ~80% of users of Trove digitized newspapers
                            are family historians
                          • National Library of New Zealand survey
                            found that ~50% of PapersPast users are
                            genealogists
                          • California Digital Newspaper Collection
                            survey found that ~70% of its users are
                            genealogists; 75% are 50 years old or older

Motivation
                                 Trove users’ report


            • “I enjoy the correction - it’s a great way to learn more
            about past history and things of interest whilst doing a
            ‘service to the community’ by correcting text for the benefit
            of others.”
            • “I have recently retired from IT and thought that I could be
            of some assistance to the project. It benefits me and other
            people. It helps with family research.”




From Rose Holley in “Many Hands Make Light Work.” National Library of Australia. March 2009.
Motivation
                                  CDNC users’ report

         • “I am interested in all kinds of history. I have pursued genealogy as
         a hobby for many years. I correct text at CDNC because I see it as a
         constructive way to contribute to a worthwhile project. Because I am
         interested in history, I enjoy it.”

         • “I only correct the text on articles of local interest - nothing at state,
         national or international level, no advertisements, etc.  The objective
         is to be able to help researchers to locate local people, places,
         organizations and events using the on-line search at CDNC.  I correct
         local news & gossip, personal items, real estate transactions, superior
         court proceedings, county and local board of supervisors meetings,
         obituaries, birth notices, marriages, yachting news, etc.”


Personal communication with CDNC text correctors.
Website traffic




Website traffic

            After a crowdsourcing transcription project of diaries from the
            American War Between the States, Nicole Saylor, Head of Digital
            Library Services at the University of Iowa Libraries, reported



                          “On June 9, 2011, we went from about 1000
                          daily hits to our digital library on a really good
                          day to more than 70,000.”



Website traffic
            Changes in website traffic at CDNC after implementing
            crowdsourcing were not as dramatic as those at the University
            of Iowa Libraries


                    11-Jun-2011 to 12-Jul-2011   11-Jun-2012 to 12-Jul-2012   change
visits              16,934                       20,758                       +22.6%
unique visitors     11,030                       12,951                       +17.4%
visit duration      9m 24s                       11m 6s                       +18.1%
bounce rate         51.3%                        44.7%                        -12.9%
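
The "change" column can be reproduced from the two period columns. The following is a minimal sketch; the helper names `percent_change` and `to_seconds` are chosen here for illustration, and the figures are copied from the table above.

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


def to_seconds(minutes: int, seconds: int) -> int:
    """Convert a duration given as minutes and seconds to seconds."""
    return minutes * 60 + seconds


# (2011 period, 2012 period) pairs copied from the table above.
metrics = {
    "visits": (16_934, 20_758),
    "unique visitors": (11_030, 12_951),
    "visit duration (s)": (to_seconds(9, 24), to_seconds(11, 6)),
    "bounce rate (%)": (51.3, 44.7),
}

changes = {
    name: round(percent_change(before, after), 1)
    for name, (before, after) in metrics.items()
}
for name, change in changes.items():
    print(f"{name}: {change:+.1f}%")
```

Note that the bounce-rate figure is a relative change: the rate fell by 6.6 percentage points, which is 12.9% of the starting value of 51.3%.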




Crowdsourcing
                             benefits




Public domain photo courtesy of US Navy
$
                              Economics

                   Financial value of OCR text correction?
                   Assumptions
           • 25 to 50 characters per line in a newspaper column:
             Assume 35 characters per line
           • Outsourced text transcription or correction costs USD
             $0.35 to $1.20 per 1000 characters: Assume $0.50
             per 1000 characters


€
                             Economics


                 • CDNC: 394,365 lines x 35 characters per line x
                   1/1000 x $0.50 = $6,901 $$
                 • Trove: 69,918,892 lines x 35 characters per
                   line x 1/1000 x $0.50 = $1,223,581 $$$$$
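
The same arithmetic as a small sketch, so the assumptions can be varied. The function name `correction_value_usd` is made up for illustration; the defaults are the slide's assumed 35 characters per line and $0.50 per 1000 characters.

```python
def correction_value_usd(
    lines: int,
    chars_per_line: int = 35,
    usd_per_1000_chars: float = 0.50,
) -> float:
    """Estimated cost of outsourcing the same amount of text correction."""
    return lines * chars_per_line / 1000 * usd_per_1000_chars


cdnc = correction_value_usd(394_365)
trove = correction_value_usd(69_918_892)
print(f"CDNC:  ${cdnc:,.0f}")   # CDNC:  $6,901
print(f"Trove: ${trove:,.0f}")  # Trove: $1,223,581

# The stated ranges (25 to 50 characters per line, $0.35 to $1.20 per
# 1000 characters) bound the estimate from below and above.
cdnc_low = correction_value_usd(394_365, 25, 0.35)
cdnc_high = correction_value_usd(394_365, 50, 1.20)
print(f"CDNC range: ${cdnc_low:,.0f} to ${cdnc_high:,.0f}")
```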




Text accuracy

             • Edwin Klijn (Koninklijke Bibliotheek the Netherlands)
             reports raw OCR character accuracies of 68% for early 20th
             century newspapers
             • Rose Holley (National Library of Australia) reports raw
             OCR character accuracy varied from 71% to 98% on a
             sample of Trove digitized newspapers



 Edwin Klijn. “The current state-of-art in newspaper digitization.” D-Lib Magazine. January/February 2008.

 Rose Holley. “How good can it get? Analysing and improving OCR accuracy in large scale historic newspaper digitisation programs.” D-Lib Magazine. March/April 2009.
Text accuracy

             Optimistically assume that average raw OCR
             character accuracy is 90%.
             Average length of an English word is 5
             characters.
             Average word accuracy is 90% x 90% x 90%
             x 90% x 90% = 59% (6 out of 10 words
             correct).
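
The calculation above multiplies the character accuracy once per character, which implicitly assumes that errors in different characters are independent. A minimal sketch under that assumption (the function name `word_accuracy` is chosen here for illustration):

```python
def word_accuracy(char_accuracy: float, word_length: int = 5) -> float:
    """Probability that every character of a word is correct.

    Assumes character errors are independent, so the per-character
    accuracy is simply raised to the power of the word length.
    """
    return char_accuracy ** word_length


raw = word_accuracy(0.90)
print(f"Raw OCR word accuracy: {raw:.0%}")  # Raw OCR word accuracy: 59%
```

Because the accuracy is raised to the fifth power, small gains in character accuracy produce much larger gains in word accuracy, and whole words are what a search engine matches.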

Public domain graphic courtesy of Wikimedia Commons.
Search recall
                             no text correction

[Figure: occurrences of “ARNDT” scattered across a page. Legend: instances of “ARNDT” found / instances of “ARNDT” not found]
Text accuracy


             Assume the crowd corrects OCR text to
             99.5% accuracy.
             Average word accuracy is now 99.5% x
             99.5% x 99.5% x 99.5% x 99.5% = 97.5% (9+
             out of 10 words correct).



Public domain graphic courtesy of Wikimedia Commons.
Search recall
                          with text correction

[Figure: occurrences of “ARNDT” scattered across a page. Legend: instances of “ARNDT” found / instances of “ARNDT” not found]
Benefits

               “when someone transcribes a document, they are
                actually better fulfilling the mission of a cultural
             heritage organization than someone who simply stops
                          by to flip through the pages”




From Trevor Owens’s Crowdstorming blog
http://crowdstorming.wordpress.com/
Benefits

            “in addition to increasing search accuracy or lowering
            the costs of document transcription, crowdsourcing is
           the single greatest advancement in getting people using
                   and interacting with library collections”




Paraphrased from Trevor Owens’s Crowdstorming blog
http://crowdstorming.wordpress.com/
Conclusions
              • Lots of crowdsourcing in cultural heritage
                organizations and elsewhere
              • Benefits are multi-faceted: Economic, data
                accuracy, patron engagement, increased web
                traffic




Conclusion of the Sonata for piano #32, opus 111 by Ludwig van Beethoven
?
                                      Frederick Zarndt
                               frederick@frederickzarndt.com
                                          sponsored by
                          CCS / Digital Divide Data / DL Consulting


Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia.

SlideShare title: 2012-08-14 Crowdsourcing National Digitisation Centre Mikkeli Finland

20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]Frederick Zarndt
 
20130321 Putting the world's cultural heritage online with crowdsourcing [roo...
20130321 Putting the world's cultural heritage online with crowdsourcing [roo...20130321 Putting the world's cultural heritage online with crowdsourcing [roo...
2012-08-14 Crowdsourcing National Digitisation Centre Mikkeli Finland

  • 1. Putting the world’s cultural heritage online with crowdsourcing. Frederick Zarndt. Sponsored by CCS / Digital Divide Data / DL Consulting. Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia. Saturday, August 11, 12
  • 2. Crowds
  • 3. The Wisdom of Crowds. In 2004 James Surowiecki published “The Wisdom of Crowds: Why the Many Are Smarter Than the Few and How Collective Wisdom Shapes Business, Economies, Societies and Nations”. In it he asserts that a crowd of persons who are diverse, independent, and decentralized usually makes better judgements or decisions than single persons.
  • 4. “crowdsourcing” was coined by Jeff Howe in “The rise of crowdsourcing”, published in Wired magazine, June 2006.
  • 5. A Google advanced search for “crowdsourcing” from 1-Jun-2006, the date of publication of Jeff Howe’s Wired magazine article, to 1-Jun-2007 gives 44,600 hits. A date range of 1-Jun-2011 to 1-Jun-2012 gives 2,680,000 hits.
  • 6. Crowdsourcing is a process that involves outsourcing tasks to a distributed group of people. ... the difference between crowdsourcing and ordinary outsourcing is that a task or problem is outsourced to an undefined public rather than a specific body, such as paid employees. Wikipedia contributors, "Crowdsourcing," Wikipedia, The Free Encyclopedia, http://en.wikipedia.org/wiki/Crowdsourcing (accessed June 1, 2012)
  • 7. Crowdsourcing is a type of participative online activity in which an individual, an institution, a non-profit organization, or company proposes to a group of individuals of varying knowledge, heterogeneity, and number, via a flexible open call, the voluntary undertaking of a task. The undertaking of the task, of variable complexity and modularity, and in which the crowd should participate bringing their work, money, knowledge and/or experience, always entails mutual benefit. The user will receive the satisfaction of a given type of need, be it economic, social recognition, self-esteem, or the development of individual skills, while the crowdsourcer will obtain and utilize to their advantage that what the user has brought to the venture, whose form will depend on the type of activity undertaken. Enrique Estellés-Arolas and Fernando González-Ladrón-de-Guevara. Towards an integrated crowdsourcing definition. Journal of Information Science XX(X). 2012. pp. 1-14.
  • 8. crowd*: crowdcollaboration, crowdsourcing, crowdfunding, citizen science, crowdcasting, crowdvoting
  • 9. crowdsourcing: Amazon Mechanical Turk was launched Nov 2005. Alexa global rank of Amazon Mechanical Turk (13-Jun-2012): 6,022
  • 10. crowdsourcing: Each day 200,000,000 reCAPTCHAs are solved by humans around the world
  • 11. crowdvoting: Iowa Electronic Market was first launched in 1995. Alexa global traffic rank of Iowa Electronic Market (6-Aug-2012): 11,290. Alexa US traffic rank of Iowa Electronic Market (6-Aug-2012): 3,923
  • 12. citizen science: Galaxy Zoo was first launched July 2007. Alexa global traffic rank of Galaxy Zoo (13-Jun-2012): 557,766
  • 13. crowdfunding: Kickstarter was first launched in 2008. Alexa global traffic rank of Kickstarter (6-Aug-2012): 752. 27,528 projects successfully funded with more than USD $254,000,000
  • 14. crowdcollaboration
  • 15. Wikipedia
      • Began 2001
      • Now in 285 languages
      • 3,900,000+ articles in English, 1,400,000+ in German, 1,250,000+ in French, 1,050,000 in Dutch
      • 40 Wikipedia languages with more than 100,000 articles
      • 112 Wikipedia languages with more than 10,000 articles
      • 400,000,000 unique visitors per month
      • 85,000 active contributors
      • Alexa global traffic rank: #6 in worldwide web traffic
  • 17. FamilySearch Indexing was first launched (beta) 2004. Alexa global traffic rank of FamilySearch (13-Jun-2012): 4,419
  • 18.
      • Started (beta) 2004
      • More than 780,000 worldwide registered volunteers from ~25 countries index records relevant to family history
      • Approximately 100,000 active volunteers each month
      • UI in Chinese, English, German, French, Italian, Japanese, Korean, Portuguese, and Russian
      • Blind double-key entry with arbitration / reconciliation
      • More than 1,500,088,741 records indexed (July 2012)
      • Accuracy typically > 99.95%
  • 19. Project Gutenberg was first launched Dec 1971. Alexa global traffic rank of Project Gutenberg (13-Jun-2012): 5,744
  • 20.
      • Started Dec 1971
      • Worldwide volunteers transcribe or proofread OCR’d public domain books through Distributed Proofreaders
      • 40,000 books completed (July 2012)
      • Partner / affiliated projects for Australia, Canada, Europe, Germany, Luxembourg, Philippines, Runeberg (Nordic literature), Russia, Taiwan
  • 22. National Library of Australia
      • Online since 2008
      • 7,200,000+ pages
      • Top text corrector: 1,250,000 lines (June 2012)
      • 2,450,000+ lines corrected each month (average for first 6 months of 2012)
      • 68,908,757 lines corrected as of July 2012, up from 42,411,468 lines corrected as of July 2011
      • 63,613 total registered users (July 2012)
      • 4,146 active users (June 2012)
  • 24. California Digital Newspaper Collection
      • CDNC began digitizing newspapers in 2005 as part of NDNP
      • Hosted on Veridian beginning 2009
      • Currently ~500,000 pages
      • User OCR correction added August 2011
      • ~395,000 lines of text corrected (July 2012)
      • Top corrector: 155,000 lines, more than twice the total of the second-ranked corrector
  • 25. Motivation. Graphic from Kaufmann et al. “More than fun and money. Worker Motivation in Crowdsourcing – A Study on Mechanical Turk.”
  • 26. Cognitive surplus. ... people are learning to use their free time for creative activities rather than consumptive ones [such as watching TV] ... ... the total human cognitive effort in creating all of Wikipedia in every language is about one hundred million hours ... ... Americans alone watch two hundred billion hours of TV every year, or enough time, if it would be devoted to projects similar to Wikipedia, to create about 2000 of them. Clay Shirky. Cognitive surplus: Creativity and generosity in a connected age. Penguin Press. New York. 2010.
  • 27. Motivation Genealogists and family historians • National Library of Australia guesses that ~80% of Trove digitized newspapers users are family historians PAPERSPAST • National Library of New Zealand survey found that ~50% of PapersPast users are genealogists • California Digital Newspaper Collection survey found that ~70% of its users are genealogists; 75% are 50 years old or older 27 Saturday, August 11, 12
  • 28. Motivation Trove users’ report • “I enjoy the correction - it’s a great way to learn more about past history and things of interest whilst doing a ‘service to the community’ by correcting text for the benefit of others.” • “I have recently retired from IT and thought that I could be of some assistance to the project. It benefits me and other people. It helps with family research.” From Rose Holley, “Many Hands Make Light Work.” National Library of Australia, March 2009.
  • 29. Motivation CDNC users’ report • “I am interested in all kinds of history. I have pursued genealogy as a hobby for many years. I correct text at CDNC because I see it as a constructive way to contribute to a worthwhile project. Because I am interested in history, I enjoy it.” • “I only correct the text on articles of local interest - nothing at state, national or international level, no advertisements, etc. The objective is to be able to help researchers to locate local people, places, organizations and events using the on-line search at CDNC. I correct local news & gossip, personal items, real estate transactions, superior court proceedings, county and local board of supervisors meetings, obituaries, birth notices, marriages, yachting news, etc.” Personal communication with CDNC text correctors.
  • 30. Website traffic
  • 31. Website traffic After a crowdsourcing transcription project of diaries from the American War Between the States, Nicole Saylor, Head of Digital Library Services at the University of Iowa Libraries, reported “On June 9, 2011, we went from about 1000 daily hits to our digital library on a really good day to more than 70,000.”
  • 32. Website traffic Changes in website traffic at CDNC after implementing crowdsourcing were not as dramatic as for the University of Iowa Libraries:
      Metric            11-Jun-2011 to 12-Jul-2011   11-Jun-2012 to 12-Jul-2012   Change
      Visits            16,934                       20,758                       +22.6%
      Unique visitors   11,030                       12,951                       +17.4%
      Visit duration    9m 24s                       11m 6s                       +18.1%
      Bounce rate       51.3%                        44.7%                        -12.9%
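The "change" figures on slide 32 are relative changes between the two periods, which matters most for the bounce rate: it fell by 6.6 percentage points, which is a 12.9% relative drop. A minimal Python sketch that reproduces the column (the helper name `percent_change` is illustrative, not from the talk):

```python
def percent_change(before, after):
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100

# Figures from slide 32; visit durations are converted to seconds.
visits = percent_change(16_934, 20_758)
unique_visitors = percent_change(11_030, 12_951)
visit_duration = percent_change(9 * 60 + 24, 11 * 60 + 6)
bounce_rate = percent_change(51.3, 44.7)

for name, value in [
    ("visits", visits),
    ("unique visitors", unique_visitors),
    ("visit duration", visit_duration),
    ("bounce rate", bounce_rate),
]:
    print(f"{name}: {value:+.1f}%")
```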
  • 33. Crowdsourcing benefits Public domain photo courtesy of US Navy
  • 34. Economics What is the financial value of OCR text correction? Assumptions • A newspaper column has 25 to 50 characters per line: assume 35 characters per line • Outsourced text transcription or correction costs USD $0.35 to $1.20 per 1000 characters: assume $0.50 per 1000 characters
  • 35. Economics • CDNC: 394,365 lines × 35 characters per line × 1/1000 × $0.50 = $6,901 • Trove: 69,918,892 lines × 35 characters per line × 1/1000 × $0.50 = $1,223,581
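The estimate on slides 34 and 35 can be reproduced in a few lines of Python. This is a minimal sketch: the function name `correction_value` and its parameter names are illustrative and not from the talk, and the defaults are simply the assumptions stated on slide 34.

```python
def correction_value(lines, chars_per_line=35, usd_per_1000_chars=0.50):
    """Estimated cost in USD of outsourcing the correction of `lines` lines."""
    return lines * chars_per_line / 1000 * usd_per_1000_chars

cdnc = correction_value(394_365)
trove = correction_value(69_918_892)
print(f"CDNC:  ${cdnc:,.0f}")   # $6,901
print(f"Trove: ${trove:,.0f}")  # $1,223,581

# Bounds for CDNC using the extremes of the ranges on slide 34.
cdnc_low = correction_value(394_365, chars_per_line=25, usd_per_1000_chars=0.35)
cdnc_high = correction_value(394_365, chars_per_line=50, usd_per_1000_chars=1.20)
print(f"CDNC range: ${cdnc_low:,.0f} to ${cdnc_high:,.0f}")
```

Varying the two assumptions shows how sensitive the headline figure is to the chosen line length and per-character rate.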
  • 36. Text accuracy • Edwin Klijn (Koninklijke Bibliotheek, the Netherlands) reports raw OCR character accuracies of 68% for early 20th century newspapers • Rose Holley (National Library of Australia) reports that raw OCR character accuracy varied from 71% to 98% on a sample of Trove digitized newspapers Edwin Klijn. “The current state-of-art in newspaper digitization.” D-Lib Magazine. January/February 2008. Rose Holley. “How good can it get? Analysing and improving OCR accuracy in large scale historic newspaper digitisation programs.” D-Lib Magazine. March/April 2009.
  • 37. Text accuracy Optimistically assume that average raw OCR character accuracy is 90%. Average length of an English word is 5 characters. Average word accuracy is 90% x 90% x 90% x 90% x 90% = 59% (6 out of 10 words correct). Public domain graphic courtesy of Wikimedia Commons.
  • 38. Search recall, no text correction [Graphic: ten instances of “ARNDT”, each marked as found or not found by search]
  • 39. Text accuracy Assume the crowd corrects OCR text to 99.5% accuracy. Average word accuracy is now 99.5% x 99.5% x 99.5% x 99.5% x 99.5% = 97.5% (9+ out of 10 words correct). Public domain graphic courtesy of Wikimedia Commons.
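The word-accuracy arithmetic on slides 37 and 39 raises the character accuracy to the power of the word length, which implicitly assumes that character errors are independent of one another. A minimal Python sketch under that assumption (the function name `word_accuracy` is illustrative, not from the talk):

```python
def word_accuracy(char_accuracy, word_length=5):
    """Probability that every character of a word is correct,
    assuming character errors are independent."""
    return char_accuracy ** word_length

raw = word_accuracy(0.90)         # raw OCR, slide 37
corrected = word_accuracy(0.995)  # after crowd correction, slide 39
print(f"raw OCR:   {raw:.1%}")        # 59.0%
print(f"corrected: {corrected:.1%}")  # 97.5%

# Expected number of exact-match hits among 10 occurrences of a
# five-letter name such as "ARNDT".
print(f"hits out of 10, raw:       {10 * raw:.1f}")
print(f"hits out of 10, corrected: {10 * corrected:.1f}")
```

Under the same assumption, longer words fare worse: a ten-character word at 90% character accuracy would be fully correct only about a third of the time.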
  • 40. Search recall, with text correction [Graphic: ten instances of “ARNDT”, each marked as found or not found by search]
  • 41. Benefits “when someone transcribes a document, they are actually better fulfilling the mission of a cultural heritage organization than someone who simply stops by to flip through the pages” From Trevor Owens’s Crowdstorming blog http://crowdstorming.wordpress.com/
  • 42. Benefits “in addition to increasing search accuracy or lowering the costs of document transcription, crowdsourcing is the single greatest advancement in getting people using and interacting with library collections” Paraphrased from Trevor Owens’s Crowdstorming blog http://crowdstorming.wordpress.com/
  • 43. Conclusions • Lots of crowdsourcing in cultural heritage organizations and elsewhere • Benefits are multi-faceted: Economic, data accuracy, patron engagement, increased web traffic Conclusion of the Sonata for piano #32, opus 111 by Ludwig van Beethoven
  • 44. ? Frederick Zarndt frederick@frederickzarndt.com sponsored by CCS / Digital Divide Data / DL Consulting Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia.