SlideShare uma empresa Scribd logo
1 de 20
Baixar para ler offline
From libre software to Wikipedia:
 A tour of open collaboration




Felipe Ortega
Libresoft, Universidad Rey Juan Carlos
e-mail: jfelipe@libresoft.es
Twitter | Identi.ca: @jfelipe

Xerox PARC
June 14, 2011
                                         By Diego GrezCC-BY-SA 3.0, Wikimedia Commons
© 2011 Felipe Ortega.
                                          Some rights reserved.
                              This document is licensed under a
Creative Commons Attribution-ShareAlike 3.0 Unported License
 (Logos on first slide are (TM) of their respective organizations)
Open collaboration
“Think of how Wikipedia works, how Amazon harnesses
user annotation on its site, the way photo-sharing sites
like Flickr are bleeding out into other applications...
We're entering an era in which software learns from
its users and all of the users are connected”.

Tim O'Reilly.
TIME Magazine, 24 October 2005.




                                                By Felipe Ortega, CC-BY-SA 3.0
In the beginning...


●   ...all started with “real programmers” and FLOSS.
    ●   FSF, GNU, free licenses.
    ●   Open source goes into industry.
    ●   Libre software becomes ubiquitous.
●   However
    ●   Crowdsourced ! = Open source
    ●   Much betters if results encourage reusing and
        distribution of derivative works.
The “paradox” of open collaboration



“Wikipedia is the best thing ever. Anyone in the world can
write anything they want about any subject, so you know
you are getting the best possible information.”.

Michael Scott (played by Steve Carell)
The Office, "The Negotiation" [3.18], 5 April 2007
3 lessons from libre software



●   Onion model.
●   Generational relay.
●   Lasting participation.         By El_T, Public Domain,
                                from Wikimedia Commons
Onion model

The Social Structure of Free and Open Source Software Development
Crowston & Howison, 2005
Generational relay




      Robles, González-Barahona.
      Contributor Turnover in Libre Software Projects.
      OSS 2006.
Lasting participation


●   Robles, González-Barahona and Michlmayr.
    Evolution of Volunteer Participation in Libre Software
    Projects: Evidence from Debian. OSS 2005.



    Half-life ratio = 7.5 years!


+50% maintainers in Debian 2.0 still present in Debian 3.1
Thesis. Wikipedia: A quantitative
analysis.

●   Apply lessons from libre software to under-
    stand open collaborative process in Wikipedia.
    ●   Content production.
    ●   Effort distribution.
    ●   Implications for quality.
    ●   Participation and sustainability.
Tool: WikiXRay

Automated analysis of Wikipedia dumps.
http://git.libresoft.es/WikiXRay




                                      Download
                                                  Local MySQL
Wikimedia Download   Compressed        dumps
                                                     Server
      Center          DB dumps
                                      WIKIXRAY




Results evaluation   Analysis (scripts + GNU R)   Preparation for
                                                   data mining
New articles created in Wikipedia




                Entered steady-state in 2006,
                before graph of monthly edits
                    became stable (2007)
Interaction: talk pages

100%

90%

80%

70%

60%

50%                                                           no-talk
40%
                                                              talk

30%

20%

10%

 0%
       EN   DE   FR   PL   JA   NL   IT   PT   ES   SV

                           0.0086% (old talk pages deleted)
Contributions per editor

                    ●   Upper truncated Pareto
                        distribution.
                    ●   Limit in max. number of
                        revisions by human
                        editors.
                    ●   Better to have more
                        editors rather than
                        increasing contributions
                        per editor.
Effort distribution: Gini coefficient
Monthly effort distribution Wikipedia




                   Constant over the whole history!
              Ortega, F., González-Barahona, J., Robles, G.
              On the inequality of contributions to Wikipedia.
              HICSS 2008.
Profile editors in Featured Articles

●   Most Featured Articles are at least 1,000 days old.
●   10 times more editors in FAs than in non-FAs,
    almost 200 times in EN (!!).
●   FAs reviewed by significantly older authors
    (+3 years actively contributing to Wikipedia).


         FAs                                   non-FAs
The Digital Potlatch


●   Book with J. Rodríguez (in Spanish).
    ●   Ed. Cátedra, expected September 2011.
●   Interdisciplinary.
    ●   Anthropology + Engineering.
●   Meritocracy in Wikipedia.
●   Effort recognition.
●   Motivations.
●   Implications for quality.
                                        Public Domain, from Wikimedia Commons
Future lines of work


●   Study causes of change in
    evolution patterns and reverts.
    ●   “The singularity is not near”       By Bios, CC-BY-SA 3.0, from
                                                    Wikimedia Commons

        ASC @PARC, WikiSym 2009.
●   Edit diffs to study contribution patterns.
●   Different types of content.
●   Cross-relation with traffic patterns.

Mais conteúdo relacionado

Semelhante a Parc floss-wikipedia

Contropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyContropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit history
David Laniado
 
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
vbrant
 
Wmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdfWmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdf
Wikimedia Foundation
 
Collaborative Ontology Building Project
Collaborative Ontology Building Project  Collaborative Ontology Building Project
Collaborative Ontology Building Project
Jie Bao
 
Towards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaTowards a diversity-minded Wikipedia
Towards a diversity-minded Wikipedia
RENDER project
 

Semelhante a Parc floss-wikipedia (20)

Contropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyContropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit history
 
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
 
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
 
Editing Behavior over Time Power vs. Standard Wikidata Editors
Editing Behavior over Time  Power vs. Standard Wikidata EditorsEditing Behavior over Time  Power vs. Standard Wikidata Editors
Editing Behavior over Time Power vs. Standard Wikidata Editors
 
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
WP6 Overview: From prototypes to industry standards: Markup, semantic enhance...
 
Wmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdfWmf wikimedia conference japan feb 3 en pdf
Wmf wikimedia conference japan feb 3 en pdf
 
Practical Open Source Software for Libraries (part 1)
Practical Open Source Software for Libraries (part 1)Practical Open Source Software for Libraries (part 1)
Practical Open Source Software for Libraries (part 1)
 
Free For All: Getting Started in Open Source
Free For All: Getting Started in Open SourceFree For All: Getting Started in Open Source
Free For All: Getting Started in Open Source
 
Collaborative Ontology Building Project
Collaborative Ontology Building Project  Collaborative Ontology Building Project
Collaborative Ontology Building Project
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
BCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaarBCcampus a-great-babbling-bazaar
BCcampus a-great-babbling-bazaar
 
Open Source: Freedom and Community
Open Source: Freedom and CommunityOpen Source: Freedom and Community
Open Source: Freedom and Community
 
Wikisource - Where we are, where we want to go
Wikisource  - Where we are, where we want to go Wikisource  - Where we are, where we want to go
Wikisource - Where we are, where we want to go
 
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
Which tools to manage a medium-sized version of Wikipedia? Arabic Wikipedia a...
 
Wanted: Best Practices for Collaborative Translation
Wanted: Best Practices for Collaborative TranslationWanted: Best Practices for Collaborative Translation
Wanted: Best Practices for Collaborative Translation
 
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
 
OpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructureOpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructure
 
Reciprocal Enrichment between Wikipedia and Machine Translators
Reciprocal Enrichment between Wikipedia and Machine TranslatorsReciprocal Enrichment between Wikipedia and Machine Translators
Reciprocal Enrichment between Wikipedia and Machine Translators
 
Towards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaTowards a diversity-minded Wikipedia
Towards a diversity-minded Wikipedia
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

Parc floss-wikipedia

  • 1. From libre software to Wikipedia: A tour of open collaboration Felipe Ortega Libresoft, Universidad Rey Juan Carlos e-mail: jfelipe@libresoft.es Twitter | Identi.ca: @jfelipe Xerox PARC June 14, 2011 By Diego GrezCC-BY-SA 3.0, Wikimedia Commons
  • 2. © 2011 Felipe Ortega. Some rights reserved. This document is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License (Logos on first slide are (TM) of their respective organizations)
  • 4. “Think of how Wikipedia works, how Amazon harnesses user annotation on its site, the way photo-sharing sites like Flickr are bleeding out into other applications... We're entering an era in which software learns from its users and all of the users are connected”. Tim O'Reilly. TIME Magazine, 24 October 2005. By Felipe Ortega, CC-BY-SA 3.0
  • 5. In the beginning... ● ...all started with “real programmers” and FLOSS. ● FSF, GNU, free licenses. ● Open source goes into industry. ● Libre software becomes ubiquitous. ● However ● Crowdsourced ! = Open source ● Much betters if results encourage reusing and distribution of derivative works.
  • 6. The “paradox” of open collaboration “Wikipedia is the best thing ever. Anyone in the world can write anything they want about any subject, so you know you are getting the best possible information.”. Michael Scott (played by Steve Carell) The Office, "The Negotiation" [3.18], 5 April 2007
  • 7. 3 lessons from libre software ● Onion model. ● Generational relay. ● Lasting participation. By El_T, Public Domain, from Wikimedia Commons
  • 8. Onion model The Social Structure of Free and Open Source Software Development Crowston & Howison, 2005
  • 9. Generational relay Robles, González-Barahona. Contributor Turnover in Libre Software Projects. OSS 2006.
  • 10. Lasting participation ● Robles, González-Barahona and Michlmayr. Evolution of Volunteer Participation in Libre Software Projects: Evidence from Debian. OSS 2005. Half-life ratio = 7.5 years! +50% maintainers in Debian 2.0 still present in Debian 3.1
  • 11. Thesis. Wikipedia: A quantitative analysis. ● Apply lessons from libre software to under- stand open collaborative process in Wikipedia. ● Content production. ● Effort distribution. ● Implications for quality. ● Participation and sustainability.
  • 12. Tool: WikiXRay Automated analysis of Wikipedia dumps. http://git.libresoft.es/WikiXRay Download Local MySQL Wikimedia Download Compressed dumps Server Center DB dumps WIKIXRAY Results evaluation Analysis (scripts + GNU R) Preparation for data mining
  • 13. New articles created in Wikipedia Entered steady-state in 2006, before graph of monthly edits became stable (2007)
  • 14. Interaction: talk pages 100% 90% 80% 70% 60% 50% no-talk 40% talk 30% 20% 10% 0% EN DE FR PL JA NL IT PT ES SV 0.0086% (old talk pages deleted)
  • 15. Contributions per editor ● Upper truncated Pareto distribution. ● Limit in max. number of revisions by human editors. ● Better to have more editors rather than increasing contributions per editor.
  • 17. Monthly effort distribution Wikipedia Constant over the whole history! Ortega, F., González-Barahona, J., Robles, G. On the inequality of contributions to Wikipedia. HICSS 2008.
  • 18. Profile editors in Featured Articles ● Most Featured Articles are at least 1,000 days old. ● 10 times more editors in FAs than in non-FAs, almost 200 times in EN (!!). ● FAs reviewed by significantly older authors (+3 years actively contributing to Wikipedia). FAs non-FAs
  • 19. The Digital Potlatch ● Book with J. Rodríguez (in Spanish). ● Ed. Cátedra, expected September 2011. ● Interdisciplinary. ● Anthropology + Engineering. ● Meritocracy in Wikipedia. ● Effort recognition. ● Motivations. ● Implications for quality. Public Domain, from Wikimedia Commons
  • 20. Future lines of work ● Study causes of change in evolution patterns and reverts. ● “The singularity is not near” By Bios, CC-BY-SA 3.0, from Wikimedia Commons ASC @PARC, WikiSym 2009. ● Edit diffs to study contribution patterns. ● Different types of content. ● Cross-relation with traffic patterns.