SlideShare uma empresa Scribd logo
1 de 22
2009   11 26
               1
7000.00




6000.00




5000.00




4000.00




3000.00




2000.00




1000.00




     0
          1

              2
✤




    ✤




    ✤




✤




    ✤




    ✤


        3
Credibility Degree   0.4




                           4
Wikipedia




            5
Wikipedia




            5
Adler et al. WikiSym ’08, WWW 2007

                          ✤


               Author A


                              ✤


              Author B
                          ✤




               Author C


Lifetime = lifetime
                                                           6
Lifetime

 50
                                        OK
37.5

                                       NG
 25


12.5


  0
       1   2   3   4   5   6   7   8    9    10

                                                  7
60    =1       =167       =7   =1
✤




    ✤        Wikipedia      : 60

        ✤                          950,548   ,   300 GB

    ✤   Wikipedian   : 70

    ✤          :     1                              11,625




                                                                   8
✤                                 (kBytes)
                          3,000


                          2,250
    ✤   80%
                          1,500
              (Ziph   )
                           750
    ✤   20%
        80%                  0
                                             ID


                                                  9
✤




✤



                                                                 10       (x 1000 editors)
✤
                                                                  9


                                                                  8


                                                                  7




            Number of Editors
                                                                  6
✤

Adler                            Uncredible editors
                                                                  5


                                                                  4
                                                                                             Credible editors



                                                                  3


                                                                  2


                                                                  1

                                                                  0

                                -10     -8       -6   -4   -2         0         2        4   6        8        10
                                                                                                          (degrees)
                                                            Reliability Degree




        ?                                                                                                             10
1.

2.

3.

4.




     11
1.

2.

3.

4.




     ?
         12
1.

2.

3.

4.   =       x -log




         ?
                      13
1


                                          0.9


                                          0.8
Spearman's rank correlation coefficient




                                          0.7


                                          0.6


                                          0.5


                                          0.4
                                                                                                         Amount of description
                                          0.3
                                                                                                         Number of articles
                                          0.2
                                                                                                         Amount of desciption
                                          0.1                                                            and Number of articles


                                           0
                                                0   10   20   30       40       50       60         70       80      90       100

                                                                   Reduction ratio of editors (%)




                                                                                                                                    14
✤




    ✤




    ✤




✤




    ✤




    ✤

        15
✤




    ✤                Wikipedia

        ✤   85,028               13.6%)   705,713         (Bot   )

✤




    ✤                                         98    443     )


                                                                     16
1,000,000


                         900,000


                         800,000


                         700,000
Calculation time (ms)




                         600,000


                         500,000


                         400,000
                                                                                          Amount of description

                         300,000
                                                                                          Number of articles
                         200,000
                                                                                     Amount of description and
                         100,000                                                     number of articles


                               0
                                    0   10   20   30    40       50      60          70      80      90        100

                                                   Decreasing ratio of editors (%)




                                                                                                                     17
10


                           9


                           8    40%
                           7                                                        Amount of description
Averaging Increaced Rank




                                                                                    and number of articles
                           6
                                Amount of description
                           5


                           4


                           3


                           2
                                                        Number of articles
                           1


                           0

                                            Important Editor Identification Method

                                                                                                             18
✤



    ✤



✤



    ✤           2%

    ✤



        ✤



            ✤

                     19
✤




    ✤




    ✤




✤




    ✤




    ✤

        20
21

Mais conteúdo relacionado

Semelhante a Wikipedia ws

2010年7月合同研究会
2010年7月合同研究会2010年7月合同研究会
2010年7月合同研究会
Yu Suzuki
 
Lean principles and practices
Lean principles and practicesLean principles and practices
Lean principles and practices
Jelle Bens
 
Creating Histograms from Data Stream via MapReduce
Creating Histograms from Data Stream via MapReduceCreating Histograms from Data Stream via MapReduce
Creating Histograms from Data Stream via MapReduce
DataWorks Summit
 
Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...
Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...
Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...
TERN Australia
 
ST.Monteiro-EmbeddedFeatureSelection.pdf
ST.Monteiro-EmbeddedFeatureSelection.pdfST.Monteiro-EmbeddedFeatureSelection.pdf
ST.Monteiro-EmbeddedFeatureSelection.pdf
grssieee
 

Semelhante a Wikipedia ws (20)

Machine learning projects with r
Machine learning projects with rMachine learning projects with r
Machine learning projects with r
 
VaR of Operational Risk
VaR of Operational RiskVaR of Operational Risk
VaR of Operational Risk
 
Why we don’t know how many colors there are
Why we don’t know how many colors there areWhy we don’t know how many colors there are
Why we don’t know how many colors there are
 
MeasureWorks - Velocity Conference Europe - Performance Automation 101
MeasureWorks  - Velocity Conference Europe - Performance Automation 101MeasureWorks  - Velocity Conference Europe - Performance Automation 101
MeasureWorks - Velocity Conference Europe - Performance Automation 101
 
2010年7月合同研究会
2010年7月合同研究会2010年7月合同研究会
2010年7月合同研究会
 
Towards Probabilistic Assessment of Modularity
Towards Probabilistic Assessment of ModularityTowards Probabilistic Assessment of Modularity
Towards Probabilistic Assessment of Modularity
 
SPICE MODEL of 2SK2508 (Professional+BDP Model) in SPICE PARK
SPICE MODEL of 2SK2508 (Professional+BDP Model) in SPICE PARKSPICE MODEL of 2SK2508 (Professional+BDP Model) in SPICE PARK
SPICE MODEL of 2SK2508 (Professional+BDP Model) in SPICE PARK
 
Lean principles and practices
Lean principles and practicesLean principles and practices
Lean principles and practices
 
Creating Histograms from Data Stream via MapReduce
Creating Histograms from Data Stream via MapReduceCreating Histograms from Data Stream via MapReduce
Creating Histograms from Data Stream via MapReduce
 
Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...
Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...
Corey Bradshaw_Assessing bias in extinction predictions from species-area rel...
 
9th ICCS Noordwijkerhout
9th ICCS Noordwijkerhout9th ICCS Noordwijkerhout
9th ICCS Noordwijkerhout
 
Cache Creek Oklahoma Executive Summary
Cache Creek Oklahoma Executive SummaryCache Creek Oklahoma Executive Summary
Cache Creek Oklahoma Executive Summary
 
ST.Monteiro-EmbeddedFeatureSelection.pdf
ST.Monteiro-EmbeddedFeatureSelection.pdfST.Monteiro-EmbeddedFeatureSelection.pdf
ST.Monteiro-EmbeddedFeatureSelection.pdf
 
SPICE MODEL of 2SK2508 (Standard+BDS Model) in SPICE PARK
SPICE MODEL of 2SK2508 (Standard+BDS Model) in SPICE PARKSPICE MODEL of 2SK2508 (Standard+BDS Model) in SPICE PARK
SPICE MODEL of 2SK2508 (Standard+BDS Model) in SPICE PARK
 
Top Application Performance Landmines
Top Application Performance LandminesTop Application Performance Landmines
Top Application Performance Landmines
 
Characteristics of the kinase mutant TPK2 in bioreactors
Characteristics of the kinase mutant TPK2 in bioreactorsCharacteristics of the kinase mutant TPK2 in bioreactors
Characteristics of the kinase mutant TPK2 in bioreactors
 
Memcached
MemcachedMemcached
Memcached
 
DCT_TR802
DCT_TR802DCT_TR802
DCT_TR802
 
DCT_TR802
DCT_TR802DCT_TR802
DCT_TR802
 
DCT_TR802
DCT_TR802DCT_TR802
DCT_TR802
 

Mais de Yu Suzuki

F3-2: 信頼度を考慮した知識の構造化
F3-2: 信頼度を考慮した知識の構造化F3-2: 信頼度を考慮した知識の構造化
F3-2: 信頼度を考慮した知識の構造化
Yu Suzuki
 
DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法
DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法
DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法
Yu Suzuki
 

Mais de Yu Suzuki (6)

4歳の子供を連れて学会に参加してみた
4歳の子供を連れて学会に参加してみた4歳の子供を連れて学会に参加してみた
4歳の子供を連れて学会に参加してみた
 
Wikipedia における情報の質
Wikipedia における情報の質Wikipedia における情報の質
Wikipedia における情報の質
 
Wikimedia Conference Japan 2013 情報の信頼度測定
Wikimedia Conference Japan 2013 情報の信頼度測定Wikimedia Conference Japan 2013 情報の信頼度測定
Wikimedia Conference Japan 2013 情報の信頼度測定
 
Wikisym 2012
Wikisym 2012 Wikisym 2012
Wikisym 2012
 
F3-2: 信頼度を考慮した知識の構造化
F3-2: 信頼度を考慮した知識の構造化F3-2: 信頼度を考慮した知識の構造化
F3-2: 信頼度を考慮した知識の構造化
 
DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法
DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法
DEIM F1-1: CredibilityRank: 編集履歴と著者情報を用いたWikipedia の記事信頼度算出手法
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Wikipedia ws

  • 1. 2009 11 26 1
  • 3. ✤ ✤ ✤ ✤ ✤ 3
  • 7. Adler et al. WikiSym ’08, WWW 2007 ✤ Author A ✤ Author B ✤ Author C Lifetime = lifetime 6
  • 8. Lifetime 50 OK 37.5 NG 25 12.5 0 1 2 3 4 5 6 7 8 9 10 7
  • 9. 60 =1 =167 =7 =1 ✤ ✤ Wikipedia : 60 ✤ 950,548 , 300 GB ✤ Wikipedian : 70 ✤ : 1 11,625 8
  • 10. (kBytes) 3,000 2,250 ✤ 80% 1,500 (Ziph ) 750 ✤ 20% 80% 0 ID 9
  • 11. ✤ ✤ 10 (x 1000 editors) ✤ 9 8 7 Number of Editors 6 ✤ Adler Uncredible editors 5 4 Credible editors 3 2 1 0 -10 -8 -6 -4 -2 0 2 4 6 8 10 (degrees) Reliability Degree ? 10
  • 13. 1. 2. 3. 4. ? 12
  • 14. 1. 2. 3. 4. = x -log ? 13
  • 15. 1 0.9 0.8 Spearman's rank correlation coefficient 0.7 0.6 0.5 0.4 Amount of description 0.3 Number of articles 0.2 Amount of desciption 0.1 and Number of articles 0 0 10 20 30 40 50 60 70 80 90 100 Reduction ratio of editors (%) 14
  • 16. ✤ ✤ ✤ ✤ ✤ 15
  • 17. ✤ Wikipedia ✤ 85,028 13.6%) 705,713 (Bot ) ✤ ✤ 98 443 ) 16
  • 18. 1,000,000 900,000 800,000 700,000 Calculation time (ms) 600,000 500,000 400,000 Amount of description 300,000 Number of articles 200,000 Amount of description and 100,000 number of articles 0 0 10 20 30 40 50 60 70 80 90 100 Decreasing ratio of editors (%) 17
  • 19. 10 9 8 40% 7 Amount of description Averaging Increaced Rank and number of articles 6 Amount of description 5 4 3 2 Number of articles 1 0 Important Editor Identification Method 18
  • 20. ✤ ✤ ✤ 2% ✤ ✤ ✤ 19
  • 21. ✤ ✤ ✤ ✤ ✤ 20
  • 22. 21

Notas do Editor

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n