SlideShare a Scribd company logo
1 of 36
BIG DATA?

  THE GLOBAL IMBALANCE!

        Stéphane Grumbach
              INRIA




                 1
The digital universe
Data deluge in all sectors of activity
	

	

   U.S. Library of Congress: 235 Terabytes of data
	

   Walmart: 2.5 petabytes of data, 1 million customer transactions / hour
	

   Facebook: 30 Petabytes of user data
	

   Google: processing 20 petabytes a day (2008)
	

   World: 5 billion people calling, tweeting, browsing on mobile phones

Exponential increase
	

   doubles every two years
	

   followed by the capacity to store, compute, and communicate


                                                2
The digital universe
Data deluge in all sectors of activity
	

                                                                              kilo 103
	

   U.S. Library of Congress: 235 Terabytes of data
	

   Walmart: 2.5 petabytes of data, 1 million customer transactions / hour
                                                                                mega 106
	

   Facebook: 30 Petabytes of user data
                                                                                 giga 109
	

   Google: processing 20 petabytes a day (2008)
                                                                                tera 1012
	

   World: 5 billion people calling, tweeting, browsing on mobile phones      peta 1015
                                                                                 exa 1018
Exponential increase                                                           zetta 1021
	

   doubles every two years
                                                                               yotta 1024
	

   followed by the capacity to store, compute, and communicate


                                                2
The digital universe 2.7 Zettabytes
Data deluge in all sectors of activity
	

                                                                              kilo 103
	

   U.S. Library of Congress: 235 Terabytes of data
	

   Walmart: 2.5 petabytes of data, 1 million customer transactions / hour
                                                                                mega 106
	

   Facebook: 30 Petabytes of user data
                                                                                 giga 109
	

   Google: processing 20 petabytes a day (2008)
                                                                                tera 1012
	

   World: 5 billion people calling, tweeting, browsing on mobile phones      peta 1015
                                                                                 exa 1018
Exponential increase                                                           zetta 1021
	

   doubles every two years
                                                                               yotta 1024
	

   followed by the capacity to store, compute, and communicate


                                                2
The digital universe 2.7 Zettabytes
Data deluge in all sectors of activity
	

                                                                              kilo 103
	

   U.S. Library of Congress: 235 Terabytes of data
	

   Walmart: 2.5 petabytes of data, 1 million customer transactions / hour
                                                                                mega 106
	

   Facebook: 30 Petabytes of user data
                                                                                 giga 109
	

   Google: processing 20 petabytes a day (2008)
                                                                                tera 1012
	

   World: 5 billion people calling, tweeting, browsing on mobile phones      peta 1015
                                                                                 exa 1018
Exponential increase                                                           zetta 1021
                                                                               yotta 1024
	

   doubles every two years    35 zettabytes in 2020
	

   followed by the capacity to store, compute, and communicate


                                                2
The Big Data Industry



     Advertising

     Capture users data
     Generate users profiles
     Target ads




 3
The Big Data Industry

beyond advertising

•$300 billion/year
    US health care
•€250 billion/year
    Europe public administration
               [McKinsey 2011]


Tremendous economic impact
    Teraeuros (thousands billions)


    4
First challenge: Data Harvesting

70% of the data produced by individuals
  directly produced by users:
  	

 email, photos, blogs, etc. (less than half)
  indirectly digital shadow/footprint:
  	

 surveillance, web usage, transactions




                                    5
First challenge: Data Harvesting

70% of the data produced by individuals
  directly produced by users:
  	

 email, photos, blogs, etc. (less than half)
  indirectly digital shadow/footprint:
  	

 surveillance, web usage, transactions

The free paradigm of the 2.0
  Free services traded for private user data
  Free exploitation of the accumulated data

                                    5
Second challenge: knowledge extraction

User profiles (business)
	

 => Ads target

Automatic discovery (science)
	

 => Google Flu
	

    	

   monitoring of flu related queries
	

    	

   	

   a search engine company knows everything

	

    => Biological, sociological data...


NSA (security)
	

 => Ambition to handle yottabytes (1024) !!!
                                                              6
Second challenge: knowledge extraction

User profiles (business)
	

 => Ads target

Automatic discovery (science)
	

 => Google Flu
	

    	

   monitoring of flu related queries
	

    	

   	

   a search engine company knows everything

	

    => Biological, sociological data...


NSA (security)
	

 => Ambition to handle yottabytes (1024) !!!
                                                              6
Data: raw material of the 21st century
         (much like crude oil)




                    7
Data: raw material of the 21st century
            (much like crude oil)

 extraction
from natural
                                         consumption
               transport       refining
                                           at users
  reservoirs




                           7
Data: raw material of the 21st century
            (much like crude oil)

 extraction
from natural
                                             consumption
                 transport         refining
                                               at users
  reservoirs



               accumulation                  production
   data
                  in large     Internet        of data
 analytics
                repositories                  at users




                               7
Where are these data?




           8
Where are these data?
Huge concentration of data

85% of data handled by (large) corporations
  Virtualization/dematerialization of infrastructures
  Social networks, Cloud, ...


Most of the prominent corporations based in the USA
      Google, Facebook, Amazon, Twitter, ...
      Storage capacity of Europe = 70% USA     [McKinsey 2011]




                                    8
Where are these data?
Huge concentration of data

85% of data handled by (large) corporations
  Virtualization/dematerialization of infrastructures
  Social networks, Cloud, ...


Most of the prominent corporations based in the USA
      Google, Facebook, Amazon, Twitter, ...
      Storage capacity of Europe = 70% USA     [McKinsey 2011]




        1/3 of world data stored in the cloud by 2020
                                    8
Geopolitics of big data




                      Alexa.com
            9
Geopolitics of big data
Data from the Web 2.0
 produced by users everywhere in the world
 but accumulated by corporations most often abroad

Percentage of national web corporations among top 25 by country




                                                  Alexa.com
                                9
Geopolitics of big data
Data from the Web 2.0
 produced by users everywhere in the world
 but accumulated by corporations most often abroad

Percentage of national web corporations among top 25 by country
  • USA: 100%
  • China: 92% (only Google makes it in the top 25)
  • France: 36% (but mostly marginal sites, not data intensive)
     leboncoin, Orange, Free, commentcamarche, lemonde, lequipe, lefigaro, pagesjaunes, sfr

                                                                            Alexa.com
                                               9
Geopolitics of big data




            10
Geopolitics of big data

The Top 50 websites worldwide
   • USA: 72 %




                    10
Geopolitics of big data

The Top 50 websites worldwide
   • USA: 72 %
   • China: 16 % (Baidu: 5; QQ: 8; Taobao: 13; Sina:17;
                163: 28; Soso:29; Sina weibo:31; Sohu:43)
   •   Russia: 6 % (Yandex: 21; kontakte:30; Mail: 33; )
   •   Israel: 2 % (Babylon: 22)
   •   UK: 2 % (BBC: 46)
   •   Netherland: 2 % (AVG: 47)
                             10
Geopolitics of big data
Diversity of search engines
    •   USA: Google: 65 % ; Bing: 15% ;Yahoo: 15%
    •   China: Baidu: 78% ; Google: 16%
    •   Russia: Yandex: 60% ; Google: 25%
    •   UK: Google: 91 % ; Bing: 5%
    •   France: Google: 92 % ; Bing: 3%


In France,
    • Google has a de facto monopoly
    • Google knows more about France than INSEE

                                     11
The global imbalance

               Information asymmetry

“Since asymmetries of information give rise to market power,
and perfect competition is required if markets are to be efficient,
it is perhaps not surprising that markets with information
asymmetries and other information imperfections are far from
efficient.”
                                              JOSEPH E. STIGLITZ



                                12
Impact of the global imbalance
Regulation
  What legislations over a dematerialized global industry?
  Aren’t the rules defined by those who have the control?

Business
  How to face monopolistic positions?
  How to handle the information asymmetry?

Security
  Data at the core of nations independence

                                      13
The power of data




             Map Ecological Footprint
        14   http://www.csa.com/discoveryguides/china/review.php
The power of data




             Map Ecological Footprint
        14   http://www.csa.com/discoveryguides/china/review.php
What’s at stake in Europe?
Suspicion (fear?) regarding data
	

 concern for privacy protection high in Europe
	

 active legislative work
	

 historical reasons?

Weak industrial/innovation environment
	

 no strong corporation emerging

But essential dependence on foreign systems
                             15
Are there alternatives?
dominant (centralized) model
            unclear privacy
             lost property
     active (centralized) business
    little share of business capacity




decentralized ‘utopian’ model
              high privacy                   Faroo, Yacy
            real ownership
             little business                 Diaspora



                                        16
odel
                                             m
                                   ed)
                               aliz
                  ce ntr   vac
                                y
           ( ar pri
      ant uncle perty ness
domin                t pr
                         o
                                    bus
                                        i
                                                 y
                 los         z ed)          acit
                        rali            cap
                   c ent         in ess
              ve (          bus
         acti         reo
                          f
               e sha
         littl



      de
         ce  ntr
                 aliz
                     e        d ‘u
                rea
                       hig
                          h   pri
                                     top
                   l     ow vacy           ian
              l i tt        ne                   ’m
                     le
                        bu rship                   od
                          sin
                              ess                    el        Faroo, Yacy
                                                               Diaspora
                                                          17
odel
                                             m
                                   ed)
                               aliz
                  ce ntr   vac
                                y
           ( ar pri
      ant uncle perty ness
domin                t pr
                         o
                                    bus
                                        i
                                                 y
                 los         z ed)          acit
                        rali            cap
                   c ent         in ess
              ve (          bus
         acti         reo
                          f
               e sha
         littl
                                                  an alternative path ?
                                                      active (competitive) business
                                                       symmetry of information
      de
         ce  ntr                                          ownership & privacy
                 aliz                                        anti monopoly
                     e        d ‘u
                rea
                       hig
                          h   pri
                                     top
                   l     ow vacy           ian
              l i tt        ne                   ’m
                     le
                        bu rship                   od
                          sin
                              ess                    el                       Faroo, Yacy
                                                                              Diaspora
                                                                17
An alternative path for Europe?

The information society
	

 it is only emerging
	

 it will continue to evolve
	

 it will impact political systems
	

 new business models, new equilibrium will appear




                              18
An alternative path for Europe?

The information society
	

 it is only emerging
	

 it will continue to evolve
	

 it will impact political systems
	

 new business models, new equilibrium will appear


                           Europe should embrace the future
                              18
19
谢谢
 19

More Related Content

More from Fing

Sobriété Numérique - Fabrice Flipo @Transitions²
Sobriété Numérique - Fabrice Flipo @Transitions²Sobriété Numérique - Fabrice Flipo @Transitions²
Sobriété Numérique - Fabrice Flipo @Transitions²Fing
 
L'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le Square
L'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le SquareL'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le Square
L'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le SquareFing
 
Kit AgirLocal, le numérique au service des démarches environnementales de nos...
Kit AgirLocal, le numérique au service des démarches environnementales de nos...Kit AgirLocal, le numérique au service des démarches environnementales de nos...
Kit AgirLocal, le numérique au service des démarches environnementales de nos...Fing
 
Les Transformations du travail à l'ère numérique - Fing
Les Transformations du travail à l'ère numérique - Fing Les Transformations du travail à l'ère numérique - Fing
Les Transformations du travail à l'ère numérique - Fing Fing
 
Plan d'action de la Fing en 2018
Plan d'action de la Fing en 2018Plan d'action de la Fing en 2018
Plan d'action de la Fing en 2018Fing
 
Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?
Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?
Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?Fing
 
Cahier d'enjeux "Questions Numériques" Think Small
Cahier d'enjeux "Questions Numériques" Think SmallCahier d'enjeux "Questions Numériques" Think Small
Cahier d'enjeux "Questions Numériques" Think SmallFing
 
Pour un rétro-design de l'attention
Pour un rétro-design de l'attentionPour un rétro-design de l'attention
Pour un rétro-design de l'attentionFing
 
Présentation de la Musette numérique de l'actif (juillet 2017)
Présentation de la Musette numérique de l'actif (juillet 2017)Présentation de la Musette numérique de l'actif (juillet 2017)
Présentation de la Musette numérique de l'actif (juillet 2017)Fing
 
Innovation facteur 4, journée Agenda du Futur le 04/07/17
Innovation facteur 4, journée Agenda du Futur le 04/07/17Innovation facteur 4, journée Agenda du Futur le 04/07/17
Innovation facteur 4, journée Agenda du Futur le 04/07/17Fing
 
Pistes d'innovation Nos Systèmes (version de travail)
Pistes d'innovation Nos Systèmes (version de travail)Pistes d'innovation Nos Systèmes (version de travail)
Pistes d'innovation Nos Systèmes (version de travail)Fing
 
Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...
Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...
Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...Fing
 
Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...
Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...
Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...Fing
 
Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...
Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...
Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...Fing
 
Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...
Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...
Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...Fing
 
Comprendre et accompagner les transformations à l’ère numérique
Comprendre et accompagner les transformations à l’ère numériqueComprendre et accompagner les transformations à l’ère numérique
Comprendre et accompagner les transformations à l’ère numériqueFing
 
10 projets, initiatives, idées, qui font le lien entre écologie et numérique
10 projets, initiatives, idées, qui font le lien entre écologie et numérique10 projets, initiatives, idées, qui font le lien entre écologie et numérique
10 projets, initiatives, idées, qui font le lien entre écologie et numériqueFing
 
Transitions singulières : Imaginer les mondes d’après
Transitions singulières : Imaginer les mondes d’aprèsTransitions singulières : Imaginer les mondes d’après
Transitions singulières : Imaginer les mondes d’aprèsFing
 
Musette Numérique de l'Actif - livrable final
Musette Numérique de l'Actif - livrable finalMusette Numérique de l'Actif - livrable final
Musette Numérique de l'Actif - livrable finalFing
 
Musette Restitution finale
Musette Restitution finaleMusette Restitution finale
Musette Restitution finaleFing
 

More from Fing (20)

Sobriété Numérique - Fabrice Flipo @Transitions²
Sobriété Numérique - Fabrice Flipo @Transitions²Sobriété Numérique - Fabrice Flipo @Transitions²
Sobriété Numérique - Fabrice Flipo @Transitions²
 
L'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le Square
L'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le SquareL'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le Square
L'Agenda pour le Futur de Transitions² / 3 juillet 2018, Paris, Le Square
 
Kit AgirLocal, le numérique au service des démarches environnementales de nos...
Kit AgirLocal, le numérique au service des démarches environnementales de nos...Kit AgirLocal, le numérique au service des démarches environnementales de nos...
Kit AgirLocal, le numérique au service des démarches environnementales de nos...
 
Les Transformations du travail à l'ère numérique - Fing
Les Transformations du travail à l'ère numérique - Fing Les Transformations du travail à l'ère numérique - Fing
Les Transformations du travail à l'ère numérique - Fing
 
Plan d'action de la Fing en 2018
Plan d'action de la Fing en 2018Plan d'action de la Fing en 2018
Plan d'action de la Fing en 2018
 
Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?
Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?
Crise de l'emploi, de l'entreprise, du travail... la faute aux technologies ?
 
Cahier d'enjeux "Questions Numériques" Think Small
Cahier d'enjeux "Questions Numériques" Think SmallCahier d'enjeux "Questions Numériques" Think Small
Cahier d'enjeux "Questions Numériques" Think Small
 
Pour un rétro-design de l'attention
Pour un rétro-design de l'attentionPour un rétro-design de l'attention
Pour un rétro-design de l'attention
 
Présentation de la Musette numérique de l'actif (juillet 2017)
Présentation de la Musette numérique de l'actif (juillet 2017)Présentation de la Musette numérique de l'actif (juillet 2017)
Présentation de la Musette numérique de l'actif (juillet 2017)
 
Innovation facteur 4, journée Agenda du Futur le 04/07/17
Innovation facteur 4, journée Agenda du Futur le 04/07/17Innovation facteur 4, journée Agenda du Futur le 04/07/17
Innovation facteur 4, journée Agenda du Futur le 04/07/17
 
Pistes d'innovation Nos Systèmes (version de travail)
Pistes d'innovation Nos Systèmes (version de travail)Pistes d'innovation Nos Systèmes (version de travail)
Pistes d'innovation Nos Systèmes (version de travail)
 
Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...
Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...
Daniel Kaplan - Transitions² / "Agenda pour le Futur" - Open Conference, 23 m...
 
Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...
Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...
Pierre-Yves Longaretti - Transitions² / "Agenda pour le Futur" - Open Confere...
 
Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...
Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...
Projet Open Source Ecology - Transitions² / "Agenda pour le Futur" - Open Con...
 
Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...
Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...
Projet WeDoGood - Transitions² / "Agenda pour le Futur" - Open Conference, 23...
 
Comprendre et accompagner les transformations à l’ère numérique
Comprendre et accompagner les transformations à l’ère numériqueComprendre et accompagner les transformations à l’ère numérique
Comprendre et accompagner les transformations à l’ère numérique
 
10 projets, initiatives, idées, qui font le lien entre écologie et numérique
10 projets, initiatives, idées, qui font le lien entre écologie et numérique10 projets, initiatives, idées, qui font le lien entre écologie et numérique
10 projets, initiatives, idées, qui font le lien entre écologie et numérique
 
Transitions singulières : Imaginer les mondes d’après
Transitions singulières : Imaginer les mondes d’aprèsTransitions singulières : Imaginer les mondes d’après
Transitions singulières : Imaginer les mondes d’après
 
Musette Numérique de l'Actif - livrable final
Musette Numérique de l'Actif - livrable finalMusette Numérique de l'Actif - livrable final
Musette Numérique de l'Actif - livrable final
 
Musette Restitution finale
Musette Restitution finaleMusette Restitution finale
Musette Restitution finale
 

Lift12Fr - Stephane Grumbach

  • 1. BIG DATA? THE GLOBAL IMBALANCE! Stéphane Grumbach INRIA 1
  • 2. The digital universe Data deluge in all sectors of activity U.S. Library of Congress: 235 Terabytes of data Walmart: 2.5 petabytes of data, 1 million customer transactions / hour Facebook: 30 Petabytes of user data Google: processing 20 petabytes a day (2008) World: 5 billion people calling, tweeting, browsing on mobile phones Exponential increase doubles every two years followed by the capacity to store, compute, and communicate 2
  • 3. The digital universe Data deluge in all sectors of activity kilo 103 U.S. Library of Congress: 235 Terabytes of data Walmart: 2.5 petabytes of data, 1 million customer transactions / hour mega 106 Facebook: 30 Petabytes of user data giga 109 Google: processing 20 petabytes a day (2008) tera 1012 World: 5 billion people calling, tweeting, browsing on mobile phones peta 1015 exa 1018 Exponential increase zetta 1021 doubles every two years yotta 1024 followed by the capacity to store, compute, and communicate 2
  • 4. The digital universe 2.7 Zettabytes Data deluge in all sectors of activity kilo 103 U.S. Library of Congress: 235 Terabytes of data Walmart: 2.5 petabytes of data, 1 million customer transactions / hour mega 106 Facebook: 30 Petabytes of user data giga 109 Google: processing 20 petabytes a day (2008) tera 1012 World: 5 billion people calling, tweeting, browsing on mobile phones peta 1015 exa 1018 Exponential increase zetta 1021 doubles every two years yotta 1024 followed by the capacity to store, compute, and communicate 2
  • 5. The digital universe 2.7 Zettabytes Data deluge in all sectors of activity kilo 103 U.S. Library of Congress: 235 Terabytes of data Walmart: 2.5 petabytes of data, 1 million customer transactions / hour mega 106 Facebook: 30 Petabytes of user data giga 109 Google: processing 20 petabytes a day (2008) tera 1012 World: 5 billion people calling, tweeting, browsing on mobile phones peta 1015 exa 1018 Exponential increase zetta 1021 yotta 1024 doubles every two years 35 zettabytes in 2020 followed by the capacity to store, compute, and communicate 2
  • 6. The Big Data Industry Advertising Capture users data Generate users profiles Target ads 3
  • 7. The Big Data Industry beyond advertising •$300 billion/year US health care •€250 billion/year Europe public administration [McKinsey 2011] Tremendous economic impact Teraeuros (thousands billions) 4
  • 8. First challenge: Data Harvesting 70% of the data produced by individuals directly produced by users: email, photos, blogs, etc. (less than half) indirectly digital shadow/footprint: surveillance, web usage, transactions 5
  • 9. First challenge: Data Harvesting 70% of the data produced by individuals directly produced by users: email, photos, blogs, etc. (less than half) indirectly digital shadow/footprint: surveillance, web usage, transactions The free paradigm of the 2.0 Free services traded for private user data Free exploitation of the accumulated data 5
  • 10. Second challenge: knowledge extraction User profiles (business) => Ads target Automatic discovery (science) => Google Flu monitoring of flu related queries a search engine company knows everything => Biological, sociological data... NSA (security) => Ambition to handle yottabytes (1024) !!! 6
  • 11. Second challenge: knowledge extraction User profiles (business) => Ads target Automatic discovery (science) => Google Flu monitoring of flu related queries a search engine company knows everything => Biological, sociological data... NSA (security) => Ambition to handle yottabytes (1024) !!! 6
  • 12. Data: raw material of the 21st century (much like crude oil) 7
  • 13. Data: raw material of the 21st century (much like crude oil) extraction from natural consumption transport refining at users reservoirs 7
  • 14. Data: raw material of the 21st century (much like crude oil) extraction from natural consumption transport refining at users reservoirs accumulation production data in large Internet of data analytics repositories at users 7
  • 15. Where are these data? 8
  • 16. Where are these data? Huge concentration of data 85% of data handled by (large) corporations Virtualization/dematerialization of infrastructures Social networks, Cloud, ... Most of the prominent corporations based in the USA Google, Facebook, Amazon, Twitter, ... Storage capacity of Europe = 70% USA [McKinsey 2011] 8
  • 17. Where are these data? Huge concentration of data 85% of data handled by (large) corporations Virtualization/dematerialization of infrastructures Social networks, Cloud, ... Most of the prominent corporations based in the USA Google, Facebook, Amazon, Twitter, ... Storage capacity of Europe = 70% USA [McKinsey 2011] 1/3 of world data stored in the cloud by 2020 8
  • 18. Geopolitics of big data Alexa.com 9
  • 19. Geopolitics of big data Data from the Web 2.0 produced by users everywhere in the world but accumulated by corporations most often abroad Percentage of national web corporations among top 25 by country Alexa.com 9
  • 20. Geopolitics of big data Data from the Web 2.0 produced by users everywhere in the world but accumulated by corporations most often abroad Percentage of national web corporations among top 25 by country • USA: 100% • China: 92% (only Google makes it in the top 25) • France: 36% (but mostly marginal sites, not data intensive) leboncoin, Orange, Free, commentcamarche, lemonde, lequipe, lefigaro, pagesjaunes, sfr Alexa.com 9
  • 22. Geopolitics of big data The Top 50 websites worldwide • USA: 72 % 10
  • 23. Geopolitics of big data The Top 50 websites worldwide • USA: 72 % • China: 16 % (Baidu: 5; QQ: 8; Taobao: 13; Sina:17; 163: 28; Soso:29; Sina weibo:31; Sohu:43) • Russia: 6 % (Yandex: 21; kontakte:30; Mail: 33; ) • Israel: 2 % (Babylon: 22) • UK: 2 % (BBC: 46) • Netherland: 2 % (AVG: 47) 10
  • 24. Geopolitics of big data Diversity of search engines • USA: Google: 65 % ; Bing: 15% ;Yahoo: 15% • China: Baidu: 78% ; Google: 16% • Russia: Yandex: 60% ; Google: 25% • UK: Google: 91 % ; Bing: 5% • France: Google: 92 % ; Bing: 3% In France, • Google has a de facto monopoly • Google knows more about France than INSEE 11
  • 25. The global imbalance Information asymmetry “Since asymmetries of information give rise to market power, and perfect competition is required if markets are to be efficient, it is perhaps not surprising that markets with information asymmetries and other information imperfections are far from efficient.” JOSEPH E. STIGLITZ 12
  • 26. Impact of the global imbalance Regulation What legislations over a dematerialized global industry? Aren’t the rules defined by those who have the control? Business How to face monopolistic positions? How to handle the information asymmetry? Security Data at the core of nations independence 13
  • 27. The power of data Map Ecological Footprint 14 http://www.csa.com/discoveryguides/china/review.php
  • 28. The power of data Map Ecological Footprint 14 http://www.csa.com/discoveryguides/china/review.php
  • 29. What’s at stake in Europe? Suspicion (fear?) regarding data concern for privacy protection high in Europe active legislative work historical reasons? Weak industrial/innovation environment no strong corporation emerging But essential dependence on foreign systems 15
  • 30. Are there alternatives? dominant (centralized) model unclear privacy lost property active (centralized) business little share of business capacity decentralized ‘utopian’ model high privacy Faroo, Yacy real ownership little business Diaspora 16
  • 31. odel m ed) aliz ce ntr vac y ( ar pri ant uncle perty ness domin t pr o bus i y los z ed) acit rali cap c ent in ess ve ( bus acti reo f e sha littl de ce ntr aliz e d ‘u rea hig h pri top l ow vacy ian l i tt ne ’m le bu rship od sin ess el Faroo, Yacy Diaspora 17
  • 32. odel m ed) aliz ce ntr vac y ( ar pri ant uncle perty ness domin t pr o bus i y los z ed) acit rali cap c ent in ess ve ( bus acti reo f e sha littl an alternative path ? active (competitive) business symmetry of information de ce ntr ownership & privacy aliz anti monopoly e d ‘u rea hig h pri top l ow vacy ian l i tt ne ’m le bu rship od sin ess el Faroo, Yacy Diaspora 17
  • 33. An alternative path for Europe? The information society it is only emerging it will continue to evolve it will impact political systems new business models, new equilibrium will appear 18
  • 34. An alternative path for Europe? The information society it is only emerging it will continue to evolve it will impact political systems new business models, new equilibrium will appear Europe should embrace the future 18
  • 35. 19

Editor's Notes

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. \n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n
  38. \n
  39. \n
  40. \n