Graph Visualization Tool for Twittersphere users
          based on a high-scalable
    Extract, Transform and Load System

       Pablo Aragón, Íñigo García and Antonio García

                       May 27th, 2011
INDEX
INTRODUCTION                Cierzo Development and SMMART
                            Structure of Twitter
                            Volume of Twitter
                            Detection of influencers


DISTRIBUTED COMPUTATION     Hadoop
                            Amazon EC2


PIPELINE DESIGN             Crawling Module
                            Metadata Extraction Module
                            Indexing Module
                            Graph Visualization Module



RESULTS                     Western Sahara Conflict
                            Patxi López
                            Conclusions
                            Future work
INTRODUCTION        CIERZO DEVELOPMENT AND SMMART
     DISTRIBUTED COMPUTATION        STRUCTURE OF TWITTER
              PIPELINE DESIGN       VOLUME OF TWITTER
                     RESULTS        DETECTION OF INFLUENCERS


INTRODUCTION: CIERZO DEVELOPMENT AND SMMART



                       SMMART (Social Media Marketing Analysis and
                       Reporting Tool) is the system developed by
                       Cierzo Development for:

                        Corporate social reputation

                        Measuring effectiveness of marketing campaigns

                        Detection of new trends


INTRODUCTION: STRUCTURE OF TWITTER




                        Structure of a profile


INTRODUCTION: STRUCTURE OF TWITTER



A user can establish a relationship with another user through a:


 Reply:        Update that begins with @username

 Mention:      Update that contains @username in the body of the tweet

 Retweet:      Update that reproduces the body of another user's tweet,
                crediting the original author
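
The three relationship types above can be told apart from the text of an update. The sketch below is only an illustration: the handle pattern and the "RT @author" retweet convention are assumptions, not rules stated in the slides, and the function name `relationships` is hypothetical.

```python
import re

# Assumed handle pattern: "@" followed by letters, digits or underscores.
HANDLE = re.compile(r"@(\w+)")
# Assumed retweet convention: the update starts with "RT @author".
RETWEET = re.compile(r"^RT @(\w+)")


def relationships(text):
    """Return (kind, username) pairs found in the text of one update."""
    retweet = RETWEET.match(text)
    if retweet:
        return [("retweet", retweet.group(1))]
    pairs = []
    for match in HANDLE.finditer(text):
        # A handle at position 0 makes the update a reply; elsewhere it is a mention.
        kind = "reply" if match.start() == 0 else "mention"
        pairs.append((kind, match.group(1)))
    return pairs
```

Each pair can later become a directed edge from the author of the update to the named user.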


INTRODUCTION: VOLUME OF TWITTER




      More than 200M users publishing millions of tweets per day


INTRODUCTION: DETECTION OF INFLUENCERS




    Older metrics are based on data such as:
         Absolute info:      Number of followers
         Relative info:      Ratio of following users to followers
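
As a small worked example of the two metrics, the sketch below computes both for one profile. The function name and the direction of the quotient (following divided by followers) are assumptions made for illustration.

```python
def follower_metrics(followers, following):
    """Return the absolute and relative metrics for one profile."""
    # The quotient is undefined for a profile without followers.
    ratio = following / followers if followers else None
    return {"followers": followers, "following_per_follower": ratio}
```

For a profile with 200 followers that follows 50 users, the relative value is 0.25. Neither number reflects how often other users actually interact with the profile.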


INTRODUCTION: DETECTION OF INFLUENCERS




Available search engines track Twitter and list
results, but they do not assign a value to the
users in the response.
#spanishrevolution
#yeswecamp
#15m
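
One simple way to assign a value to the users in a response is to count how many distinct users mention each of them. This is a hypothetical scoring rule chosen for illustration; the slides do not state which measure SMMART uses.

```python
from collections import defaultdict


def rank_by_mentions(edges):
    """Rank users by the number of distinct users who mention them.

    `edges` is an iterable of (author, mentioned_user) pairs.
    """
    mentioned_by = defaultdict(set)
    for author, mentioned in edges:
        if author != mentioned:  # ignore self-mentions
            mentioned_by[mentioned].add(author)
    scores = {user: len(authors) for user, authors in mentioned_by.items()}
    # Highest score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))
```

Repeated mentions from the same author count once, so a single very active account cannot inflate another user's score.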
INTRODUCTION
     DISTRIBUTED COMPUTATION     HADOOP
              PIPELINE DESIGN    AMAZON EC2
                     RESULTS


DISTRIBUTED COMPUTATION




    Management of large data volumes at the
     lowest cost

    Automatic adjustment to the daily
     growth of users and the oscillations in
     the frequency of publication


DISTRIBUTED COMPUTATION: HADOOP




        MapReduce                    Distributed File System
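
The MapReduce model can be sketched in plain Python, without Hadoop, as a map step that emits key/value pairs, a grouping step, and a reduce step. The example counts tweets per publication date; the record layout and all function names are assumptions made for illustration.

```python
from collections import defaultdict


def map_tweet(tweet):
    """Map step: emit one (publication date, 1) pair per tweet."""
    yield tweet["date"], 1


def shuffle(pairs):
    """Group the emitted values by key, as the framework does between steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: add up the values collected for one key."""
    return key, sum(values)


def tweets_per_day(tweets):
    """Run the three steps in a single process."""
    pairs = (pair for tweet in tweets for pair in map_tweet(tweet))
    return dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())


SAMPLE = [
    {"date": "2010-11-10", "text": "first"},
    {"date": "2010-11-10", "text": "second"},
    {"date": "2010-11-11", "text": "third"},
]
```

In a real cluster the map and reduce calls run on different nodes over data stored in the distributed file system; here everything runs in one process.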


DISTRIBUTED COMPUTATION: AMAZON EC2


                                Each Hadoop node is defined as a
                                machine image in Amazon Elastic
                                Compute Cloud.

                                The system's balancing mechanism
                                adds and removes Hadoop nodes
                                in real time, on demand.
INTRODUCTION    CRAWLING MODULE
      DISTRIBUTED COMPUTATION    METADATA EXTRACTION MODULE
               PIPELINE DESIGN   INDEXING MODULE
                      RESULTS    GRAPH VISUALIZATION MODULE


PIPELINE DESIGN


PIPELINE DESIGN: CRAWLING MODULE



Based on Nutch


1.   Crawl the Twitter profiles
     stored in a DB

2.   Extract outlinks to new
     profiles
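
The two steps above form a loop: every crawled profile can add new profiles to the list still to be crawled. The sketch below shows that loop as a breadth-first traversal. It does not use the Nutch API; `fetch_outlinks` is a hypothetical callback and `TOY_LINKS` is made-up data standing in for fetched pages.

```python
from collections import deque


def crawl(seed_profiles, fetch_outlinks, max_profiles=100):
    """Visit the stored profiles first, then the profiles they link to."""
    queue = deque(seed_profiles)
    seen = set(seed_profiles)
    visited = []
    while queue and len(visited) < max_profiles:
        profile = queue.popleft()
        visited.append(profile)
        for linked in fetch_outlinks(profile):
            if linked not in seen:  # schedule each profile only once
                seen.add(linked)
                queue.append(linked)
    return visited


# Toy link structure standing in for fetched profile pages.
TOY_LINKS = {"ana": ["luis", "eva"], "luis": ["ana"], "eva": ["marta"], "marta": []}
order = crawl(["ana"], lambda profile: TOY_LINKS.get(profile, []))
```

The `max_profiles` limit keeps the sketch bounded; a production crawler would also need politeness delays and error handling.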


PIPELINE DESIGN: METADATA EXTRACTION MODULE



                                   The HTML fragment of a tweet
                                   contains a set of metadata:

                                    Textual content
                                    Publication date
                                    Author
                                    Mentions of other users
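
The four fields can be pulled out of an HTML fragment with the standard library parser. The markup in `SAMPLE` is invented for illustration (the `tweet` class and the `data-author` and `data-date` attributes are assumptions); Twitter's real markup differs and changes over time.

```python
from html.parser import HTMLParser
import re


class TweetParser(HTMLParser):
    """Collect author, date and text from one (hypothetical) tweet fragment."""

    def __init__(self):
        super().__init__()
        self.author = None
        self.date = None
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if attributes.get("class") == "tweet":
            self.author = attributes.get("data-author")
            self.date = attributes.get("data-date")

    def handle_data(self, data):
        self.chunks.append(data)


def extract_metadata(fragment):
    parser = TweetParser()
    parser.feed(fragment)
    parser.close()
    # Join the text nodes and collapse runs of whitespace.
    text = " ".join("".join(parser.chunks).split())
    return {
        "text": text,
        "date": parser.date,
        "author": parser.author,
        "mentions": re.findall(r"@(\w+)", text),
    }


SAMPLE = (
    '<div class="tweet" data-author="ana" data-date="2010-11-12">'
    '<p>Reading about the camp with <a href="/luis">@luis</a></p></div>'
)
```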


PIPELINE DESIGN: INDEXING MODULE


 Apache Solr (enterprise search server based on Lucene)

  Sorting algorithms

  Stemming

  Stopwords filters

  Faceted search

 Multicore architecture, sharded by publication date.
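
Sharding by publication date means that each tweet goes to a core chosen from its date, and a date-range query only needs to visit the cores that overlap the range. The sketch below assumes one core per month and a `tweets_YYYY_MM` naming scheme; both are hypothetical, since the slides do not give the shard granularity.

```python
from datetime import date


def core_for(publication_date):
    """Map a publication date to a (hypothetical) monthly core name."""
    return f"tweets_{publication_date:%Y_%m}"


def cores_for_range(start, end):
    """List the cores that a query over [start, end] would need to visit."""
    cores = []
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        cores.append(f"tweets_{year:04d}_{month:02d}")
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return cores
```

With this layout, a query limited to a few days touches a single core, while older cores stay untouched as new tweets arrive.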


PIPELINE DESIGN: GRAPH VISUALIZATION MODULE

The Graph Visualization module transforms the
responses from the index into a graph using
Yifan Hu's layout algorithm, provided in the
Gephi Toolkit:

    Force-based

    Multilevel


Y. F. Hu, "Efficient and high quality force-directed
graph drawing". The Mathematica Journal,
10, 37-71, 2005
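
To illustrate what "force-based" means, the sketch below implements a basic spring-electrical layout: every pair of nodes repels, and connected nodes attract. It is not the Gephi Toolkit implementation and it omits the multilevel coarsening and other optimisations of the cited algorithm. The function name, the parameter defaults and the cooling factor are assumptions.

```python
import math
import random


def force_layout(nodes, edges, iterations=200, k=1.0, step=0.1, seed=0):
    """Return {node: (x, y)} for a list of nodes and a list of (a, b) edges."""
    rng = random.Random(seed)
    pos = {n: [rng.uniform(-1, 1), rng.uniform(-1, 1)] for n in nodes}
    for _ in range(iterations):
        force = {n: [0.0, 0.0] for n in nodes}
        # Repulsion between every pair of nodes: magnitude k^2 / distance.
        for i, a in enumerate(nodes):
            for b in nodes[i + 1:]:
                dx = pos[a][0] - pos[b][0]
                dy = pos[a][1] - pos[b][1]
                dist = math.hypot(dx, dy) or 1e-9
                push = k * k / dist
                force[a][0] += push * dx / dist
                force[a][1] += push * dy / dist
                force[b][0] -= push * dx / dist
                force[b][1] -= push * dy / dist
        # Attraction along every edge: magnitude distance^2 / k.
        for a, b in edges:
            dx = pos[a][0] - pos[b][0]
            dy = pos[a][1] - pos[b][1]
            dist = math.hypot(dx, dy) or 1e-9
            pull = dist * dist / k
            force[a][0] -= pull * dx / dist
            force[a][1] -= pull * dy / dist
            force[b][0] += pull * dx / dist
            force[b][1] += pull * dy / dist
        # Move each node along its net force, by at most `step`.
        for n in nodes:
            fx, fy = force[n]
            mag = math.hypot(fx, fy) or 1e-9
            move = min(mag, step)
            pos[n][0] += move * fx / mag
            pos[n][1] += move * fy / mag
        step *= 0.98  # cool down so the layout settles
    return {n: tuple(p) for n, p in pos.items()}
```

The pairwise repulsion loop is quadratic in the number of nodes, which is one reason larger graphs call for multilevel and approximation techniques.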
INTRODUCTION                 WESTERN SAHARA CONFLICT
         DISTRIBUTED COMPUTATION                 PATXI LÓPEZ
                  PIPELINE DESIGN                CONCLUSIONS
                         RESULTS                 FUTURE WORK


RESULTS: WESTERN SAHARA CONFLICT




 In November 2010, Moroccan security forces intervened
 in a camp in Western Sahara. This action was criticized
 by part of Spanish society.


RESULTS: WESTERN SAHARA CONFLICT

                                Search
                                 content:'sahara'

                                 language:'es'

                                 date:[2010-11-10 TO 2010-11-18]



                                Results
                                 1721 users

                                 3925 tweets

                                 707 mentions
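
The three search fields can be combined into a single Lucene-style query string for the index. The sketch below assumes the fields are named `content`, `language` and `date` as on the slide, that `date` is indexed as a Solr date (which expects full ISO 8601 timestamps in range queries), and that a facet on a hypothetical `author` field is wanted. It only builds the request parameters and does not contact a server.

```python
from urllib.parse import urlencode


def build_query(content, language, start, end):
    """Build a Lucene-style query string from the three search fields."""
    return (
        f"content:{content} AND language:{language} "
        f"AND date:[{start}T00:00:00Z TO {end}T23:59:59Z]"
    )


query = build_query("sahara", "es", "2010-11-10", "2010-11-18")

# Request parameters for a select handler; "author" is an assumed field name.
params = urlencode({"q": query, "facet": "true", "facet.field": "author"})
```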


RESULTS: WESTERN SAHARA CONFLICT


RESULTS: PATXI LÓPEZ




 Patxi López holds the position of President of
 the Basque Government. His campaign
 included social media strategies.


RESULTS: PATXI LÓPEZ

                                 Search
                                  mention:'patxi_lopez'

                                  language:'es'

                                  date:[2010-11-10 TO 2010-11-18]



                                 Results
                                  186 users

                                  196 tweets

                                  366 mentions


RESULTS: PATXI LÓPEZ


RESULTS: CONCLUSIONS

  The implemented tool identifies the main influencers on a specific
   topic or around a specific user

  The highly scalable design adapts to a large social network such as
   Twitter

  Enterprises can deploy social media monitoring systems using
   exclusively open source technologies

  The tool provides information to support crisis management


RESULTS: FUTURE WORK

  New versions for more social media sources


  Real-time results


  New data mining applications

            Predictive models
Thanks for your attention

Mais conteúdo relacionado

Destaque

Biographical social networks on Wikipedia
Biographical social networks on WikipediaBiographical social networks on Wikipedia
Biographical social networks on WikipediaPablo Aragón
 
UAB: Análisis de redes sociales
UAB: Análisis de redes socialesUAB: Análisis de redes sociales
UAB: Análisis de redes socialesPablo Aragón
 
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...Pablo Aragón
 
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide MadridFrom Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide MadridPablo Aragón
 
Gestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consolaGestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consolaPablo Aragón
 
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...Pablo Aragón
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsRuth Garcia Gavilanes
 
Datactic, Data with Tactics
Datactic, Data with TacticsDatactic, Data with Tactics
Datactic, Data with TacticsPablo Aragón
 
The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...Pablo Aragón
 
Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...Pablo Aragón
 
Análisis de datos para la democracia
Análisis de datos para la democraciaAnálisis de datos para la democracia
Análisis de datos para la democraciaPablo Aragón
 
Assessing inter-cultural patterns through ranking biographiesBiographies
Assessing inter-cultural patterns through ranking biographiesBiographiesAssessing inter-cultural patterns through ranking biographiesBiographies
Assessing inter-cultural patterns through ranking biographiesBiographiesPablo Aragón
 

Destaque (12)

Biographical social networks on Wikipedia
Biographical social networks on WikipediaBiographical social networks on Wikipedia
Biographical social networks on Wikipedia
 
UAB: Análisis de redes sociales
UAB: Análisis de redes socialesUAB: Análisis de redes sociales
UAB: Análisis de redes sociales
 
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
To Thread or Not to Thread: The Impact of Conversation Threading on Online Di...
 
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide MadridFrom Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
From Citizen Data to the Wisdom of the Crowds: The Case Study of Decide Madrid
 
Gestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consolaGestión de instancias en amazon ec2 desde consola
Gestión de instancias en amazon ec2 desde consola
 
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
When a Movement Becomes a Party: Computational Assessment of New Forms of Pol...
 
Who are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in MicroblogsWho are my Audiences? Evolution of Target Audiences in Microblogs
Who are my Audiences? Evolution of Target Audiences in Microblogs
 
Datactic, Data with Tactics
Datactic, Data with TacticsDatactic, Data with Tactics
Datactic, Data with Tactics
 
The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...The missing link between Network Science and the Social Media Monitoring indu...
The missing link between Network Science and the Social Media Monitoring indu...
 
Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...Spanish indignados and the evolution of #15M: Towards networked para-institut...
Spanish indignados and the evolution of #15M: Towards networked para-institut...
 
Análisis de datos para la democracia
Análisis de datos para la democraciaAnálisis de datos para la democracia
Análisis de datos para la democracia
 
Assessing inter-cultural patterns through ranking biographiesBiographies
Assessing inter-cultural patterns through ranking biographiesBiographiesAssessing inter-cultural patterns through ranking biographiesBiographies
Assessing inter-cultural patterns through ranking biographiesBiographies
 

Semelhante a Graph Visualization Tool for Twittersphere users based on a high-scalable Extract, Transform and Load System

MuCon 2019: Exploring Your Microservices Architecture Through Network Science...
MuCon 2019: Exploring Your Microservices Architecture Through Network Science...MuCon 2019: Exploring Your Microservices Architecture Through Network Science...
MuCon 2019: Exploring Your Microservices Architecture Through Network Science...OpenCredo
 
PhD Projects in Network Simulator Research Help
PhD Projects in Network Simulator Research HelpPhD Projects in Network Simulator Research Help
PhD Projects in Network Simulator Research HelpPhD Services
 
How to segment developers for your API strategy
How to segment developers for your API strategyHow to segment developers for your API strategy
How to segment developers for your API strategyApigee | Google Cloud
 
Linked services for the Web of Data
Linked services for the Web of DataLinked services for the Web of Data
Linked services for the Web of DataJohn Domingue
 
Synctree Capabilties Deck
Synctree Capabilties DeckSynctree Capabilties Deck
Synctree Capabilties DeckPhoebe B. Scott
 
Exploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion ModelsExploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion ModelsKonfHubTechConferenc
 
Social in B2B.ppt
Social in B2B.pptSocial in B2B.ppt
Social in B2B.pptDSG
 
Great Technical Architects Must Be Great Organisation Architects
Great Technical Architects Must Be Great Organisation ArchitectsGreat Technical Architects Must Be Great Organisation Architects
Great Technical Architects Must Be Great Organisation ArchitectsNick Tune
 
Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...
Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...
Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...HostedbyConfluent
 
Dataiku meetup 12 july 2018 Amsterdam
Dataiku meetup 12 july 2018 AmsterdamDataiku meetup 12 july 2018 Amsterdam
Dataiku meetup 12 july 2018 AmsterdamLonghow Lam
 
SOA, OTD, and Web 2.0 = Collaboration
SOA, OTD, and Web 2.0 = CollaborationSOA, OTD, and Web 2.0 = Collaboration
SOA, OTD, and Web 2.0 = Collaborationjstogdill
 
Ichec community management - reporting & e-reputation
Ichec   community management - reporting & e-reputationIchec   community management - reporting & e-reputation
Ichec community management - reporting & e-reputationThomas Van Roy
 
How to leverage Kafka data streams with Neo4j
How to leverage Kafka data streams with Neo4jHow to leverage Kafka data streams with Neo4j
How to leverage Kafka data streams with Neo4jGraphRM
 
3D體驗平台-模組(角色)說明
3D體驗平台-模組(角色)說明3D體驗平台-模組(角色)說明
3D體驗平台-模組(角色)說明Jimmy Chang
 
Mariana Alupului Inventions
Mariana Alupului InventionsMariana Alupului Inventions
Mariana Alupului Inventionsmalupului
 
Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07
Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07
Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07pseybold
 
Any DVD Converter for iPod
 Any DVD Converter for iPod Any DVD Converter for iPod
Any DVD Converter for iPodcrysatal16
 

Semelhante a Graph Visualization Tool for Twittersphere users based on a high-scalable Extract, Transform and Load System (20)

MuCon 2019: Exploring Your Microservices Architecture Through Network Science...
MuCon 2019: Exploring Your Microservices Architecture Through Network Science...MuCon 2019: Exploring Your Microservices Architecture Through Network Science...
MuCon 2019: Exploring Your Microservices Architecture Through Network Science...
 
PhD Projects in Network Simulator Research Help
PhD Projects in Network Simulator Research HelpPhD Projects in Network Simulator Research Help
PhD Projects in Network Simulator Research Help
 
How to segment developers for your API strategy
How to segment developers for your API strategyHow to segment developers for your API strategy
How to segment developers for your API strategy
 
Linked services for the Web of Data
Linked services for the Web of DataLinked services for the Web of Data
Linked services for the Web of Data
 
Synctree Capabilties Deck
Synctree Capabilties DeckSynctree Capabilties Deck
Synctree Capabilties Deck
 
nwesp 2011
nwesp 2011nwesp 2011
nwesp 2011
 
Exploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion ModelsExploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion Models
 
Tfm slides
Tfm slidesTfm slides
Tfm slides
 
Social in B2B.ppt
Social in B2B.pptSocial in B2B.ppt
Social in B2B.ppt
 
Great Technical Architects Must Be Great Organisation Architects
Great Technical Architects Must Be Great Organisation ArchitectsGreat Technical Architects Must Be Great Organisation Architects
Great Technical Architects Must Be Great Organisation Architects
 
Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...
Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...
Automated Apache Kafka Mocking and Testing with AsyncAPI | Hugo Guerrero, Red...
 
Dataiku meetup 12 july 2018 Amsterdam
Dataiku meetup 12 july 2018 AmsterdamDataiku meetup 12 july 2018 Amsterdam
Dataiku meetup 12 july 2018 Amsterdam
 
SOA, OTD, and Web 2.0 = Collaboration
SOA, OTD, and Web 2.0 = CollaborationSOA, OTD, and Web 2.0 = Collaboration
SOA, OTD, and Web 2.0 = Collaboration
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
Ichec community management - reporting & e-reputation
Ichec   community management - reporting & e-reputationIchec   community management - reporting & e-reputation
Ichec community management - reporting & e-reputation
 
How to leverage Kafka data streams with Neo4j
How to leverage Kafka data streams with Neo4jHow to leverage Kafka data streams with Neo4j
How to leverage Kafka data streams with Neo4j
 
3D體驗平台-模組(角色)說明
3D體驗平台-模組(角色)說明3D體驗平台-模組(角色)說明
3D體驗平台-模組(角色)說明
 
Mariana Alupului Inventions
Mariana Alupului InventionsMariana Alupului Inventions
Mariana Alupului Inventions
 
Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07
Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07
Using Web 2.0 For Outside I Nnovation Seybold Stm Dec 07
 
Any DVD Converter for iPod
 Any DVD Converter for iPod Any DVD Converter for iPod
Any DVD Converter for iPod
 

Mais de Pablo Aragón

A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...Pablo Aragón
 
Civic Technologies: Research, Practice, and Open Challenges
Civic Technologies: Research, Practice, and Open ChallengesCivic Technologies: Research, Practice, and Open Challenges
Civic Technologies: Research, Practice, and Open ChallengesPablo Aragón
 
Characterizing Online Participation in Civic Technologies - PhD
Characterizing Online Participation in Civic Technologies - PhDCharacterizing Online Participation in Civic Technologies - PhD
Characterizing Online Participation in Civic Technologies - PhDPablo Aragón
 
The DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
The DECODE Ecosystem: Tools for citizens’ data sovereignty in BarcelonaThe DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
The DECODE Ecosystem: Tools for citizens’ data sovereignty in BarcelonaPablo Aragón
 
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...Pablo Aragón
 
DECODE project: Barcelona pilots
DECODE project: Barcelona pilotsDECODE project: Barcelona pilots
DECODE project: Barcelona pilotsPablo Aragón
 
Generative models of online discussion threads (ASONAM 2018 tutorial)
Generative models of online discussion threads (ASONAM 2018 tutorial)Generative models of online discussion threads (ASONAM 2018 tutorial)
Generative models of online discussion threads (ASONAM 2018 tutorial)Pablo Aragón
 
Online Petitioning Through Data Exploration and What We Found There: A Datase...
Online Petitioning Through Data Exploration and What We Found There: A Datase...Online Petitioning Through Data Exploration and What We Found There: A Datase...
Online Petitioning Through Data Exploration and What We Found There: A Datase...Pablo Aragón
 
Decidim: Visualización de datos para la innovación democrática
Decidim: Visualización de datos para la innovación democráticaDecidim: Visualización de datos para la innovación democrática
Decidim: Visualización de datos para la innovación democráticaPablo Aragón
 
Data Science in the era of Fake News
Data Science in the era of Fake NewsData Science in the era of Fake News
Data Science in the era of Fake NewsPablo Aragón
 
Datos para la participación
Datos para la participaciónDatos para la participación
Datos para la participaciónPablo Aragón
 
Decidim en redes sociales
Decidim en redes socialesDecidim en redes sociales
Decidim en redes socialesPablo Aragón
 
The dynamics of a social convention
The dynamics of a social conventionThe dynamics of a social convention
The dynamics of a social conventionPablo Aragón
 
Discussions and decisions on Decidim Barcelona
Discussions and decisions on Decidim BarcelonaDiscussions and decisions on Decidim Barcelona
Discussions and decisions on Decidim BarcelonaPablo Aragón
 
Computational Framework for the Assessment of New Forms of Political Organiza...
Computational Framework for the Assessment of New Forms of Political Organiza...Computational Framework for the Assessment of New Forms of Political Organiza...
Computational Framework for the Assessment of New Forms of Political Organiza...Pablo Aragón
 
4th Databeers BCN - When a movement becomes a party
4th Databeers BCN - When a movement becomes a party4th Databeers BCN - When a movement becomes a party
4th Databeers BCN - When a movement becomes a partyPablo Aragón
 

Mais de Pablo Aragón (19)

A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
 
Civic Technologies: Research, Practice, and Open Challenges
Civic Technologies: Research, Practice, and Open ChallengesCivic Technologies: Research, Practice, and Open Challenges
Civic Technologies: Research, Practice, and Open Challenges
 
Characterizing Online Participation in Civic Technologies - PhD
Characterizing Online Participation in Civic Technologies - PhDCharacterizing Online Participation in Civic Technologies - PhD
Characterizing Online Participation in Civic Technologies - PhD
 
The DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
The DECODE Ecosystem: Tools for citizens’ data sovereignty in BarcelonaThe DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
The DECODE Ecosystem: Tools for citizens’ data sovereignty in Barcelona
 
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
Sistema interactivo para el descubrimiento de temas y propuestas de Decide Ma...
 
DECODE project: Barcelona pilots
DECODE project: Barcelona pilotsDECODE project: Barcelona pilots
DECODE project: Barcelona pilots
 
Labmeeting18
Labmeeting18Labmeeting18
Labmeeting18
 
Generative models of online discussion threads (ASONAM 2018 tutorial)
Generative models of online discussion threads (ASONAM 2018 tutorial)Generative models of online discussion threads (ASONAM 2018 tutorial)
Generative models of online discussion threads (ASONAM 2018 tutorial)
 
Online Petitioning Through Data Exploration and What We Found There: A Datase...
Online Petitioning Through Data Exploration and What We Found There: A Datase...Online Petitioning Through Data Exploration and What We Found There: A Datase...
Online Petitioning Through Data Exploration and What We Found There: A Datase...
 
Decidim: Visualización de datos para la innovación democrática
Decidim: Visualización de datos para la innovación democráticaDecidim: Visualización de datos para la innovación democrática
Decidim: Visualización de datos para la innovación democrática
 
Data Science in the era of Fake News
Data Science in the era of Fake NewsData Science in the era of Fake News
Data Science in the era of Fake News
 
Uk seminar
Uk seminarUk seminar
Uk seminar
 
Datos para la participación
Datos para la participaciónDatos para la participación
Datos para la participación
 
Lab metadecidim
Lab metadecidimLab metadecidim
Lab metadecidim
 
Decidim en redes sociales
Decidim en redes socialesDecidim en redes sociales
Decidim en redes sociales
 
The dynamics of a social convention
The dynamics of a social conventionThe dynamics of a social convention
The dynamics of a social convention
 
Discussions and decisions on Decidim Barcelona
Discussions and decisions on Decidim BarcelonaDiscussions and decisions on Decidim Barcelona
Discussions and decisions on Decidim Barcelona
 
Computational Framework for the Assessment of New Forms of Political Organiza...
Computational Framework for the Assessment of New Forms of Political Organiza...Computational Framework for the Assessment of New Forms of Political Organiza...
Computational Framework for the Assessment of New Forms of Political Organiza...
 
4th Databeers BCN - When a movement becomes a party
4th Databeers BCN - When a movement becomes a party4th Databeers BCN - When a movement becomes a party
4th Databeers BCN - When a movement becomes a party
 

Graph Visualization Tool for Twittersphere users based on a high-scalable Extract, Transform and Load System

  • 1. Graph Visualization Tool for Twittersphere users based on a high-scalable Extract, Transform and Load System. Pablo Aragón, Íñigo García and Antonio García. May 27th, 2011
  • 2. INDEX
      • INTRODUCTION: Cierzo Development and SMMART; Structure of Twitter; Volume of Twitter; Detection of influencers
      • DISTRIBUTED COMPUTATION: Hadoop; Amazon EC2
      • PIPELINE DESIGN: Crawling Module; Metadata Extraction Module; Indexing Module; Graph Visualization Module
      • RESULTS: Western Sahara Conflict; Patxi López; Conclusions; Future work
  • 3. INTRODUCTION: CIERZO DEVELOPMENT AND SMMART. SMMART (Social Media Marketing Analysis and Reporting Tool) is the system developed by Cierzo Development for:
      • Corporate social reputation
      • Measuring the effectiveness of marketing campaigns
      • Detection of new trends
  • 4. INTRODUCTION: STRUCTURE OF TWITTER. Structure of a profile
  • 5. INTRODUCTION: STRUCTURE OF TWITTER. A user can establish a relationship with another user through a:
      • Reply: an update that begins with @username
      • Mention: an update that contains @username in the body of the tweet
      • Retweet: an update that reproduces the body of another user's tweet and credits the original author
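The three relationship types on slide 5 can be told apart from the tweet text alone. The sketch below is an illustration, not the authors' implementation: the function name `classify` is hypothetical, and it assumes retweets follow the conventional "RT @username" form, which the slides do not specify.

```python
import re

MENTION_RE = re.compile(r"@(\w+)")
# Assumption: a retweet credits its author with an "RT" marker right before @username.
RT_PREFIX_RE = re.compile(r"(?:^|\W)RT\s*$")


def classify(text):
    """Return (relationship, username) pairs for every @username in a tweet."""
    relations = []
    for match in MENTION_RE.finditer(text):
        user = match.group(1)
        preceding = text[:match.start()]
        if RT_PREFIX_RE.search(preceding):
            kind = "retweet"   # "RT @username ..." credits the original author
        elif match.start() == 0:
            kind = "reply"     # the update begins with @username
        else:
            kind = "mention"   # @username appears elsewhere in the body
        relations.append((kind, user))
    return relations
```

The pattern is deliberately simple: it would also match the `@` in an e-mail address, so a production extractor would need stricter rules.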
  • 6. INTRODUCTION: VOLUME OF TWITTER. More than 200M users publishing millions of tweets per day
  • 7. INTRODUCTION: DETECTION OF INFLUENCERS. Older metrics are based on data such as:
      • Absolute info: number of followers
      • Relative info: quotient of following users and followers
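As a small worked example of the two older metrics on slide 7, the sketch below computes both from raw counts. The function and key names are hypothetical, and reading the quotient as "following divided by followers" is an assumption taken from the order of the terms on the slide.

```python
def follower_metrics(followers, following):
    """Compute the absolute and relative metrics from raw profile counts."""
    # The quotient is undefined for a profile with no followers.
    ratio = following / followers if followers else None
    return {"followers": followers, "following_to_followers": ratio}
```

A profile with 200 followers that follows 50 accounts would get a quotient of 0.25. Neither number reflects how other users actually interact with the profile.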
  • 8. INTRODUCTION: DETECTION OF INFLUENCERS. Available search engines track Twitter and list results, but they do not assign a value to the users in the response.
  • 10. DISTRIBUTED COMPUTATION
      • Management of large volumes of data at the lowest cost
      • Automatic adjustment to the daily growth in users and to oscillations in publication frequency
  • 11. DISTRIBUTED COMPUTATION: HADOOP
      • MapReduce
      • Distributed File System
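To make the MapReduce model on slide 11 concrete, the sketch below imitates its map, shuffle and reduce phases in a single Python process. It does not use the Hadoop API, and the job itself (counting how often each user is mentioned) is an assumed example rather than a job described in the slides.

```python
import re
from collections import defaultdict


def map_tweet(tweet):
    """Map step: emit (mentioned_user, 1) for every @username in a tweet."""
    for user in re.findall(r"@(\w+)", tweet):
        yield user.lower(), 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: sum the counts collected for one user."""
    return key, sum(values)


def run_job(tweets):
    """Run the three phases in sequence over an in-memory list of tweets."""
    pairs = (pair for tweet in tweets for pair in map_tweet(tweet))
    return dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())
```

In a real cluster the map and reduce functions run on different nodes and the framework performs the shuffle; the structure of the computation stays the same.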
  • 12. DISTRIBUTED COMPUTATION: AMAZON EC2. A Hadoop node is defined as a machine image in Amazon Elastic Compute Cloud. The system's balancing mechanism adds and removes Hadoop nodes in real time, on demand.
  • 13. PIPELINE DESIGN
  • 14. PIPELINE DESIGN: CRAWLING MODULE. Based on Nutch:
      1. Crawl the Twitter profiles stored in a DB
      2. Extract outlinks to new profiles
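The two crawling steps on slide 14 amount to a breadth-first traversal: fetch known profiles, then queue any newly discovered ones. The sketch below shows that loop in plain Python. It is not Nutch code; `crawl`, `fetch_outlinks` and `max_profiles` are hypothetical names, and the seed list stands in for the profile database.

```python
from collections import deque


def crawl(seed_profiles, fetch_outlinks, max_profiles=100):
    """Visit profiles breadth-first, following outlinks to profiles not seen before."""
    seeds = list(dict.fromkeys(seed_profiles))  # drop duplicate seeds, keep order
    seen = set(seeds)
    queue = deque(seeds)
    visited = []
    while queue and len(visited) < max_profiles:
        profile = queue.popleft()
        visited.append(profile)                 # step 1: crawl a stored profile
        for link in fetch_outlinks(profile):    # step 2: extract outlinks
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return visited
```

Passing the fetch function in as an argument keeps the traversal logic testable without network access.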
  • 15. PIPELINE DESIGN: METADATA EXTRACTION MODULE. The HTML fragment of a tweet contains a set of metadata:
      • Textual content
      • Publication date
      • Author
      • Mentions of other users
  • 16. PIPELINE DESIGN: INDEXING MODULE. Apache Solr (enterprise search server based on Lucene):
      • Sorting algorithms
      • Stemming
      • Stopword filters
      • Faceted search
    Multicore architecture, sharded by publication date.
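Sharding by publication date means that each tweet is routed to a core chosen from its date, and a date-range search only needs to consult the cores that overlap the range. The sketch below illustrates that routing under two assumptions not stated in the slides: one core per calendar month, and a hypothetical naming scheme `tweets_YYYY_MM`.

```python
from datetime import date


def core_for(published, prefix="tweets"):
    """Name of the core that holds tweets published on the given date."""
    return f"{prefix}_{published.year:04d}_{published.month:02d}"


def cores_for_range(start, end, prefix="tweets"):
    """Names of all monthly cores that overlap the range [start, end]."""
    cores = []
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        cores.append(f"{prefix}_{year:04d}_{month:02d}")
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return cores
```

With this scheme, a search restricted to a few days in November 2010 would touch a single core regardless of how large the full index grows.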
  • 17. PIPELINE DESIGN: GRAPH VISUALIZATION MODULE. The Graph Visualization module transforms the responses from the index into a graph using Yifan Hu's layout algorithm, provided in the Gephi Toolkit:
      • Force-based
      • Multilevel
    Y. F. Hu, "Efficient and high quality force-directed graph drawing", The Mathematica Journal, 10, 37-71, 2005
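Before any layout can run, the index responses have to become nodes and edges. The sketch below shows one plausible transformation: a directed, weighted edge from each author to every user they mention, written out as a CSV edge list with Source, Target and Weight columns. The record fields `author` and `mentions` and both function names are assumptions; the layout itself is left to the Gephi Toolkit and is not reproduced here.

```python
import csv
import io
from collections import Counter


def build_edges(tweets):
    """Count directed author -> mentioned-user edges across a list of tweet records."""
    edges = Counter()
    for tweet in tweets:
        for target in tweet["mentions"]:
            if target != tweet["author"]:  # ignore self-mentions
                edges[(tweet["author"], target)] += 1
    return edges


def edges_to_csv(edges):
    """Serialize the edges as a CSV edge list, sorted for stable output."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Source", "Target", "Weight"])
    for (source, target), weight in sorted(edges.items()):
        writer.writerow([source, target, weight])
    return buffer.getvalue()
```

Users who receive many incoming edges end up as hubs in the drawn graph, which is what makes influencers visible.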
  • 18.
  • 19. RESULTS: WESTERN SAHARA CONFLICT. In November 2010, Moroccan security forces intervened in a camp in Western Sahara. This action was criticized by part of Spanish society.
  • 20. RESULTS: WESTERN SAHARA CONFLICT
    Search
      • content:'sahara'
      • language:'es'
      • date:[2010-11-10 TO 2010-11-18]
    Results
      • 1721 users
      • 3925 tweets
      • 707 mentions
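The search on slide 20 maps naturally onto a fielded query in Lucene syntax. The sketch below assembles such a query string; the field names come from the slide, while the function name is hypothetical. Expanding the dates to full timestamps and treating the end date as inclusive through the end of the day are assumptions.

```python
from datetime import date


def build_query(field, value, language, start, end):
    """Build a fielded query restricted by language and publication date range."""
    # Assumption: the date field expects full ISO 8601 timestamps in UTC.
    date_range = f"[{start.isoformat()}T00:00:00Z TO {end.isoformat()}T23:59:59Z]"
    return f'{field}:"{value}" AND language:{language} AND date:{date_range}'
```

Calling `build_query("content", "sahara", "es", date(2010, 11, 10), date(2010, 11, 18))` yields the query for the Western Sahara case; swapping the first two arguments for `"mention"` and `"patxi_lopez"` covers a search around a specific user.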
  • 21. RESULTS: WESTERN SAHARA CONFLICT
  • 22. RESULTS: PATXI LÓPEZ. Patxi López holds the position of President of the Basque Country Government. His campaign included social network strategies.
  • 23. RESULTS: PATXI LÓPEZ
    Search
      • mention:'patxi_lopez'
      • language:'es'
      • date:[2010-11-10 TO 2010-11-18]
    Results
      • 186 users
      • 196 tweets
      • 366 mentions
  • 24. RESULTS: PATXI LÓPEZ
  • 25. RESULTS: CONCLUSIONS
      • The implemented tool identifies the main influencers on a specific topic or around a particular user
      • The highly scalable design adapts to a large social network such as Twitter
      • Enterprises can deploy social media monitoring systems using exclusively open source technologies
      • The tool provides information for crisis management
  • 26.
  • 27. RESULTS: FUTURE WORK
      • New versions for more social media sources
      • Real-time results
      • New data mining applications
      • Predictive models
  • 28. Thanks for your attention