SlideShare uma empresa Scribd logo
1 de 62
Baixar para ler offline
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
A GEO-stack for Big Data
Driving Spatial Analysis Beyond the Limits of Traditional Storage
Joana Sim˜oes 1
1Bdigital, CASA, CICS.NOVA
March 12, 2015
1 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data Word Cloud; source:http://olap.com/big-data/
2 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Table of Contents
1 The Value of Data
2 The Big Data Revolution
3 An Use Case
4 Final Remarks
3 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Warning
* This presentation may contain tech talk, such as: databases,
clusters, NoSQL, cloud, parallelism, scalability.
If you are susceptible to these concepts, you may want to leave the
room now.
From this point beyond, you are at your own risk! *
4 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
The Value of Data
Not a New Idea!
Matthew Fontaine Maury
(1806-1873).
5 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
The Value of Data
Not a New Idea!
Matthew Fontaine Maury
(1806-1873).
Foresaw the hidden value on
captain’s ships logs, when
analysed collectively.
Used time series data to
carry out analysis that would
enable him to recomend
optimal shipping routes.
6 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
The Value of Data
Data Mining, Open-source and Crowd-sourcing
7 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
The Value of Data
Data Mining, Open-source and Crowd-sourcing
In 1848, Captain Jackson was the the first person to try the route
recommended by Maury, and as a result he was able to save 17 days
on his outbond trip.
8 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
The Value of Data
Data Mining, Open-source and Crowd-sourcing
In 1848, Captain Jackson was the the first person to try the route
recommended by Maury, and as a result he was able to save 17 days
on his outbond trip.
Apart from collecting existing logs, Maury encouraged the collection
of more regular and systematic time series, by creating a template.
9 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
The Value of Data
Data Mining, Open-source and Crowd-sourcing
In 1848, Captain Jackson was the the first person to try the route
recommended by Maury, and as a result he was able to save 17 days
on his outbond trip.
Apart from collecting existing logs, Maury encouraged the collection
of more regular and systematic time series, by creating a template.
Collected Data: longitude, latitude, currents, magnetic variation, air
and water temperature, general wind direction, etc.
10 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Data Analysis
A multidisciplinary field
11 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Data Analysis
Traditional Stack
Spreadsheets (e.g.: Excel, OpenOffice)
RDBMS (e.g.: Oracle, PostgreSQL, MySQL)
Statistical Packages (e.g.: R, Matlab)
GIS Packages (e.g.: QGIS)
Scripting and programming languages (e.g.: Python, Java)
Libraries
12 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
What changed in recent years?
Differences in the way global business and transportation are
done have exploded the volume of traditional data sources.
Widespread increase in data from sensors.
13 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
Smart Citizen Project
https://smartcitizen.me/
14 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
Smart Citizen Project
Platform to generate
participatory processes of
people in the cities, by
connecting data, people and
knowledge.
https://smartcitizen.me/
15 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
Smart Citizen Project
Platform to generate
participatory processes of
people in the cities, by
connecting data, people and
knowledge.
Based on geolocation, Internet
and free hardware and software
for data collection and sharing.
https://smartcitizen.me/
16 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
Smart Citizen Project
Platform to generate
participatory processes of
people in the cities, by
connecting data, people and
knowledge.
Based on geolocation, Internet
and free hardware and software
for data collection and sharing.
Relies on the production of
objects to connect people with
their environment and their
city.
https://smartcitizen.me/
17 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
Based on reports, Fillipo Arrieta published maps
describing a multi-stage containment plan designed to
limit the plague in Bari (1864).
The iLab used the ushahidi platform to collect and display
crowd-sorcing information about Ebola in Liberia (2014).
18 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
A great deal of this data is actually geo-located (e.g.: satellite
navigate coordinates, ip addresses).
Based on reports, Fillipo Arrieta published maps
describing a multi-stage containment plan designed to
limit the plague in Bari (1864).
The iLab used the ushahidi platform to collect and display
crowd-sorcing information about Ebola in Liberia (2014).
19 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
A great deal of this data is actually geo-located (e.g.: satellite
navigate coordinates, ip addresses).
Geography has finally the opportunity to switch from being based
on guesses and samples, to become a truly data-driven science.
Based on reports, Fillipo Arrieta published maps
describing a multi-stage containment plan designed to
limit the plague in Bari (1864).
The iLab used the ushahidi platform to collect and display
crowd-sorcing information about Ebola in Liberia (2014).
20 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
What characterizes Big Data?
21 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
What characterizes Big Data?
Volume: Large amounts of data.
22 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
What characterizes Big Data?
Volume: Large amounts of data.
Variety: Different types of structured, unstructured, and
multi-structured data.
23 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Big Data
What characterizes Big Data?
Volume: Large amounts of data.
Variety: Different types of structured, unstructured, and
multi-structured data.
Velocity: Needs to be analyzed quickly.
24 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Technological Challenges
These characteristics map into challenges:
Scalability
Heterogeneity
Low latency
When the traditional stack is no longer enough, a paradigm shift is
required.
25 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
RDBMS vs NoSQL
26 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
RDBMS vs NoSQL
NoSQL databases trade away some capabilities of relational
databases (SQL), in order to improve scalability.
27 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
RDBMS vs NoSQL
NoSQL databases trade away some capabilities of relational
databases (SQL), in order to improve scalability.
Advantages: NoSQL databases are simpler, can handle
semi-structured and denormalized data and have an higher
scalability.
28 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
RDBMS vs NoSQL
NoSQL databases trade away some capabilities of relational
databases (SQL), in order to improve scalability.
Advantages: NoSQL databases are simpler, can handle
semi-structured and denormalized data and have an higher
scalability.
Disadvantages: loss of abstraction provided by the query
optimizer, that increases the complexity of the applications.
29 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
RDBMS vs NoSQL
NoSQL databases trade away some capabilities of relational
databases (SQL), in order to improve scalability.
Advantages: NoSQL databases are simpler, can handle
semi-structured and denormalized data and have an higher
scalability.
Disadvantages: loss of abstraction provided by the query
optimizer, that increases the complexity of the applications.
Recently, tools were developed that bring back the full power
of SQL language to the NoSQL ecosystem (e.g.: Apache Drill,
Hive). 30 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
RDBMS vs NoSQL
31 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Examples
32 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
MapReduce
Programming model for processing and generating large data sets
with a parallel, distributed algorithm on a cluster.
Map(): performs filtering and sorting.
Reduce(): performs a summary operation.
33 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
MapReduce
Programming model for processing and generating large data sets
with a parallel, distributed algorithm on a cluster.
Map(): performs filtering and sorting.
Reduce(): performs a summary operation.
The framework coordinates the processing, by marshalling the
distributed servers, running the tasks and parallel and
managing all communications and data between the various
parts of the system.
34 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
MapReduce
Programming model for processing and generating large data sets
with a parallel, distributed algorithm on a cluster.
Map(): performs filtering and sorting.
Reduce(): performs a summary operation.
The framework coordinates the processing, by marshalling the
distributed servers, running the tasks and parallel and
managing all communications and data between the various
parts of the system.
There are many libraries that implement MapReduce.
35 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
A Big Data Approach
Does this mean we have to throw away our traditional tools
and methods?
36 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
A Big Data Approach
Does this mean we have to throw away our traditional tools
and methods?
First ensure that you really have, or will have Big Data at
some point in the future.
37 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
A Big Data Approach
Does this mean we have to throw away our traditional tools
and methods?
First ensure that you really have, or will have Big Data at
some point in the future.
Then identify the stages of the workflow that are bottlenecks,
in terms of the current technologies.
38 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
A Big Data Approach
Does this mean we have to throw away our traditional tools
and methods?
First ensure that you really have, or will have Big Data at
some point in the future.
Then identify the stages of the workflow that are bottlenecks,
in terms of the current technologies.
It is possible to mix & and match.
39 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Analysing geo-located Tweets
40 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Analysing geo-located Tweets
The purpose of this use case was to analyse the stream of
geo-located Tweets, as a sensor for citizen presence.
41 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Analysing geo-located Tweets
The purpose of this use case was to analyse the stream of
geo-located Tweets, as a sensor for citizen presence.
Number of tweets in Catalunya in around 3 months: +- 6
million.
42 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Analysing geo-located Tweets
The purpose of this use case was to analyse the stream of
geo-located Tweets, as a sensor for citizen presence.
Number of tweets in Catalunya in around 3 months: +- 6
million.
Continuous stream of data.
43 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Analysing geo-located Tweets
The purpose of this use case was to analyse the stream of
geo-located Tweets, as a sensor for citizen presence.
Number of tweets in Catalunya in around 3 months: +- 6
million.
Continuous stream of data.
This amount of data is not easily assimilated by the
”human-eye”, so we decided to create clusters of Tweets.
44 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
An Use Case
Clustering
45 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
An Use Case
Clustering
It is a descriptive data mining technique, often used for
dimensionality reduction.
46 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
An Use Case
Clustering
It is a descriptive data mining technique, often used for
dimensionality reduction.
It groups a set of objects in such a way that objects in the
same group are more similar to each other, than to object in
other groups.
47 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
An Use Case
Clustering
It is a descriptive data mining technique, often used for
dimensionality reduction.
It groups a set of objects in such a way that objects in the
same group are more similar to each other, than to object in
other groups.
Strictly, it corresponds to a family of unsupervised machine
learning algorithms.
48 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
An Use Case
Clustering
It is a descriptive data mining technique, often used for
dimensionality reduction.
It groups a set of objects in such a way that objects in the
same group are more similar to each other, than to object in
other groups.
Strictly, it corresponds to a family of unsupervised machine
learning algorithms.
We wanted to implement this concept using only Hadoop, and
apply it to spatial attributes.
49 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Technological Stack
50 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Workflow
51 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Results
Using this workflow we were able to turn the original point cloud,
into a density grid, and then into clusters.
Clusters expose patterns of presence over time, and can help us to
re-define boundaries for the city that are data-driven, rather than
administrative.
52 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Final Remarks
53 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Final Remarks
It is ok to use non scalable tools, at certain points of the
workflow.
54 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Final Remarks
It is ok to use non scalable tools, at certain points of the
workflow.
We are working on the edge of existing technologies; some
functions were not implemented yet, and bugs are common.
55 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Final Remarks
It is ok to use non scalable tools, at certain points of the
workflow.
We are working on the edge of existing technologies; some
functions were not implemented yet, and bugs are common.
Since it is a niche, this is even more true for spatial
technologies.
56 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Final Remarks
It is ok to use non scalable tools, at certain points of the
workflow.
We are working on the edge of existing technologies; some
functions were not implemented yet, and bugs are common.
Since it is a niche, this is even more true for spatial
technologies.
There are not ready-made solutions: the particular stack and
workflow should be compiled for the specific case study.
57 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Final Remarks
It is ok to use non scalable tools, at certain points of the
workflow.
We are working on the edge of existing technologies; some
functions were not implemented yet, and bugs are common.
Since it is a niche, this is even more true for spatial
technologies.
There are not ready-made solutions: the particular stack and
workflow should be compiled for the specific case study.
this is one stack to solve this problem; it is not the only one,
and it may not even be the ”best” one;
58 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Acknowledgements
I would like to thank Ellen Friedman (MapR, Apache Mahout,
Apache Drill), for her inspiring work on Big Data and Machine
Learning.
59 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
References
Aji, A., Wang, F., Vo, H., Lee, R., Liu, Q., Zhang, X. and
Saltz, J.” Hadoop-GIS: A High Performance Spatial Data
Warehousing System over MapReduce”. Proceedings of the
VLDB Endowment International Conference on Very Large
Data Bases, 6(11), p1009. (2013)
Cuesta, H. ”Practical Data Analysis”. Packt Publishing (2013)
Dunning, T. and Friedman, E. ”Time Series Databases: New
Ways to Store and Access Data”. O’Reilly Media; 1 edition
(October, 2014)
Myatt, G and Johnson, W. ”Making Sense Of Data I: A
Practical Guide to Exploratory Data Analysis and Data
Mining”. O’Reilly : 2 edition (2014).
60 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Thank You!
This presentation is available at:
http://tinyurl.com/pcmgzxp
Next:
61 / 62
The Value of Data
The Big Data Revolution
An Use Case
Final Remarks
Thank You!
This presentation is available at:
http://tinyurl.com/pcmgzxp
Next:
9as Jornadas de SIG Libre - 26th-27th March 2015, Girona.
62 / 62

Mais conteúdo relacionado

Mais procurados

What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? Robert Grossman
 
A modified k means algorithm for big data clustering
A modified k means algorithm for big data clusteringA modified k means algorithm for big data clustering
A modified k means algorithm for big data clusteringSK Ahammad Fahad
 
applications-and-current-challenges-of-supercomputing-across-multiple-domains...
applications-and-current-challenges-of-supercomputing-across-multiple-domains...applications-and-current-challenges-of-supercomputing-across-multiple-domains...
applications-and-current-challenges-of-supercomputing-across-multiple-domains...Neha Gupta
 
What Are Science Clouds?
What Are Science Clouds?What Are Science Clouds?
What Are Science Clouds?Robert Grossman
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchRobert Grossman
 
Map Reduce in Big fata
Map Reduce in Big fataMap Reduce in Big fata
Map Reduce in Big fataSuraj Sawant
 
re:Introduce Big Data and Hadoop Eco-system.
re:Introduce Big Data and Hadoop Eco-system.re:Introduce Big Data and Hadoop Eco-system.
re:Introduce Big Data and Hadoop Eco-system.Shakir Ali
 
New Trends and Directions in Data Science - MIT Information Quality Conferenc...
New Trends and Directions in Data Science - MIT Information Quality Conferenc...New Trends and Directions in Data Science - MIT Information Quality Conferenc...
New Trends and Directions in Data Science - MIT Information Quality Conferenc...Mario Faria
 
Visual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory ScienceVisual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory ScienceUniversity of Washington
 
Kid171 chap0 english version
Kid171 chap0 english versionKid171 chap0 english version
Kid171 chap0 english versionFrank S.C. Tseng
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor networkparry prabhu
 
A Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremA Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremAnthonyOtuonye
 
Smart Data and real-world semantic web applications (2004)
Smart Data and real-world semantic web applications (2004)Smart Data and real-world semantic web applications (2004)
Smart Data and real-world semantic web applications (2004)Amit Sheth
 
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them..."Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them...Tom Moritz
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltoolssuresh sood
 
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Amit Sheth
 
History of Big Data
History of Big DataHistory of Big Data
History of Big DataHEXANIKA
 

Mais procurados (20)

What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care?
 
A modified k means algorithm for big data clustering
A modified k means algorithm for big data clusteringA modified k means algorithm for big data clustering
A modified k means algorithm for big data clustering
 
applications-and-current-challenges-of-supercomputing-across-multiple-domains...
applications-and-current-challenges-of-supercomputing-across-multiple-domains...applications-and-current-challenges-of-supercomputing-across-multiple-domains...
applications-and-current-challenges-of-supercomputing-across-multiple-domains...
 
What Are Science Clouds?
What Are Science Clouds?What Are Science Clouds?
What Are Science Clouds?
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science Research
 
Map Reduce in Big fata
Map Reduce in Big fataMap Reduce in Big fata
Map Reduce in Big fata
 
re:Introduce Big Data and Hadoop Eco-system.
re:Introduce Big Data and Hadoop Eco-system.re:Introduce Big Data and Hadoop Eco-system.
re:Introduce Big Data and Hadoop Eco-system.
 
New Trends and Directions in Data Science - MIT Information Quality Conferenc...
New Trends and Directions in Data Science - MIT Information Quality Conferenc...New Trends and Directions in Data Science - MIT Information Quality Conferenc...
New Trends and Directions in Data Science - MIT Information Quality Conferenc...
 
Visual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory ScienceVisual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory Science
 
Kid171 chap0 english version
Kid171 chap0 english versionKid171 chap0 english version
Kid171 chap0 english version
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
 
Cifar
CifarCifar
Cifar
 
A Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremA Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE Theorem
 
End-to-End eScience
End-to-End eScienceEnd-to-End eScience
End-to-End eScience
 
Smart Data and real-world semantic web applications (2004)
Smart Data and real-world semantic web applications (2004)Smart Data and real-world semantic web applications (2004)
Smart Data and real-world semantic web applications (2004)
 
Math2015
Math2015Math2015
Math2015
 
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them..."Some Reflections on Data in the Public Sector" : Communia: The European Them...
"Some Reflections on Data in the Public Sector" : Communia: The European Them...
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltools
 
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
 
History of Big Data
History of Big DataHistory of Big Data
History of Big Data
 

Destaque

workshop - Posiciona-te no Linkedin
workshop - Posiciona-te no Linkedinworkshop - Posiciona-te no Linkedin
workshop - Posiciona-te no LinkedinMárcio Miranda
 
Estratégia de Comunicação e Marketing na Era Digital
Estratégia de Comunicação e Marketing na Era DigitalEstratégia de Comunicação e Marketing na Era Digital
Estratégia de Comunicação e Marketing na Era DigitalFábio Mesquita
 
Social Media Marketing & Copywriting | Content Development for Moche's Facebo...
Social Media Marketing & Copywriting | Content Development for Moche's Facebo...Social Media Marketing & Copywriting | Content Development for Moche's Facebo...
Social Media Marketing & Copywriting | Content Development for Moche's Facebo...Cíntia Pereira
 
Annual_Content_Strategy_Summit
Annual_Content_Strategy_SummitAnnual_Content_Strategy_Summit
Annual_Content_Strategy_SummitAlex Fryer
 
Word of Mouth Marketing Work
Word of Mouth Marketing WorkWord of Mouth Marketing Work
Word of Mouth Marketing WorkMárcio Miranda
 
Marcia Augusto Digital Marketing Plan
Marcia Augusto Digital Marketing PlanMarcia Augusto Digital Marketing Plan
Marcia Augusto Digital Marketing PlanMárcia Augusto
 
Identidade Profissional - Linkedin e Outras Redes Sociais
Identidade Profissional - Linkedin e Outras Redes SociaisIdentidade Profissional - Linkedin e Outras Redes Sociais
Identidade Profissional - Linkedin e Outras Redes SociaisMárcio Miranda
 

Destaque (12)

workshop - Posiciona-te no Linkedin
workshop - Posiciona-te no Linkedinworkshop - Posiciona-te no Linkedin
workshop - Posiciona-te no Linkedin
 
Estratégia de Comunicação e Marketing na Era Digital
Estratégia de Comunicação e Marketing na Era DigitalEstratégia de Comunicação e Marketing na Era Digital
Estratégia de Comunicação e Marketing na Era Digital
 
Social Media Marketing & Copywriting | Content Development for Moche's Facebo...
Social Media Marketing & Copywriting | Content Development for Moche's Facebo...Social Media Marketing & Copywriting | Content Development for Moche's Facebo...
Social Media Marketing & Copywriting | Content Development for Moche's Facebo...
 
Annual_Content_Strategy_Summit
Annual_Content_Strategy_SummitAnnual_Content_Strategy_Summit
Annual_Content_Strategy_Summit
 
Word of Mouth Marketing Work
Word of Mouth Marketing WorkWord of Mouth Marketing Work
Word of Mouth Marketing Work
 
Pinterest Work
Pinterest WorkPinterest Work
Pinterest Work
 
Gestão de Marketing
Gestão de MarketingGestão de Marketing
Gestão de Marketing
 
Six characteristics of compelling content
Six characteristics of compelling contentSix characteristics of compelling content
Six characteristics of compelling content
 
Marcia Augusto Digital Marketing Plan
Marcia Augusto Digital Marketing PlanMarcia Augusto Digital Marketing Plan
Marcia Augusto Digital Marketing Plan
 
SEO - Work
SEO - WorkSEO - Work
SEO - Work
 
E-commerce Work
E-commerce WorkE-commerce Work
E-commerce Work
 
Identidade Profissional - Linkedin e Outras Redes Sociais
Identidade Profissional - Linkedin e Outras Redes SociaisIdentidade Profissional - Linkedin e Outras Redes Sociais
Identidade Profissional - Linkedin e Outras Redes Sociais
 

Semelhante a geostack

Machine Learning meets Granular Computing
Machine Learning meets Granular ComputingMachine Learning meets Granular Computing
Machine Learning meets Granular ComputingJenny Midwinter
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsWay-Yen Lin
 
Big data final presentation
Big data final presentationBig data final presentation
Big data final presentationrohitpce
 
Big Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely headingBig Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely headingPaco Nathan
 
The Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big DataThe Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big DataReadWrite
 
The future of mobile and big data
The future of mobile and big dataThe future of mobile and big data
The future of mobile and big dataspirit conference
 
Big data: Challenges, Practices and Technologies
Big data: Challenges, Practices and TechnologiesBig data: Challenges, Practices and Technologies
Big data: Challenges, Practices and TechnologiesNavneet Randhawa
 
Fortune Time Institute: Big Data - Challenges for Smartcity
Fortune Time Institute: Big Data - Challenges for SmartcityFortune Time Institute: Big Data - Challenges for Smartcity
Fortune Time Institute: Big Data - Challenges for SmartcityVictoria López
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR Technologies
 
Massive Data Analysis- Challenges and Applications
Massive Data Analysis- Challenges and ApplicationsMassive Data Analysis- Challenges and Applications
Massive Data Analysis- Challenges and ApplicationsVijay Raghavan
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Kerstin Lehnert
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)eXascale Infolab
 

Semelhante a geostack (20)

Machine Learning meets Granular Computing
Machine Learning meets Granular ComputingMachine Learning meets Granular Computing
Machine Learning meets Granular Computing
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data Scientists
 
Big data final presentation
Big data final presentationBig data final presentation
Big data final presentation
 
Addressing dm-cloud
Addressing dm-cloudAddressing dm-cloud
Addressing dm-cloud
 
Big Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely headingBig Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely heading
 
The Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big DataThe Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big Data
 
Big Data et eGovernment
Big Data et eGovernmentBig Data et eGovernment
Big Data et eGovernment
 
The future of mobile and big data
The future of mobile and big dataThe future of mobile and big data
The future of mobile and big data
 
Big data: Challenges, Practices and Technologies
Big data: Challenges, Practices and TechnologiesBig data: Challenges, Practices and Technologies
Big data: Challenges, Practices and Technologies
 
Fortune Time Institute: Big Data - Challenges for Smartcity
Fortune Time Institute: Big Data - Challenges for SmartcityFortune Time Institute: Big Data - Challenges for Smartcity
Fortune Time Institute: Big Data - Challenges for Smartcity
 
Data mining on big data
Data mining on big dataData mining on big data
Data mining on big data
 
Big dataorig
Big dataorigBig dataorig
Big dataorig
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Massive Data Analysis- Challenges and Applications
Massive Data Analysis- Challenges and ApplicationsMassive Data Analysis- Challenges and Applications
Massive Data Analysis- Challenges and Applications
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
Big data mining
Big data miningBig data mining
Big data mining
 
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
 

geostack

  • 1. The Value of Data The Big Data Revolution An Use Case Final Remarks A GEO-stack for Big Data Driving Spatial Analysis Beyond the Limits of Traditional Storage Joana Sim˜oes 1 1Bdigital, CASA, CICS.NOVA March 12, 2015 1 / 62
  • 2. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data Word Cloud; source:http://olap.com/big-data/ 2 / 62
  • 3. The Value of Data The Big Data Revolution An Use Case Final Remarks Table of Contents 1 The Value of Data 2 The Big Data Revolution 3 An Use Case 4 Final Remarks 3 / 62
  • 4. The Value of Data The Big Data Revolution An Use Case Final Remarks Warning * This presentation may contain tech talk, such as: databases, clusters, NoSQL, cloud, parallelism, scalability. If you are susceptible to these concepts, you may want to leave the room now. From this point beyond, you are at your own risk! * 4 / 62
  • 5. The Value of Data The Big Data Revolution An Use Case Final Remarks The Value of Data Not a New Idea! Matthew Fontaine Maury (1806-1873). 5 / 62
  • 6. The Value of Data The Big Data Revolution An Use Case Final Remarks The Value of Data Not a New Idea! Matthew Fontaine Maury (1806-1873). Foresaw the hidden value on captain’s ships logs, when analysed collectively. Used time series data to carry out analysis that would enable him to recomend optimal shipping routes. 6 / 62
  • 7. The Value of Data The Big Data Revolution An Use Case Final Remarks The Value of Data Data Mining, Open-source and Crowd-sourcing 7 / 62
  • 8. The Value of Data The Big Data Revolution An Use Case Final Remarks The Value of Data Data Mining, Open-source and Crowd-sourcing In 1848, Captain Jackson was the the first person to try the route recommended by Maury, and as a result he was able to save 17 days on his outbond trip. 8 / 62
  • 9. The Value of Data The Big Data Revolution An Use Case Final Remarks The Value of Data Data Mining, Open-source and Crowd-sourcing In 1848, Captain Jackson was the the first person to try the route recommended by Maury, and as a result he was able to save 17 days on his outbond trip. Apart from collecting existing logs, Maury encouraged the collection of more regular and systematic time series, by creating a template. 9 / 62
  • 10. The Value of Data The Big Data Revolution An Use Case Final Remarks The Value of Data Data Mining, Open-source and Crowd-sourcing In 1848, Captain Jackson was the the first person to try the route recommended by Maury, and as a result he was able to save 17 days on his outbond trip. Apart from collecting existing logs, Maury encouraged the collection of more regular and systematic time series, by creating a template. Collected Data: longitude, latitude, currents, magnetic variation, air and water temperature, general wind direction, etc. 10 / 62
  • 11. The Value of Data The Big Data Revolution An Use Case Final Remarks Data Analysis A multidisciplinary field 11 / 62
  • 12. The Value of Data The Big Data Revolution An Use Case Final Remarks Data Analysis Traditional Stack Spreadsheets (e.g.: Excel, OpenOffice) RDBMS (e.g.: Oracle, PostgreSQL, MySQL) Statistical Packages (e.g.: R, Matlab) GIS Packages (e.g.: QGIS) Scripting and programming languages (e.g.: Python, Java) Libraries 12 / 62
  • 13. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data What changed in recent years? Differences in the way global business and transportation are done have exploded the volume of traditional data sources. Widespread increase in data from sensors. 13 / 62
  • 14. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data Smart Citizen Project https://smartcitizen.me/ 14 / 62
  • 15. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data Smart Citizen Project Platform to generate participatory processes of people in the cities, by connecting data, people and knowledge. https://smartcitizen.me/ 15 / 62
  • 16. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data Smart Citizen Project Platform to generate participatory processes of people in the cities, by connecting data, people and knowledge. Based on geolocation, Internet and free hardware and software for data collection and sharing. https://smartcitizen.me/ 16 / 62
  • 17. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data Smart Citizen Project Platform to generate participatory processes of people in the cities, by connecting data, people and knowledge. Based on geolocation, Internet and free hardware and software for data collection and sharing. Relies on the production of objects to connect people with their environment and their city. https://smartcitizen.me/ 17 / 62
  • 18. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data Based on reports, Fillipo Arrieta published maps describing a multi-stage containment plan designed to limit the plague in Bari (1864). The iLab used the ushahidi platform to collect and display crowd-sorcing information about Ebola in Liberia (2014). 18 / 62
  • 19. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data A great deal of this data is actually geo-located (e.g.: satellite navigate coordinates, ip addresses). Based on reports, Fillipo Arrieta published maps describing a multi-stage containment plan designed to limit the plague in Bari (1864). The iLab used the ushahidi platform to collect and display crowd-sorcing information about Ebola in Liberia (2014). 19 / 62
  • 20. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data A great deal of this data is actually geo-located (e.g.: satellite navigate coordinates, ip addresses). Geography has finally the opportunity to switch from being based on guesses and samples, to become a truly data-driven science. Based on reports, Fillipo Arrieta published maps describing a multi-stage containment plan designed to limit the plague in Bari (1864). The iLab used the ushahidi platform to collect and display crowd-sorcing information about Ebola in Liberia (2014). 20 / 62
  • 21. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data What characterizes Big Data? 21 / 62
  • 22. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data What characterizes Big Data? Volume: Large amounts of data. 22 / 62
  • 23. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data What characterizes Big Data? Volume: Large amounts of data. Variety: Different types of structured, unstructured, and multi-structured data. 23 / 62
  • 24. The Value of Data The Big Data Revolution An Use Case Final Remarks Big Data What characterizes Big Data? Volume: Large amounts of data. Variety: Different types of structured, unstructured, and multi-structured data. Velocity: Needs to be analyzed quickly. 24 / 62
  • 25. The Value of Data The Big Data Revolution An Use Case Final Remarks Technological Challenges These characteristics map into challenges: Scalability Heterogeneity Low latency When the traditional stack is no longer enough, a paradigm shift is required. 25 / 62
  • 26. The Value of Data The Big Data Revolution An Use Case Final Remarks RDBMS vs NoSQL 26 / 62
  • 27. The Value of Data The Big Data Revolution An Use Case Final Remarks RDBMS vs NoSQL NoSQL databases trade away some capabilities of relational databases (SQL), in order to improve scalability. 27 / 62
  • 28. The Value of Data The Big Data Revolution An Use Case Final Remarks RDBMS vs NoSQL NoSQL databases trade away some capabilities of relational databases (SQL), in order to improve scalability. Advantages: NoSQL databases are simpler, can handle semi-structured and denormalized data and have an higher scalability. 28 / 62
  • 29. The Value of Data The Big Data Revolution An Use Case Final Remarks RDBMS vs NoSQL NoSQL databases trade away some capabilities of relational databases (SQL), in order to improve scalability. Advantages: NoSQL databases are simpler, can handle semi-structured and denormalized data and have an higher scalability. Disadvantages: loss of abstraction provided by the query optimizer, that increases the complexity of the applications. 29 / 62
  • 30. The Value of Data The Big Data Revolution An Use Case Final Remarks RDBMS vs NoSQL NoSQL databases trade away some capabilities of relational databases (SQL), in order to improve scalability. Advantages: NoSQL databases are simpler, can handle semi-structured and denormalized data and have an higher scalability. Disadvantages: loss of abstraction provided by the query optimizer, that increases the complexity of the applications. Recently, tools were developed that bring back the full power of SQL language to the NoSQL ecosystem (e.g.: Apache Drill, Hive). 30 / 62
  • 31. The Value of Data The Big Data Revolution An Use Case Final Remarks RDBMS vs NoSQL 31 / 62
  • 32. The Value of Data The Big Data Revolution An Use Case Final Remarks Examples 32 / 62
  • 33. The Value of Data The Big Data Revolution An Use Case Final Remarks MapReduce Programming model for processing and generating large data sets with a parallel, distributed algorithm on a cluster. Map(): performs filtering and sorting. Reduce(): performs a summary operation. 33 / 62
  • 34. The Value of Data The Big Data Revolution An Use Case Final Remarks MapReduce Programming model for processing and generating large data sets with a parallel, distributed algorithm on a cluster. Map(): performs filtering and sorting. Reduce(): performs a summary operation. The framework coordinates the processing, by marshalling the distributed servers, running the tasks and parallel and managing all communications and data between the various parts of the system. 34 / 62
  • 35. The Value of Data The Big Data Revolution An Use Case Final Remarks MapReduce Programming model for processing and generating large data sets with a parallel, distributed algorithm on a cluster. Map(): performs filtering and sorting. Reduce(): performs a summary operation. The framework coordinates the processing, by marshalling the distributed servers, running the tasks and parallel and managing all communications and data between the various parts of the system. There are many libraries that implement MapReduce. 35 / 62
  • 36. The Value of Data The Big Data Revolution An Use Case Final Remarks A Big Data Approach Does this mean we have to throw away our traditional tools and methods? 36 / 62
  • 37. The Value of Data The Big Data Revolution An Use Case Final Remarks A Big Data Approach Does this mean we have to throw away our traditional tools and methods? First ensure that you really have, or will have Big Data at some point in the future. 37 / 62
  • 38. The Value of Data The Big Data Revolution An Use Case Final Remarks A Big Data Approach Does this mean we have to throw away our traditional tools and methods? First ensure that you really have, or will have Big Data at some point in the future. Then identify the stages of the workflow that are bottlenecks, in terms of the current technologies. 38 / 62
  • 39. The Value of Data The Big Data Revolution An Use Case Final Remarks A Big Data Approach Does this mean we have to throw away our traditional tools and methods? First ensure that you really have, or will have Big Data at some point in the future. Then identify the stages of the workflow that are bottlenecks, in terms of the current technologies. It is possible to mix & and match. 39 / 62
  • 40. The Value of Data The Big Data Revolution An Use Case Final Remarks Analysing geo-located Tweets 40 / 62
  • 41. The Value of Data The Big Data Revolution An Use Case Final Remarks Analysing geo-located Tweets The purpose of this use case was to analyse the stream of geo-located Tweets, as a sensor for citizen presence. 41 / 62
  • 42. The Value of Data The Big Data Revolution An Use Case Final Remarks Analysing geo-located Tweets The purpose of this use case was to analyse the stream of geo-located Tweets, as a sensor for citizen presence. Number of tweets in Catalunya in around 3 months: +- 6 million. 42 / 62
  • 43. The Value of Data The Big Data Revolution An Use Case Final Remarks Analysing geo-located Tweets The purpose of this use case was to analyse the stream of geo-located Tweets, as a sensor for citizen presence. Number of tweets in Catalunya in around 3 months: +- 6 million. Continuous stream of data. 43 / 62
  • 44. The Value of Data The Big Data Revolution An Use Case Final Remarks Analysing geo-located Tweets The purpose of this use case was to analyse the stream of geo-located Tweets, as a sensor for citizen presence. Number of tweets in Catalunya in around 3 months: +- 6 million. Continuous stream of data. This amount of data is not easily assimilated by the ”human-eye”, so we decided to create clusters of Tweets. 44 / 62
  • 45. The Value of Data The Big Data Revolution An Use Case Final Remarks An Use Case Clustering 45 / 62
  • 46. The Value of Data The Big Data Revolution An Use Case Final Remarks An Use Case Clustering It is a descriptive data mining technique, often used for dimensionality reduction. 46 / 62
  • 47. The Value of Data The Big Data Revolution An Use Case Final Remarks An Use Case Clustering It is a descriptive data mining technique, often used for dimensionality reduction. It groups a set of objects in such a way that objects in the same group are more similar to each other, than to object in other groups. 47 / 62
  • 48. The Value of Data The Big Data Revolution An Use Case Final Remarks An Use Case Clustering It is a descriptive data mining technique, often used for dimensionality reduction. It groups a set of objects in such a way that objects in the same group are more similar to each other, than to object in other groups. Strictly, it corresponds to a family of unsupervised machine learning algorithms. 48 / 62
  • 49. The Value of Data The Big Data Revolution An Use Case Final Remarks An Use Case Clustering It is a descriptive data mining technique, often used for dimensionality reduction. It groups a set of objects in such a way that objects in the same group are more similar to each other, than to object in other groups. Strictly, it corresponds to a family of unsupervised machine learning algorithms. We wanted to implement this concept using only Hadoop, and apply it to spatial attributes. 49 / 62
  • 50. The Value of Data The Big Data Revolution An Use Case Final Remarks Technological Stack 50 / 62
  • 51. The Value of Data The Big Data Revolution An Use Case Final Remarks Workflow 51 / 62
  • 52. The Value of Data The Big Data Revolution An Use Case Final Remarks Results Using this workflow we were able to turn the original point cloud, into a density grid, and then into clusters. Clusters expose patterns of presence over time, and can help us to re-define boundaries for the city that are data-driven, rather than administrative. 52 / 62
  • 53. The Value of Data The Big Data Revolution An Use Case Final Remarks Final Remarks 53 / 62
  • 54. The Value of Data The Big Data Revolution An Use Case Final Remarks Final Remarks It is ok to use non scalable tools, at certain points of the workflow. 54 / 62
  • 55. The Value of Data The Big Data Revolution An Use Case Final Remarks Final Remarks It is ok to use non scalable tools, at certain points of the workflow. We are working on the edge of existing technologies; some functions were not implemented yet, and bugs are common. 55 / 62
  • 56. The Value of Data The Big Data Revolution An Use Case Final Remarks Final Remarks It is ok to use non scalable tools, at certain points of the workflow. We are working on the edge of existing technologies; some functions were not implemented yet, and bugs are common. Since it is a niche, this is even more true for spatial technologies. 56 / 62
  • 57. The Value of Data The Big Data Revolution An Use Case Final Remarks Final Remarks It is ok to use non scalable tools, at certain points of the workflow. We are working on the edge of existing technologies; some functions were not implemented yet, and bugs are common. Since it is a niche, this is even more true for spatial technologies. There are not ready-made solutions: the particular stack and workflow should be compiled for the specific case study. 57 / 62
  • 58. The Value of Data The Big Data Revolution An Use Case Final Remarks Final Remarks It is ok to use non scalable tools, at certain points of the workflow. We are working on the edge of existing technologies; some functions were not implemented yet, and bugs are common. Since it is a niche, this is even more true for spatial technologies. There are not ready-made solutions: the particular stack and workflow should be compiled for the specific case study. this is one stack to solve this problem; it is not the only one, and it may not even be the ”best” one; 58 / 62
  • 59. The Value of Data The Big Data Revolution An Use Case Final Remarks Acknowledgements I would like to thank Ellen Friedman (MapR, Apache Mahout, Apache Drill), for her inspiring work on Big Data and Machine Learning. 59 / 62
  • 60. The Value of Data The Big Data Revolution An Use Case Final Remarks References Aji, A., Wang, F., Vo, H., Lee, R., Liu, Q., Zhang, X. and Saltz, J.” Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce”. Proceedings of the VLDB Endowment International Conference on Very Large Data Bases, 6(11), p1009. (2013) Cuesta, H. ”Practical Data Analysis”. Packt Publishing (2013) Dunning, T. and Friedman, E. ”Time Series Databases: New Ways to Store and Access Data”. O’Reilly Media; 1 edition (October, 2014) Myatt, G and Johnson, W. ”Making Sense Of Data I: A Practical Guide to Exploratory Data Analysis and Data Mining”. O’Reilly : 2 edition (2014). 60 / 62
  • 61. The Value of Data The Big Data Revolution An Use Case Final Remarks Thank You! This presentation is available at: http://tinyurl.com/pcmgzxp Next: 61 / 62
  • 62. The Value of Data The Big Data Revolution An Use Case Final Remarks Thank You! This presentation is available at: http://tinyurl.com/pcmgzxp Next: 9as Jornadas de SIG Libre - 26th-27th March 2015, Girona. 62 / 62