Slides by Gabriela Antunes Vieira
Course by Toon Vanagt
Data visualization is a generic term that describes
any attempt to help people understand data by providing
a visual representation.
◘ Visualizing data makes it much easier to analyse
and understand textual and numeric data.
◘ It saves time in decision making.
Combine, correlate and improve
quality of data sets
Even the most complicated data can
look easier once it has gone through the
process of visualization.
Visual analytics offers a story to
the viewers
Easier for business owners and
organisations to understand their
large data sets in a format that is
simple to understand
Avoid distorting what the data has
to say
Induce the viewer to think about the
substance rather than about
methodology, graphic design, etc…
Serve a reasonably clear purpose:
description, exploration, tabulation
or decoration
Make large data sets coherent
Show the data
Encourage the eye to compare
different pieces of data
Move up the information ladder by
asking users/patients for input
Combine, correlate and improve
quality of data sets
Bring new value from raw (open)
data sets
Visualise in new ways
Mine deeper to dig out “insights”
(not just basic statistics)
Any company can now run its
“own Google”
Data cleansing is
about identifying
the wrong or
inaccurate records in
a data set and
making appropriate
corrections to the
erroneous records
Identify incomplete,
inaccurate and
incorrect parts of
the data (elements)
Replace them with
correct data or
delete the incorrect
data element
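The identify / replace / delete steps above can be sketched in a few lines of Python. This is only a minimal illustration: the sample records, the field names (`name`, `year_built`), the validity rule and the correction table are all assumptions made up for the example, not part of the course material.

```python
# Hypothetical sample records; field names and values are made up for illustration.
records = [
    {"name": "Adelaide", "year_built": "1885"},
    {"name": "", "year_built": "1901"},          # incomplete: the name is missing
    {"name": "Cornelia", "year_built": "19O1"},  # inaccurate: letter O instead of zero
    {"name": "Vizcaya", "year_built": "1872"},
]

# Known corrections, keyed by (field, wrong value). Also an assumption.
corrections = {("year_built", "19O1"): "1901"}


def is_valid(record):
    """A record counts as valid when it has a name and a four-digit year."""
    year = record["year_built"]
    return bool(record["name"].strip()) and len(year) == 4 and year.isdigit()


def cleanse(rows):
    """Replace known-bad values, then delete records that are still invalid."""
    cleaned = []
    for row in rows:
        fixed = {field: corrections.get((field, value), value)
                 for field, value in row.items()}
        if is_valid(fixed):
            cleaned.append(fixed)  # keep the (possibly corrected) record
        # records that cannot be repaired are dropped
    return cleaned


cleaned_records = cleanse(records)
```

Here the record with the mistyped year is repaired, while the record without a name is deleted because nothing in the data says what the name should be.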
OpenRefine (formerly Google
Refine) is a powerful tool for
working with messy data:
◘ cleaning it
◘ transforming it from one format
into another
◘ extending it with web services
and external data.
https://bit.ly/2YzBHwu
https://bit.ly/2HWdmLh
This tutorial will walk you through some of the most common data-manipulation tasks
you’ll need to perform. When you’re done, you should know how to:
• clean up spelling inconsistencies
• remove leading and trailing whitespace
• split cells into multiple columns
DOWNLOAD LINK FOR OPENREFINE:
https://github.com/OpenRefine/OpenRefine/releases/
DOWNLOAD LINK FOR THIS EXERCISE DATA:
https://www.dropbox.com/s/w8gz5oifkvh376q/NJShipwrecks.csv?dl=0
Click on Create Project and then Choose Files. Navigate to the
NJShipwrecks.csv file and then click Next.
Just click on “Create Project”
This is the main interface you’ll use to work with your data. It sort of looks like Excel, but notice it shows
you only 10 records at a time. That’s because you’re not supposed to be working with your data record
by record; you’ll find ways to group it into batches and then work with it. We’ll try that next.
A facet is a way to isolate certain records that share features. It’s easier to see what I mean when you
try it yourself. Click on the down-arrow right next to the VESSEL TYPE column heading. Then
select Facet, and then Text Facet.
Look at the VESSEL TYPE list that appears on the
lefthand side of the OpenRefine window.
OpenRefine’s facet function has grouped together every
term that appears in the VESSEL TYPE column, along
with how many times it appears.
You can sort the list of terms alphabetically by name, or
by count, according to how many times those terms
appear on the list. If you click on one of the terms, only
those rows that contain that term will be selected. This
allows you to work on your data one chunk at a time.
Look closely at that list of terms. You’ll see that it
includes two terms that are probably meant to be the
same: Bark steamer and Bark Steamer. Even though a
human can tell they’re meant to refer to the same
thing, a computer doesn’t know that. So it’s important
to clean up this data to create accurate visualizations
and analyses.
Hover over the Bark Steamer term in the facet list, so
that you can see the Edit option. Press Edit and, in the
box that appears, change Bark Steamer to Bark
steamer and press Apply. Now the two terms will
merge into one.
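In code terms, editing a term in the facet list amounts to replacing one exact value with another in every row that contains it. A minimal Python sketch, with made-up rows and a hypothetical helper named `edit_facet_value`:

```python
# Made-up rows; only the VESSEL TYPE column is shown.
rows = [
    {"VESSEL TYPE": "Bark Steamer"},
    {"VESSEL TYPE": "Bark steamer"},
    {"VESSEL TYPE": "Schooner"},
]


def edit_facet_value(rows, column, old, new):
    """Replace every exact occurrence of `old` in `column`; return how many rows changed."""
    changed = 0
    for row in rows:
        if row[column] == old:
            row[column] = new
            changed += 1
    return changed


changed = edit_facet_value(rows, "VESSEL TYPE", "Bark Steamer", "Bark steamer")

# After the edit, the two spellings have collapsed into a single term.
distinct = sorted({row["VESSEL TYPE"] for row in rows})
```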
Look again at the Facet box. You’ll see a button
marked Cluster. Click it.
The resulting box shows you terms that OpenRefine
thinks should be merged together. Check the boxes of
the terms you think should be merged and then
click Merge Selected and Re-Cluster.
Now experiment with some of the other items on
the Method dropdown menu. What happens when you
try different methods? Each uses a different algorithm
to try to match terms.
When you’re finished experimenting, click Close. You’ll
notice you have fewer terms in your facet list.
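One way to picture what a clustering method does is "key collision": compute a normalised key for every term and group the terms whose keys are identical. The Python sketch below uses a simplified fingerprint-style key (trim, lowercase, drop ASCII punctuation, sort and de-duplicate the words). It is an assumption-laden simplification for illustration, not OpenRefine's actual code, and the sample values are made up.

```python
import string
from collections import defaultdict


def fingerprint(value):
    """Simplified key: trim, lowercase, strip ASCII punctuation, sort unique words."""
    cleaned = value.strip().lower().translate(
        str.maketrans("", "", string.punctuation)
    )
    return " ".join(sorted(set(cleaned.split())))


def cluster(values):
    """Group values that share a key; keep only groups with more than one spelling."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value)
    return [sorted(group) for group in groups.values() if len(group) > 1]


values = [
    "Pilot schooner", "Pilot Schooner", "schooner, pilot",
    "Sloop", " Sloop ", "Barge",
]
clusters = cluster(values)
```

Each list in `clusters` is a set of spellings that a person could then review and merge into one term, just as in the Cluster dialog. Other methods use different keys or distance measures, which is why they suggest different groups.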
A lot of the problems with the data in the VESSEL
TYPE column were the result of variant cases (e.g., Pilot
schooner versus Pilot Schooner). One way to eliminate
these problems is to make all of the terms
lowercase. Let’s do that now.
Click on the down arrow next to VESSEL TYPE. From the
dropdown menu, click Edit cells, and then Common
transforms. Finally, select To lowercase. Voila! All the
vessel types are now lowercase.
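For comparison, the same transform written in Python is a single pass over the column. The rows below are made up, and `to_lowercase` is a hypothetical helper name; empty cells are left alone.

```python
# Made-up rows; None stands for an empty cell.
rows = [
    {"VESSEL TYPE": "Pilot Schooner"},
    {"VESSEL TYPE": "Pilot schooner"},
    {"VESSEL TYPE": None},
]


def to_lowercase(rows, column):
    """Lowercase every text value in `column`, skipping empty cells."""
    for row in rows:
        value = row[column]
        if isinstance(value, str):
            row[column] = value.lower()


to_lowercase(rows, "VESSEL TYPE")
lowercased = [row["VESSEL TYPE"] for row in rows]
```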
One common problem with data is extra spaces before
and after the values. Those are easy to get rid of with
OpenRefine. On the Year Built column, click the down
arrow, then click Edit cells, then Common transforms.
Finally, click Trim leading and trailing whitespace.
Much better!
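Trimming is just as short in Python. The sketch below strips leading and trailing whitespace from a made-up Year Built column; `trim_whitespace` is a hypothetical helper, and whitespace inside a value is deliberately left untouched.

```python
# Made-up rows with stray spaces and a tab around the values.
rows = [
    {"Year Built": " 1885"},
    {"Year Built": "1901  "},
    {"Year Built": "\t1872 "},
]


def trim_whitespace(rows, column):
    """Remove leading and trailing whitespace from every text value in `column`."""
    for row in rows:
        value = row[column]
        if isinstance(value, str):
            row[column] = value.strip()


trim_whitespace(rows, "Year Built")
years = [row["Year Built"] for row in rows]
```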
Several of our columns contain location, formatted as
City, State. But let’s say we want states to appear in
their own column. That’s easy to do with OpenRefine.
Scroll to the Departure Point column. Click the down
arrow, then Edit column, and finally Split into several
columns. The popup window asks which separator
currently separates the values. Enter a comma and a
space, since those are the two characters that lie
between city and state. Then click OK.
You now have two columns! You can rename them by
clicking on the down arrow, then Edit column and
then Rename this column.
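The split itself can be sketched in Python as follows. The rows are made up, and the new column names ("Departure City", "Departure State") are assumptions chosen for the example; a value with no separator simply leaves the second column empty.

```python
# Made-up rows in "City, State" form, plus one value with no separator.
rows = [
    {"Departure Point": "Newark, NJ"},
    {"Departure Point": "New York, NY"},
    {"Departure Point": "Unknown"},
]


def split_column(rows, column, separator, new_names):
    """Split `column` on `separator` into the columns listed in `new_names`."""
    for row in rows:
        # Split at most len(new_names) - 1 times so extra separators stay in the last part.
        parts = row.pop(column).split(separator, len(new_names) - 1)
        # Pad with empty strings when a value has fewer parts than columns.
        parts += [""] * (len(new_names) - len(parts))
        row.update(zip(new_names, parts))


split_column(rows, "Departure Point", ", ", ["Departure City", "Departure State"])
```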
If you make a mistake in OpenRefine, no worries! It’s easy to
undo. Just click on the Undo/Redo link on the left-hand side of
the screen. Then click on the next-to-last step in the list. Your
last action will be reversed. If you change your mind and want to
redo it, just click the last step.
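A step list like this can be pictured as a stack of snapshots: every operation appends a new state, and clicking an earlier or later step just moves a pointer. The Python sketch below illustrates that idea only; the `History` class and its method names are hypothetical, and OpenRefine's own history mechanism may well work differently.

```python
import copy


class History:
    """Keeps one snapshot of the rows per step so any step can be revisited."""

    def __init__(self, rows):
        self._states = [copy.deepcopy(rows)]  # step 0: the data as loaded
        self._position = 0

    def current(self):
        return self._states[self._position]

    def apply(self, operation):
        """Run `operation` on a copy of the current rows and record the result."""
        rows = copy.deepcopy(self.current())
        operation(rows)
        # Applying a new step discards any steps that had been undone.
        self._states = self._states[: self._position + 1]
        self._states.append(rows)
        self._position += 1

    def jump_to(self, step):
        """Undo or redo by moving to any recorded step."""
        if not 0 <= step < len(self._states):
            raise IndexError("no such step")
        self._position = step


def lowercase_vessel_type(rows):
    for row in rows:
        row["VESSEL TYPE"] = row["VESSEL TYPE"].lower()


history = History([{"VESSEL TYPE": "Pilot Schooner"}])
history.apply(lowercase_vessel_type)
after_change = history.current()[0]["VESSEL TYPE"]

history.jump_to(0)  # click the next-to-last step: undo
after_undo = history.current()[0]["VESSEL TYPE"]

history.jump_to(1)  # click the last step again: redo
after_redo = history.current()[0]["VESSEL TYPE"]
```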
Let’s say we want to add the prefix S.S. to the name of any boat
that has the vessel type schooner. We’ll do that by first using our
vessel type facet to select all the rows with the term schooner in
the VESSEL TYPE column.
Once you have all of the schooners selected, head to the SHIP’S
NAME column. Click on the down arrow, then select Edit cells, and
then Transform…
The popup box that follows wants you to use a language called the
General Refine Expression Language (GREL, formerly the Google Refine
Expression Language) to transform your data.
You don’t have to actually know GREL; you just have to be able to
look up the pattern for the expression you want to write.
When you want to add a prefix to some data in OpenRefine, the
pattern looks like this:
"prefix" + value
So in the blank text box, type
"S.S. " + value
(Use straight quotation marks; GREL does not recognise curly ones.)
You’ll see a preview of how your data will look in the lower
right-hand column. When you’re satisfied, press OK.
Now the name of every schooner is prefixed with “S.S.”!
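The two steps (select rows with a facet, then transform one column) look like this in Python. The rows are made up, the column name is written with a straight apostrophe for convenience, and `add_prefix` is a hypothetical helper; it is a sketch of the idea rather than a translation of GREL.

```python
# Made-up rows; only two columns are shown.
rows = [
    {"SHIP'S NAME": "Adelaide", "VESSEL TYPE": "schooner"},
    {"SHIP'S NAME": "Vizcaya", "VESSEL TYPE": "steamer"},
    {"SHIP'S NAME": "Cornelia", "VESSEL TYPE": "schooner"},
]


def add_prefix(rows, prefix, name_column, type_column, vessel_type):
    """Prepend `prefix` to the name in every row whose type equals `vessel_type`."""
    for row in rows:
        if row[type_column] == vessel_type:          # the facet selection
            row[name_column] = prefix + row[name_column]  # the transform


add_prefix(rows, "S.S. ", "SHIP'S NAME", "VESSEL TYPE", "schooner")
names = [row["SHIP'S NAME"] for row in rows]
```

Rows that are not schooners are left exactly as they were, which is what the facet selection guarantees in OpenRefine.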
Once you’ve cleaned up your data, you’ll want to
get it out of OpenRefine. To do that, click on
the Export button in the upper right-hand corner.
Then click on Comma-separated value. Your
cleaned-up spreadsheet should begin
downloading. You can download your data as
many times as you want, at any stage of the
project.
To close OpenRefine, just close the window or tab
in your browser.
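If you ever clean data in a script instead, the export step corresponds to writing the rows back out as CSV. A minimal Python sketch using the standard `csv` module, with made-up rows and a hypothetical `to_csv` helper that returns the text instead of writing a file:

```python
import csv
import io

# Made-up cleaned rows.
rows = [
    {"SHIP'S NAME": "S.S. Adelaide", "VESSEL TYPE": "schooner"},
    {"SHIP'S NAME": "Vizcaya", "VESSEL TYPE": "steamer"},
]


def to_csv(rows):
    """Serialise a list of dictionaries as CSV text, header row first."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


csv_text = to_csv(rows)
```

To save the result, write `csv_text` to a file of your choice.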
These are some of the most common tasks you’ll
want to perform in OpenRefine, but OpenRefine
can also handle tasks of much greater complexity.
To get a sense of some of these tasks, see the
resources on the OpenRefine Resources page:
http://miriamposner.com/classes/dh101f17/tutorials-guides/data-manipulation/openrefine-resources/
• Data.gov (http://data.gov): the US Government pledged last
year to make all government data available freely online.
• Socrata is another interesting place to explore government-related
data, with some visualisation tools built in.
• European Union Open Data Portal (http://open-data.europa.eu/en/data/)
• Data.gov.uk (http://data.gov.uk/): data from the UK Government,
including the British National Bibliography, with metadata on all UK
books and publications since 1950.
• The CIA World Factbook
(https://www.cia.gov/library/publications/the-world-factbook/):
information on the history, population, economy, government,
infrastructure and military of 267 world entities.
• UNICEF offers statistics on the situation of women and children
worldwide.
• World Health Organization offers world hunger, health, and disease
statistics.
• Amazon Web Services public datasets (http://aws.amazon.com/datasets):
a huge resource of public data.
• Face.com: a fascinating tool for facial recognition data.
• Data Market is a place to check out data related to economics,
healthcare, food and agriculture, and the automotive industry.
• Google Public Data Explorer includes data from world development
indicators, OECD, and human development indicators, mostly related to
economics data and the world.
• Junar is a data scraping service that also includes data feeds.
• Buzzdata is a social data sharing service that allows you to upload
your own data and connect with others who are uploading their data.
• Google Finance (https://www.google.com/finance): 40 years’ worth of
stock market data, updated in real time.
• DBPedia (http://wiki.dbpedia.org): Wikipedia is comprised of millions
of pieces of data, structured and unstructured, on every subject
under the sun.
• UCI Machine Learning Repository is a collection of datasets
pre-processed for machine learning.
• … and many more; these are just a few examples.
http://sites.miis.edu/metalab/resources/open-refine/
https://drive.google.com/file/d/0B9OONoAXyPa5YU9kRDVMNXZrdVE/view
https://drive.google.com/file/d/0B9OONoAXyPa5eV84X1hzWFJ4VEE/view?usp=sharing
https://www.propublica.org/nerds/using-google-refine-for-data-cleaning
VIDI
Lets you create a visualization of your data for free. All
you have to do is upload your data, select a chart type, do a
little customization and you are good to go.
QLIK SENSE DESKTOP
Lets you create interactive data reports and visualizations for
free.
MICROSOFT BI PLATFORM
Lets you pull in data from different sources
and build a report out of it.
GOOGLE FUSION TABLES
One of the simplest tools for visualizing data. You can
simply upload a file and choose how to display it.
And many more here: https://bit.ly/2TAlAe4
ON 28 APRIL 2019, INFRABEL IS OPENING ITS DATA
AND HOLDING ITS FIRST HACKATHON.
REGISTRATION: https://www.eventbrite.be/e/trackathon-an-open-data-hackathon-registration-56970650750
Course by Toon Vanagt
Course 6 (part 2)   data visualisation by toon vanagt
Course 6 (part 2)   data visualisation by toon vanagt

Mais conteúdo relacionado

Mais procurados

Office 365 Productivity Tips -- November Smack-Down
Office 365 Productivity Tips -- November Smack-DownOffice 365 Productivity Tips -- November Smack-Down
Office 365 Productivity Tips -- November Smack-DownChristian Buckley
 
Office 365 Productivity Tips November 2017 November Smack-Down
Office 365 Productivity Tips November 2017 November Smack-DownOffice 365 Productivity Tips November 2017 November Smack-Down
Office 365 Productivity Tips November 2017 November Smack-DownThomas Duff
 
feelDweb official guide
feelDweb official guidefeelDweb official guide
feelDweb official guideSuperestratega
 
Pks ms access unit 4_bcomcs
Pks ms access unit 4_bcomcsPks ms access unit 4_bcomcs
Pks ms access unit 4_bcomcsKALAISELVI P
 
Microsoft Access 2010 - a jargon free guide
Microsoft Access 2010 - a jargon free guideMicrosoft Access 2010 - a jargon free guide
Microsoft Access 2010 - a jargon free guidePaul Barnett
 
2010 01 Introduction To Ms Word2007
2010 01 Introduction To Ms Word20072010 01 Introduction To Ms Word2007
2010 01 Introduction To Ms Word2007Buffalo Seminary
 
L3 ms access 2010 interface
L3 ms access 2010 interfaceL3 ms access 2010 interface
L3 ms access 2010 interfaceBryan Corpuz
 
Manual for Fillable Form
Manual for Fillable FormManual for Fillable Form
Manual for Fillable FormClayton Boessen
 
Microsoft word -_microsoft_word_exercise
Microsoft word -_microsoft_word_exerciseMicrosoft word -_microsoft_word_exercise
Microsoft word -_microsoft_word_exerciseSubeesh Up
 
Form4 cd4
Form4 cd4Form4 cd4
Form4 cd4smktsj2
 
Microsoft excel
Microsoft excelMicrosoft excel
Microsoft excelPremnath R
 

Mais procurados (20)

Office 365 Productivity Tips -- November Smack-Down
Office 365 Productivity Tips -- November Smack-DownOffice 365 Productivity Tips -- November Smack-Down
Office 365 Productivity Tips -- November Smack-Down
 
Office 365 Productivity Tips November 2017 November Smack-Down
Office 365 Productivity Tips November 2017 November Smack-DownOffice 365 Productivity Tips November 2017 November Smack-Down
Office 365 Productivity Tips November 2017 November Smack-Down
 
Microsoft Word Seminar Part 1
Microsoft Word Seminar   Part 1Microsoft Word Seminar   Part 1
Microsoft Word Seminar Part 1
 
feelDweb official guide
feelDweb official guidefeelDweb official guide
feelDweb official guide
 
Pks ms access unit 4_bcomcs
Pks ms access unit 4_bcomcsPks ms access unit 4_bcomcs
Pks ms access unit 4_bcomcs
 
Word
WordWord
Word
 
06.ms acess
06.ms acess06.ms acess
06.ms acess
 
Microsoft Access 2010 - a jargon free guide
Microsoft Access 2010 - a jargon free guideMicrosoft Access 2010 - a jargon free guide
Microsoft Access 2010 - a jargon free guide
 
2010 01 Introduction To Ms Word2007
2010 01 Introduction To Ms Word20072010 01 Introduction To Ms Word2007
2010 01 Introduction To Ms Word2007
 
L3 ms access 2010 interface
L3 ms access 2010 interfaceL3 ms access 2010 interface
L3 ms access 2010 interface
 
Notacd04
Notacd04Notacd04
Notacd04
 
Notacd04
Notacd04Notacd04
Notacd04
 
Manual for Fillable Form
Manual for Fillable FormManual for Fillable Form
Manual for Fillable Form
 
Microsoft word -_microsoft_word_exercise
Microsoft word -_microsoft_word_exerciseMicrosoft word -_microsoft_word_exercise
Microsoft word -_microsoft_word_exercise
 
Chapter.10
Chapter.10Chapter.10
Chapter.10
 
Intro to Microsoft Word
Intro to Microsoft WordIntro to Microsoft Word
Intro to Microsoft Word
 
MS ACCESS (REPORT)
MS ACCESS (REPORT)MS ACCESS (REPORT)
MS ACCESS (REPORT)
 
Form4 cd4
Form4 cd4Form4 cd4
Form4 cd4
 
Excel10basics
Excel10basicsExcel10basics
Excel10basics
 
Microsoft excel
Microsoft excelMicrosoft excel
Microsoft excel
 

Semelhante a Course 6 (part 2) data visualisation by toon vanagt

Access tips access and sql part 5 more instant queries 1
Access tips  access and sql part 5  more instant queries 1Access tips  access and sql part 5  more instant queries 1
Access tips access and sql part 5 more instant queries 1quest2900
 
Reports in Horizon
Reports in HorizonReports in Horizon
Reports in HorizonJohnny Pe
 
Microsoft word tutorials COMPLETE by Gulshan K Maheshwari(QAU)
Microsoft word tutorials COMPLETE by Gulshan  K Maheshwari(QAU)Microsoft word tutorials COMPLETE by Gulshan  K Maheshwari(QAU)
Microsoft word tutorials COMPLETE by Gulshan K Maheshwari(QAU)GulshanKumar368
 
ACTG Tableau intro labDownload the TableauLab file and look ov.docx
ACTG   Tableau intro labDownload the TableauLab file and look ov.docxACTG   Tableau intro labDownload the TableauLab file and look ov.docx
ACTG Tableau intro labDownload the TableauLab file and look ov.docxnettletondevon
 
MS Access - Create Form.pdf
MS Access - Create Form.pdfMS Access - Create Form.pdf
MS Access - Create Form.pdfSRReliability
 
My aspiration as a Graduate Student in the Davenport University Ma.docx
My aspiration as a Graduate Student in the Davenport University Ma.docxMy aspiration as a Graduate Student in the Davenport University Ma.docx
My aspiration as a Graduate Student in the Davenport University Ma.docxrosemarybdodson23141
 
5 tips to improve ebs navigation
5 tips to improve ebs navigation5 tips to improve ebs navigation
5 tips to improve ebs navigationkjkombrink
 
Introduction to Microsoft Access2.ppt
Introduction to Microsoft Access2.pptIntroduction to Microsoft Access2.ppt
Introduction to Microsoft Access2.pptJoshCasas1
 
Presentation111122222.pptx
Presentation111122222.pptxPresentation111122222.pptx
Presentation111122222.pptxJustusEkeneNwaka
 
Microsoft Access Notes 2007 Ecdl
Microsoft Access Notes 2007 EcdlMicrosoft Access Notes 2007 Ecdl
Microsoft Access Notes 2007 EcdlRichard Butler
 
B.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms accessB.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms accessRai University
 

Semelhante a Course 6 (part 2) data visualisation by toon vanagt (20)

Access tips access and sql part 5 more instant queries 1
Access tips  access and sql part 5  more instant queries 1Access tips  access and sql part 5  more instant queries 1
Access tips access and sql part 5 more instant queries 1
 
Reports in Horizon
Reports in HorizonReports in Horizon
Reports in Horizon
 
3320 lab1
3320 lab13320 lab1
3320 lab1
 
WORD LESSON (1).pptx
WORD LESSON (1).pptxWORD LESSON (1).pptx
WORD LESSON (1).pptx
 
Ms Access
Ms AccessMs Access
Ms Access
 
Microsoft word tutorials COMPLETE by Gulshan K Maheshwari(QAU)
Microsoft word tutorials COMPLETE by Gulshan  K Maheshwari(QAU)Microsoft word tutorials COMPLETE by Gulshan  K Maheshwari(QAU)
Microsoft word tutorials COMPLETE by Gulshan K Maheshwari(QAU)
 
ACTG Tableau intro labDownload the TableauLab file and look ov.docx
ACTG   Tableau intro labDownload the TableauLab file and look ov.docxACTG   Tableau intro labDownload the TableauLab file and look ov.docx
ACTG Tableau intro labDownload the TableauLab file and look ov.docx
 
Word2010 advanced
Word2010 advancedWord2010 advanced
Word2010 advanced
 
Word processing
Word processingWord processing
Word processing
 
MS Access - Create Form.pdf
MS Access - Create Form.pdfMS Access - Create Form.pdf
MS Access - Create Form.pdf
 
My aspiration as a Graduate Student in the Davenport University Ma.docx
My aspiration as a Graduate Student in the Davenport University Ma.docxMy aspiration as a Graduate Student in the Davenport University Ma.docx
My aspiration as a Graduate Student in the Davenport University Ma.docx
 
5 tips to improve ebs navigation
5 tips to improve ebs navigation5 tips to improve ebs navigation
5 tips to improve ebs navigation
 
Introduction to Microsoft Access2.ppt
Introduction to Microsoft Access2.pptIntroduction to Microsoft Access2.ppt
Introduction to Microsoft Access2.ppt
 
Mca 504 dotnet_unit5
Mca 504 dotnet_unit5Mca 504 dotnet_unit5
Mca 504 dotnet_unit5
 
Presentation111122222.pptx
Presentation111122222.pptxPresentation111122222.pptx
Presentation111122222.pptx
 
G7 q1w1.pptx
G7 q1w1.pptxG7 q1w1.pptx
G7 q1w1.pptx
 
Microsoft Access Notes 2007 Ecdl
Microsoft Access Notes 2007 EcdlMicrosoft Access Notes 2007 Ecdl
Microsoft Access Notes 2007 Ecdl
 
Tugas testing
Tugas testingTugas testing
Tugas testing
 
(Manual spss)
(Manual spss)(Manual spss)
(Manual spss)
 
B.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms accessB.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms access
 

Mais de Betacowork

Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez Betacowork
 
Course 10 : Introduction to machine learning by Christoph Evers
Course 10 :  Introduction to machine learning by Christoph EversCourse 10 :  Introduction to machine learning by Christoph Evers
Course 10 : Introduction to machine learning by Christoph EversBetacowork
 
Course 5: GDPR & Big Data by Sari Depreeuw
Course 5: GDPR & Big Data by Sari DepreeuwCourse 5: GDPR & Big Data by Sari Depreeuw
Course 5: GDPR & Big Data by Sari DepreeuwBetacowork
 
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...Betacowork
 
Course 3 : Types of data and opportunities by Nikolaos Deligiannis
Course 3 : Types of data and opportunities by Nikolaos DeligiannisCourse 3 : Types of data and opportunities by Nikolaos Deligiannis
Course 3 : Types of data and opportunities by Nikolaos DeligiannisBetacowork
 
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)Betacowork
 

Mais de Betacowork (6)

Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez Course 8 : How to start your big data project by Eric Rodriguez
Course 8 : How to start your big data project by Eric Rodriguez
 
Course 10 : Introduction to machine learning by Christoph Evers
Course 10 :  Introduction to machine learning by Christoph EversCourse 10 :  Introduction to machine learning by Christoph Evers
Course 10 : Introduction to machine learning by Christoph Evers
 
Course 5: GDPR & Big Data by Sari Depreeuw
Course 5: GDPR & Big Data by Sari DepreeuwCourse 5: GDPR & Big Data by Sari Depreeuw
Course 5: GDPR & Big Data by Sari Depreeuw
 
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
 
Course 3 : Types of data and opportunities by Nikolaos Deligiannis
Course 3 : Types of data and opportunities by Nikolaos DeligiannisCourse 3 : Types of data and opportunities by Nikolaos Deligiannis
Course 3 : Types of data and opportunities by Nikolaos Deligiannis
 
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
 

Último

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 

Último (20)

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Course 6 (part 2): Data visualisation, by Toon Vanagt

  • 1. Slides by Gabriela Antunes Vieira
  • 2. Data visualization is a generic term for any attempt to aid the understanding of data by representing it visually.
       ◘ Visualizing data makes textual and numeric data much easier to analyse and understand.
       ◘ It saves time in decision making.
  • 3.
       ◘ Combine, correlate and improve the quality of data sets.
       ◘ Even the most complicated data looks easy once it has gone through the process of visualization.
       ◘ Visual analytics offers a story to the viewers.
       ◘ It is easier for business owners and organisations to understand their large data sets in a simple, easy-to-understand format.
  • 4.
       ◘ Avoid distorting what the data has to say.
       ◘ Induce the viewer to think about the substance rather than about methodology, graphic design, etc.
       ◘ Serve a reasonably clear purpose: description, exploration, tabulation or decoration.
       ◘ Make large data sets coherent.
       ◘ Show the data.
       ◘ Encourage the eye to compare different pieces of data.
  • 7.
       ◘ Move up the information ladder by asking users/patients for input.
       ◘ Combine, correlate and improve the quality of data sets.
       ◘ Bring new value from raw (open) data sets.
       ◘ Visualise in new ways.
       ◘ Mine deeper to dig out “insights” (not just basic statistics).
       ◘ Any company can now run its “own Google”.
  • 9. Data cleansing is about identifying the wrong or inaccurate records in a data set and making appropriate corrections to the erroneous records.
       ◘ Identify incomplete, inaccurate and incorrect parts of the data (elements).
       ◘ Replace them with correct data, or delete the incorrect data element.
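The identify-then-delete step can be sketched in a few lines of Python. This is only an illustration under assumptions: the records, the field names (`name`, `age`) and the validity rule are hypothetical and do not come from the course material.

```python
# Hypothetical records: an empty string marks a missing value.
records = [
    {"name": "Alice", "age": "34"},
    {"name": "", "age": "29"},        # incomplete: the name is missing
    {"name": "Carol", "age": "abc"},  # inaccurate: the age is not a number
]


def is_valid(record):
    """Assumed rule: a record needs a non-empty name and a numeric age."""
    return bool(record["name"].strip()) and record["age"].strip().isdigit()


# Keep the valid records and set the others aside for correction or deletion.
clean = [r for r in records if is_valid(r)]
rejected = [r for r in records if not is_valid(r)]
```

Keeping the rejected records in their own list, rather than dropping them silently, leaves room to replace them with corrected values later.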
  • 11. OpenRefine (formerly Google Refine) is a powerful tool for working with messy data:
       ◘ cleaning it
       ◘ transforming it from one format into another
       ◘ extending it with web services and external data.
  • 12. https://bit.ly/2YzBHwu
  • 13. https://bit.ly/2HWdmLh
  • 14. This tutorial will walk you through some of the most common data-manipulation tasks you’ll need to perform. When you’re done, you should know how to:
       ◘ clean up spelling inconsistencies
       ◘ remove leading and trailing whitespace
       ◘ split cells into multiple columns
  • 15.
       ◘ Download link for OpenRefine: https://github.com/OpenRefine/OpenRefine/releases/
       ◘ Download link for this exercise’s data: https://www.dropbox.com/s/w8gz5oifkvh376q/NJShipwrecks.csv?dl=0
  • 17. Click on Create Project and then Choose Files. Navigate to the NJShipwrecks.csv file and then click Next.
  • 18. Just click on “Create Project”
  • 19. This is the main interface you’ll use to work with your data. It sort of looks like Excel, but notice it shows you only 10 records at a time. That’s because you’re not supposed to be working with your data record by record; you’ll find ways to group it into batches and then work with it. We’ll try that next.
  • 20. A facet is a way to isolate certain records that share features. It’s easier to see what I mean when you try it yourself. Click on the down-arrow right next to the VESSEL TYPE column heading. Then select Facet, and then Text Facet.
  • 21. Look at the VESSEL TYPE list that appears on the left-hand side of the OpenRefine window. OpenRefine’s facet function has grouped together every term that appears in the VESSEL TYPE column, along with how many times it appears. You can sort the list of terms alphabetically by name, or by count, according to how many times each term appears in the column. If you click on one of the terms, only those rows that contain that term will be selected. This allows you to work on your data one chunk at a time.
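Conceptually, a text facet is a count of each distinct value in a column. The sketch below mimics that in Python; the sample values are made up for illustration and are not taken from NJShipwrecks.csv, and this is not how OpenRefine itself is implemented.

```python
from collections import Counter

# Hypothetical values standing in for the VESSEL TYPE column.
vessel_types = ["Schooner", "Bark steamer", "Schooner", "Bark Steamer", "Sloop"]

# The facet: every distinct term with the number of rows it appears in.
facet = Counter(vessel_types)

# The two sort orders offered in the facet box.
by_count = facet.most_common()   # most frequent term first
by_name = sorted(facet.items())  # alphabetical (uppercase sorts before lowercase)

# Clicking a term selects only the rows that contain it.
selected_rows = [i for i, v in enumerate(vessel_types) if v == "Schooner"]
```

Note that "Bark steamer" and "Bark Steamer" are counted as two different terms, which is exactly the kind of inconsistency a facet makes visible.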
  • 22. Look closely at that list of terms. You’ll see that it includes two terms that are probably meant to be the same: Bark steamer and Bark Steamer. Even though a human can tell they’re meant to refer to the same thing, a computer doesn’t know that. So it’s important to clean up this data to create accurate visualizations and analyses. Hover over the Bark Steamer term in the facet list, so that you can see the Edit option. Press Edit and, in the box that appears, change Bark Steamer to Bark steamer and press Apply. Now the two terms will merge into one.
  • 23. Look again at the Facet box. You’ll see a button marked Cluster. Click it. The resulting box shows you terms that OpenRefine thinks should be merged together. Check the boxes of the terms you think should be merged and then click Merge Selected and Re-Cluster. Now experiment with some of the other items on the Method dropdown menu. What happens when you try different methods? Each uses a different algorithm to try to match terms. When you’re finished experimenting, click Close. You’ll notice you have fewer terms in your facet list.
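To get a feel for what a clustering method does, here is a simplified Python sketch of the key-collision idea: each term is reduced to a normalized key, and terms that share a key are proposed as one cluster. The `fingerprint` function below is an assumption-laden simplification (OpenRefine's own fingerprint keying also handles accents and control characters), and the sample values are hypothetical.

```python
import string
from collections import defaultdict


def fingerprint(value):
    """Simplified key: trim, lowercase, drop punctuation, sort unique tokens."""
    no_punctuation = str.maketrans("", "", string.punctuation)
    cleaned = value.strip().lower().translate(no_punctuation)
    return " ".join(sorted(set(cleaned.split())))


def cluster(values):
    """Group values whose keys collide; keep groups with more than one variant."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value)
    return [sorted(group) for group in groups.values() if len(group) > 1]


# Hypothetical variants of the same vessel type.
values = ["Pilot schooner", "Pilot Schooner", "schooner, pilot", "Sloop"]
clusters = cluster(values)
```

Other methods in the dropdown use different keys or distance measures, which is why they propose different groupings for the same column.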
  • 24. A lot of the problems with the data in the VESSEL TYPE column were the result of variant cases (e.g., Pilot schooner versus Pilot Schooner). One way to eliminate these problems would be to make all of the terms lowercase. Let’s do that now. Click on the down arrow next to VESSEL TYPE. From the dropdown menu, click Edit cells, and then Common transforms. Finally, select To lowercase. Voila! All the vessel types are now lowercase.
  • 25. One common problem with data is extra spaces before and after the values. Those are easy to get rid of with OpenRefine. On the Year Built column, click the down arrow, then click Edit cells, then Common transforms. Finally, click Trim leading and trailing whitespace. Much better!
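Both common transforms, lowercasing and trimming whitespace, map onto ordinary string operations. The Python sketch below shows the equivalent on hypothetical sample values; it illustrates the effect, not OpenRefine's internals.

```python
# Hypothetical values standing in for the VESSEL TYPE and Year Built columns.
vessel_types = ["Pilot schooner", "Pilot Schooner"]
years_built = [" 1850", "1862 ", " 1871 "]

# "To lowercase": case variants collapse into a single term.
lowered = [value.lower() for value in vessel_types]

# "Trim leading and trailing whitespace": spaces around the value are removed.
trimmed = [value.strip() for value in years_built]
```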
  • 26. Several of our columns contain a location, formatted as City, State. But let’s say we want states to appear in their own column. That’s easy to do with OpenRefine. Scroll to the Departure Point column. Click the down arrow, then Edit column, and finally Split into several columns. The popup window asks which separator currently separates the values. Enter a comma and a space, since those are the two characters that lie between city and state. Then click OK. You now have two columns! You can rename them by clicking on the down arrow, then Edit column and then Rename.
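The split can be pictured as cutting each value at the first occurrence of the separator. Here is a minimal Python sketch, assuming hypothetical Departure Point values and a hypothetical helper named `split_city_state`.

```python
# Hypothetical values standing in for the Departure Point column.
departure_points = ["Camden, NJ", "New York, NY", "Unknown"]


def split_city_state(value, separator=", "):
    """Split at the first separator; the state is empty if none is found."""
    city, _, state = value.partition(separator)
    return city, state


# One (city, state) pair per row, i.e. two columns instead of one.
columns = [split_city_state(value) for value in departure_points]
```

Using a comma plus a space as the separator avoids leaving a leading space in front of the state.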
  • 27. If you make a mistake in OpenRefine, no worries! It’s easy to undo. Just click on the Undo/Redo link on the left-hand side of the screen. Then click on the next-to-last step in the list. Your last action will be reversed. If you change your mind and want to redo it, just click the last step.
  • 28. Let’s say we want to add the prefix S.S. to the name of any boat that has the vessel type schooner. We’ll do that by first using our vessel type facet to select all the rows with the term schooner in the VESSEL TYPE column. Once you have all of the schooners selected, head to the SHIP’S NAME column. Click on the down arrow, then select Edit cells, and then Transform… The popup box that follows wants you to use a language called the General Refine Expression Language (GREL, formerly the Google Refine Expression Language) to transform your data. You don’t have to actually know GREL; you just have to be able to look up the pattern for the expression you want to write. When you want to add a prefix to some data in OpenRefine, the pattern looks like this: "prefix" + value. So in the blank text box, type "S.S. " + value (use straight quotes, and note the space after the second period). You’ll see a preview of how your data will look in the lower right-hand column. When you’re satisfied, press OK. Now the name of every schooner is prefaced with “S.S.”!
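The same select-then-transform logic can be written out in Python to make the two steps explicit: filter on the vessel type, then prepend the prefix to the name. The rows and the field names (`vessel_type`, `ship_name`) are hypothetical stand-ins for the spreadsheet columns.

```python
# Hypothetical rows standing in for the VESSEL TYPE and SHIP'S NAME columns.
rows = [
    {"vessel_type": "schooner", "ship_name": "Adelaide"},
    {"vessel_type": "sloop", "ship_name": "Mary"},
    {"vessel_type": "schooner", "ship_name": "Cornelia"},
]

PREFIX = "S.S. "  # note the trailing space, as in the GREL expression

for row in rows:
    # The facet selection: only rows whose vessel type is "schooner".
    if row["vessel_type"] == "schooner":
        # The transform: the equivalent of "S.S. " + value.
        row["ship_name"] = PREFIX + row["ship_name"]

ship_names = [row["ship_name"] for row in rows]
```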
  • 29. Once you’ve cleaned up your data, you’ll want to get it out of OpenRefine. To do that, click on the Export button in the upper right-hand corner. Then click on Comma-separated value. Your cleaned-up spreadsheet should begin downloading. You can download your data as many times as you want, at any stage of the project. To close OpenRefine, just close the window or tab in your browser.
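For readers who clean data in a script instead, the export step corresponds to writing rows out as CSV. A minimal sketch with Python's standard `csv` module follows; the rows and column names are hypothetical, and the output goes to an in-memory buffer rather than a downloaded file.

```python
import csv
import io

# Hypothetical cleaned rows.
rows = [
    {"ship_name": "S.S. Adelaide", "vessel_type": "schooner"},
    {"ship_name": "Mary", "vessel_type": "sloop"},
]

# Write a header line followed by one line per row.
buffer = io.StringIO()
writer = csv.DictWriter(
    buffer, fieldnames=["ship_name", "vessel_type"], lineterminator="\n"
)
writer.writeheader()
writer.writerows(rows)

csv_text = buffer.getvalue()
```

To write to disk instead, the buffer can be replaced with a file opened via `open(path, "w", newline="")`.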
  • 30. These are some of the most common tasks you’ll want to perform in OpenRefine, but OpenRefine can also handle tasks of much greater complexity. To get a sense of some of these tasks, see the resources on the OpenRefine Resources page: http://miriamposner.com/classes/dh101f17/tutorials-guides/data-manipulation/openrefine-resources/
  • 31.
       ◘ Data.gov (http://data.gov): the US Government pledged last year to make all government data freely available online.
       ◘ Socrata is another interesting place to explore government-related data, with some visualisation tools built in.
       ◘ European Union Open Data Portal (http://open-data.europa.eu/en/data/)
       ◘ Data.gov.uk (http://data.gov.uk/): data from the UK Government, including the British National Bibliography, with metadata on all UK books and publications since 1950.
       ◘ The CIA World Factbook (https://www.cia.gov/library/publications/the-world-factbook/): information on the history, population, economy, government, infrastructure and military of 267 countries.
       ◘ UNICEF offers statistics on the situation of women and children worldwide.
       ◘ World Health Organization offers world hunger, health, and disease statistics.
       ◘ Amazon Web Services public datasets (http://aws.amazon.com/datasets): a huge resource of public data.
       ◘ Face.com: a fascinating tool for facial recognition data.
       ◘ Data Market is a place to check out data related to economics, healthcare, food and agriculture, and the automotive industry.
       ◘ Google Public Data Explorer includes data from world development indicators, the OECD, and human development indicators, mostly related to economics and the world.
       ◘ Junar is a data scraping service that also includes data feeds.
       ◘ Buzzdata is a social data sharing service that allows you to upload your own data and connect with others who are uploading their data.
       ◘ Google Finance (https://www.google.com/finance): 40 years’ worth of stock market data, updated in real time.
       ◘ DBpedia (http://wiki.dbpedia.org): Wikipedia is comprised of millions of pieces of data, structured and unstructured, on every subject under the sun.
       ◘ UCI Machine Learning Repository is a collection of datasets pre-processed specifically for machine learning.
       ◘ … and so many more; these are just a few examples.
  • 34.
       ◘ VIDI lets you create a visualization of your data for free. All you have to do is upload your data, select a type, do a little customization, and you are good to go.
       ◘ Qlik Sense Desktop lets you create interactive data reports and visualizations for free.
       ◘ Microsoft BI Platform allows you to update your data from different sources and makes a report out of it.
       ◘ Google Fusion Tables is one of the simplest tools for visualizing data. You can simply upload a file and choose how to display it.
       ◘ And many more here: https://bit.ly/2TAlAe4
  • 36. On 28 April 2019, Infrabel is opening its data and holding its first hackathon. Registration: https://www.eventbrite.be/e/trackathon-an-open-data-hackathon-registration-56970650750