SlideShare uma empresa Scribd logo
1 de 46
Baixar para ler offline
An Introduction to Data
       Visualisation for Analysis
                 Exploring the Dataset -
            Textual, Numerical and Otherwise




http://www.slideshare.net/shawnday/m-phil-datavisforanalysis
Agenda
  Thoughts from last week - wordpress.com?
  Introduction
  What do we mean by Data Analysis?
  Some foundation terms and concepts
  The Data Visualisation Process
  Tools and Methods
  Extending your toolset
  An Exercise
Objective


    To appreciate the rich variety of techniques and
   tools available to digital humanities scholars for
            data visualisation and analysis.
     The intention is to be able to add tools to your
   arsenal and to have a sense of where to look for
                          more.
Breakpoint

        One of the keys to good visualization is
   understanding what your immediate goals are.
  Are you visualizing data to understand what’s in it,
    or are you trying to communicate meaning to
                        others?
         You - Visualisation for Data Analysis
        Others - Visualisation for Presentation
Speaking of Data Analysis
   SPSS
   SAS
   OS Equivalents
So Why Would You Want to Visualise
Your Data?
   Bypass language centres to tap directly into the
   visual cortex
   Leverage ability to recognise patterns - what they
   call visual sense-making
   Powerful graphics engines now allow for live
   data processing and sophisticated animations
   and interactive research environments




                               Sources: Geoff McGhee, Getting Started with Data Viz
So Why Would You Want to Visualise
Your Data?
   Work with new data to create new knowledge
   Explore data to discover things that used to be
   unknown, unknowable or impractical to know
   Take a new perspective on the familiar to reveal
   previously hidden insights
Visualising New Information




                  Tourists vs Locals, Eric Fischer, 2010 - Flickr
Visualising New Information




            Flickr Flow, Martin Wattenberg and Fernanda Viegas, 2009
The Familiar through New Eyes




                  The Times Atlas
How Could You Use Data Analysis
   “In the Lab” - for your own analysis
   Online as part of collabourative groups
   Through dissemination for extension of own work
   - crowdsourcing
   Others?
The Time Ribbon and the Tree Map
Visualisation Objective
   Exploring the ordinary life of rural pioneers in
     nineteenth century Ontario
Farm Journal




               William Sunter Farm Diary, 1858
Diaries: the raw materials
   • 100s of pages
   • Varying hands
   • Varying quality
The Process
  • Generate word frequency (Voyeur, TAPoR)
  • Isolate known farm activities (NLP -
    LanguageWare)
  • Collocate to link activity references to time,
    duration, and resources (Voyeur)
Example: Medical Diary




                         Medical Diary by BlueChillies
Example: History Flow




                        History flow by Martin Wattenberg and Fernanda Viegas
The Result/ New Patterns
The Result/ New Patterns
•Less time haying
•The impact of technology
•More tasks faster
How Else Could this be done?
What is the Value of this Visualisation

  • Easier to compare over intervals
  • Multiple vectors with greater granularity in a
    compressed space
  • The challenge is to find rich enough source
    materials to yield substantive datasets
The Tree Map
Example: Newsmap
Example: Panopticon
Case Study:
Occupations of Politicians
   • What are we studying?
     – Self-declared occupations of politicians
   • Why?
     – What bias might they bring to their job?
   • How?
     – Visualising past occupation and mapping to political
       platform of party affiliated with
Occupations of TDs in the 30th Dáil
Occupations of MPs in the 2nd Parliament
Occupations of MPs in the 37th Parliament
The Result/ New Patterns
  • The emergence of the professional politician with
    no private sector experience
  • Occupational continuity across changes in
    governing party
How Else Could this be Done?
The Value of Data Vis for Analysis
   • New ways of presenting allow new ways of seeing
   • Hidden patterns become evident
   • Suggest other hypothesis to test
Basic Terms
   Datamining
   Statistics
   Structured/Unstructured Data
   Visualisation
   Modelling
Types of Data to Visualise
   Audio Data            Network Data
   Categorical Data          Social
   Cartographic Data         Other

   Collections           Numerical Data
   Image Data            Temporal Data
     Still               Textual Data
     Moving                  Narrative
   Metadata                  Qualitative

   Multimedia Data       ????
General Steps in Data Vis for DH
   Discovery / Acquisition
   Cleaning / ‘Munging’
   Analysis / Exploratory Vis
   Presentation
Discovery / Acquisition
   Original Research      Scraping
     Spreadsheets           Junar
     Databases              Outwit Hub
     Digitized Media        ScraperWiki

   Other Downloads
     Public Data
     Archives/Libraries
     Academic Partners
     Purchase
Demo/Hands-On: Junar
  http://www.junar.com
Cleaning / Munging
(Normalisation, Format Conversion)
    Tools:
      Data Wrangler
      Google Refine
      Mr. Data Converter

    Data Wrangler
      Does simple, split, clear, fold/unfold transforms on data
      See example --> Data and Script

    Google Refine
      Works with larger datasets
Hands-On: Data Wrangler
   http://vis.stanford.edu/wrangler/app/
Hands-On: Google Refine
   http://code.google.com/p/google-refine/
Hands-On: Mr Data Converter
   http://shancarter.com/data_converter/
Analysis / Exploratory Visualisation
     Web Services
       Google Fusion Tables
       Google Spreadsheets
       IBM ManyEyes
       TimeFlow
     Applications
       Tableau/Tableau Public
       MS Office
       OpenOffice
       Gephi
       Node XL (plug-in for Excel)
       Spotfire
       R Processing
Google NGram Viewers
  Examine word frequency in digitised books
  Currently about 4% of books ever published
  In English, Chinese, French, German, Hebrew,
  Russian, and Spanish
  Changes in word usage
  Trends

  Check out the Cultural Observatory @ Harvard
Google NGram Viewer
Wordle
  Visually present word frequency using size,
  weight, colour




  Consider Word Clouds Considered Harmful
Exercise
   Choose a dataset from a source such as:
      The CSO
      Project Guttenberg
      or your own material
   Choose an appropriate Data Visualisation from a webservice we explored
   in workshop.
   Explain the process and how you madeyour choice and embed it in your
   own blog using wordpress.com as we explored last week.
   Suggest a research question that can be answered by using this data
   visualisation as a research environment
   Send the link to me at: days@tcd.ie
   Maybe: http://politicalreform.ie/2011/12/04/state-of-enda-sunday-
   business-post-red-c-poll-4th-september-2011/

Mais conteúdo relacionado

Mais procurados

PASTEUR4OA Data visualisation
PASTEUR4OA Data visualisationPASTEUR4OA Data visualisation
PASTEUR4OA Data visualisationMarieke Guy
 
Data Management for Mountain Observatories Workshop
Data Management for Mountain Observatories WorkshopData Management for Mountain Observatories Workshop
Data Management for Mountain Observatories WorkshopCarly Strasser
 
creating a trading zone around twitter srchives. case study: paris attacks
creating a trading zone around twitter srchives. case study: paris attackscreating a trading zone around twitter srchives. case study: paris attacks
creating a trading zone around twitter srchives. case study: paris attacksFIAT/IFTA
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web DataMarieke Guy
 
ESA Ignite talk on UC3 Dash platform for data sharing
ESA Ignite talk on UC3 Dash platform for data sharingESA Ignite talk on UC3 Dash platform for data sharing
ESA Ignite talk on UC3 Dash platform for data sharingCarly Strasser
 
Advanced web searching
Advanced web searchingAdvanced web searching
Advanced web searchingelisacho
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platformsYannis Charalabidis
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Herbert Van de Sompel
 
Life after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the FutureLife after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the FutureEmily Nimsakont
 
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731jeffreylancaster
 
IIIF as an Enabler to Interoperability within a Single Institution
IIIF as an Enabler to Interoperability within a Single InstitutionIIIF as an Enabler to Interoperability within a Single Institution
IIIF as an Enabler to Interoperability within a Single InstitutionIIIF_io
 
Zeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhZeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhMarcia Zeng
 
GLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web MeetupGLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web MeetupAdrian Stevenson
 
Life after MARC: Experimenting with Cataloging Tools of the Future
Life after MARC: Experimenting with Cataloging Tools of the FutureLife after MARC: Experimenting with Cataloging Tools of the Future
Life after MARC: Experimenting with Cataloging Tools of the FutureEmily Nimsakont
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open ScienceSarah Jones
 
What is Linked Data, and What Does It Mean for Libraries?
What is Linked Data, and What Does It Mean for Libraries?What is Linked Data, and What Does It Mean for Libraries?
What is Linked Data, and What Does It Mean for Libraries?Emily Nimsakont
 

Mais procurados (20)

PASTEUR4OA Data visualisation
PASTEUR4OA Data visualisationPASTEUR4OA Data visualisation
PASTEUR4OA Data visualisation
 
Data Management for Mountain Observatories Workshop
Data Management for Mountain Observatories WorkshopData Management for Mountain Observatories Workshop
Data Management for Mountain Observatories Workshop
 
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
 
creating a trading zone around twitter srchives. case study: paris attacks
creating a trading zone around twitter srchives. case study: paris attackscreating a trading zone around twitter srchives. case study: paris attacks
creating a trading zone around twitter srchives. case study: paris attacks
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
 
ESA Ignite talk on UC3 Dash platform for data sharing
ESA Ignite talk on UC3 Dash platform for data sharingESA Ignite talk on UC3 Dash platform for data sharing
ESA Ignite talk on UC3 Dash platform for data sharing
 
Advanced web searching
Advanced web searchingAdvanced web searching
Advanced web searching
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platforms
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
 
Life after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the FutureLife after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the Future
 
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731ACS Summer Institute - Emerging Roles of Librarians - 14_0731
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
 
IIIF as an Enabler to Interoperability within a Single Institution
IIIF as an Enabler to Interoperability within a Single InstitutionIIIF as an Enabler to Interoperability within a Single Institution
IIIF as an Enabler to Interoperability within a Single Institution
 
Zeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhZeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadh
 
GLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web MeetupGLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web Meetup
 
Life after MARC: Experimenting with Cataloging Tools of the Future
Life after MARC: Experimenting with Cataloging Tools of the FutureLife after MARC: Experimenting with Cataloging Tools of the Future
Life after MARC: Experimenting with Cataloging Tools of the Future
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Carpenter "The Future of the Scholarly Record"
Carpenter "The Future of the Scholarly Record"Carpenter "The Future of the Scholarly Record"
Carpenter "The Future of the Scholarly Record"
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Broad Data
Broad DataBroad Data
Broad Data
 
What is Linked Data, and What Does It Mean for Libraries?
What is Linked Data, and What Does It Mean for Libraries?What is Linked Data, and What Does It Mean for Libraries?
What is Linked Data, and What Does It Mean for Libraries?
 

Semelhante a MPhil Lecture on Data Vis for Analysis

Claudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science OnlineClaudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science Onlinesfdatascience
 
MARKET RESEARCH WEEK LESSONN PLAN 5.pptx
MARKET RESEARCH WEEK LESSONN PLAN 5.pptxMARKET RESEARCH WEEK LESSONN PLAN 5.pptx
MARKET RESEARCH WEEK LESSONN PLAN 5.pptxPreciousChanaiwa
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights Joe Lamantia
 
Process And Methodology Research
Process And Methodology ResearchProcess And Methodology Research
Process And Methodology ResearchMiles Price
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?Elena Simperl
 
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...ETCenter
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
Göteborg university(condensed)
Göteborg university(condensed)Göteborg university(condensed)
Göteborg university(condensed)Zenodia Charpy
 
Data journalism presentation
Data journalism presentationData journalism presentation
Data journalism presentationKwami Ahiabenu,II
 
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesDigital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesShawn Day
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Paul Groth
 
The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...
The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...
The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...Open Knowledge Maps
 
Data and information visualization tools 2012
Data and information visualization tools 2012Data and information visualization tools 2012
Data and information visualization tools 2012Euforic Services
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistLisa Cohen
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectbodaceacat
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSara-Jayne Terp
 

Semelhante a MPhil Lecture on Data Vis for Analysis (20)

Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Claudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science OnlineClaudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science Online
 
Big Ugly Datasets For Thumb-Fingered Journalists
Big Ugly Datasets For Thumb-Fingered JournalistsBig Ugly Datasets For Thumb-Fingered Journalists
Big Ugly Datasets For Thumb-Fingered Journalists
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
MARKET RESEARCH WEEK LESSONN PLAN 5.pptx
MARKET RESEARCH WEEK LESSONN PLAN 5.pptxMARKET RESEARCH WEEK LESSONN PLAN 5.pptx
MARKET RESEARCH WEEK LESSONN PLAN 5.pptx
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
 
Process And Methodology Research
Process And Methodology ResearchProcess And Methodology Research
Process And Methodology Research
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
Göteborg university(condensed)
Göteborg university(condensed)Göteborg university(condensed)
Göteborg university(condensed)
 
Data journalism presentation
Data journalism presentationData journalism presentation
Data journalism presentation
 
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesDigital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...
The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...
The Student's and Researcher's Guide to Discovery: Exploring Scientific Field...
 
Data and information visualization tools 2012
Data and information visualization tools 2012Data and information visualization tools 2012
Data and information visualization tools 2012
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data Scientist
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 

Mais de Shawn Day

Place of Irish Craft Beer - August 2018
Place of Irish Craft Beer - August 2018Place of Irish Craft Beer - August 2018
Place of Irish Craft Beer - August 2018Shawn Day
 
Digital Narratives for Transylvania DH
Digital Narratives for Transylvania DHDigital Narratives for Transylvania DH
Digital Narratives for Transylvania DHShawn Day
 
Data Vis for Transylvania DH
Data Vis for Transylvania DHData Vis for Transylvania DH
Data Vis for Transylvania DHShawn Day
 
Requirements Engineering for the Humanities
Requirements Engineering for the HumanitiesRequirements Engineering for the Humanities
Requirements Engineering for the HumanitiesShawn Day
 
Google Tools for Digital Humanities Scholars
Google Tools for Digital Humanities ScholarsGoogle Tools for Digital Humanities Scholars
Google Tools for Digital Humanities ScholarsShawn Day
 
Putting Your Data on a Map
Putting Your Data on a MapPutting Your Data on a Map
Putting Your Data on a MapShawn Day
 
Comparing and Considering: Exhibit vs Palladio
Comparing and Considering: Exhibit vs PalladioComparing and Considering: Exhibit vs Palladio
Comparing and Considering: Exhibit vs PalladioShawn Day
 
Tools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story Map
Tools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story MapTools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story Map
Tools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story MapShawn Day
 
Creating Narrative with Digital Objects
Creating Narrative with Digital ObjectsCreating Narrative with Digital Objects
Creating Narrative with Digital ObjectsShawn Day
 
Digital Project Success
Digital Project SuccessDigital Project Success
Digital Project SuccessShawn Day
 
Sharing - Collecting our DAH Thoughts
Sharing  - Collecting our DAH ThoughtsSharing  - Collecting our DAH Thoughts
Sharing - Collecting our DAH ThoughtsShawn Day
 
Presenting Your Digital Research
Presenting Your Digital ResearchPresenting Your Digital Research
Presenting Your Digital ResearchShawn Day
 
Presenting Spatial Data: Whats so spatial about spatial?
Presenting Spatial Data: Whats so spatial about spatial?Presenting Spatial Data: Whats so spatial about spatial?
Presenting Spatial Data: Whats so spatial about spatial?Shawn Day
 
Digital Project Management for Digital Humanities
Digital Project Management for Digital HumanitiesDigital Project Management for Digital Humanities
Digital Project Management for Digital HumanitiesShawn Day
 
Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?Shawn Day
 
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital ObjectsICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital ObjectsShawn Day
 
Finding (a) Place in Time
Finding (a) Place in TimeFinding (a) Place in Time
Finding (a) Place in TimeShawn Day
 
Curation and Digital Storytelling
Curation and Digital StorytellingCuration and Digital Storytelling
Curation and Digital StorytellingShawn Day
 
Exploring the DH Ecosystem from and Irish Perspective
Exploring the DH Ecosystem from and Irish PerspectiveExploring the DH Ecosystem from and Irish Perspective
Exploring the DH Ecosystem from and Irish PerspectiveShawn Day
 
Introduction to Omeka
Introduction to OmekaIntroduction to Omeka
Introduction to OmekaShawn Day
 

Mais de Shawn Day (20)

Place of Irish Craft Beer - August 2018
Place of Irish Craft Beer - August 2018Place of Irish Craft Beer - August 2018
Place of Irish Craft Beer - August 2018
 
Digital Narratives for Transylvania DH
Digital Narratives for Transylvania DHDigital Narratives for Transylvania DH
Digital Narratives for Transylvania DH
 
Data Vis for Transylvania DH
Data Vis for Transylvania DHData Vis for Transylvania DH
Data Vis for Transylvania DH
 
Requirements Engineering for the Humanities
Requirements Engineering for the HumanitiesRequirements Engineering for the Humanities
Requirements Engineering for the Humanities
 
Google Tools for Digital Humanities Scholars
Google Tools for Digital Humanities ScholarsGoogle Tools for Digital Humanities Scholars
Google Tools for Digital Humanities Scholars
 
Putting Your Data on a Map
Putting Your Data on a MapPutting Your Data on a Map
Putting Your Data on a Map
 
Comparing and Considering: Exhibit vs Palladio
Comparing and Considering: Exhibit vs PalladioComparing and Considering: Exhibit vs Palladio
Comparing and Considering: Exhibit vs Palladio
 
Tools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story Map
Tools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story MapTools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story Map
Tools for Digital Humanities Scholarly Innovation: Timemap, Juxtapose, Story Map
 
Creating Narrative with Digital Objects
Creating Narrative with Digital ObjectsCreating Narrative with Digital Objects
Creating Narrative with Digital Objects
 
Digital Project Success
Digital Project SuccessDigital Project Success
Digital Project Success
 
Sharing - Collecting our DAH Thoughts
Sharing  - Collecting our DAH ThoughtsSharing  - Collecting our DAH Thoughts
Sharing - Collecting our DAH Thoughts
 
Presenting Your Digital Research
Presenting Your Digital ResearchPresenting Your Digital Research
Presenting Your Digital Research
 
Presenting Spatial Data: Whats so spatial about spatial?
Presenting Spatial Data: Whats so spatial about spatial?Presenting Spatial Data: Whats so spatial about spatial?
Presenting Spatial Data: Whats so spatial about spatial?
 
Digital Project Management for Digital Humanities
Digital Project Management for Digital HumanitiesDigital Project Management for Digital Humanities
Digital Project Management for Digital Humanities
 
Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?
 
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital ObjectsICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
ICRH Winter Institute Strand 4 Day 1 - Building Narratives with Digital Objects
 
Finding (a) Place in Time
Finding (a) Place in TimeFinding (a) Place in Time
Finding (a) Place in Time
 
Curation and Digital Storytelling
Curation and Digital StorytellingCuration and Digital Storytelling
Curation and Digital Storytelling
 
Exploring the DH Ecosystem from and Irish Perspective
Exploring the DH Ecosystem from and Irish PerspectiveExploring the DH Ecosystem from and Irish Perspective
Exploring the DH Ecosystem from and Irish Perspective
 
Introduction to Omeka
Introduction to OmekaIntroduction to Omeka
Introduction to Omeka
 

Último

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 

Último (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 

MPhil Lecture on Data Vis for Analysis

  • 1. An Introduction to Data Visualisation for Analysis Exploring the Dataset - Textual, Numerical and Otherwise http://www.slideshare.net/shawnday/m-phil-datavisforanalysis
  • 2. Agenda Thoughts from last week - wordpress.com? Introduction What do we mean by Data Analysis? Some foundation terms and concepts The Data Visualisation Process Tools and Methods Extending your toolset An Exercise
  • 3. Objective To appreciate the rich variety of techniques and tools available to digital humanities scholars for data visualisation and analysis. The intention is to be able to add tools to your arsenal and to have a sense of where to look for more.
  • 4. Breakpoint One of the keys to good visualization is understanding what your immediate goals are. Are you visualizing data to understand what’s in it, or are you trying to communicate meaning to others? You - Visualisation for Data Analysis Others - Visualisation for Presentation
  • 5. Speaking of Data Analysis SPSS SAS OS Equivalents
  • 6. So Why Would You Want to Visualise Your Data? Bypass language centres to tap directly into the visual cortex Leverage ability to recognise patterns - what they call visual sense-making Powerful graphics engines now allow for live data processing and sophisticated animations and interactive research environments Sources: Geoff McGhee, Getting Started with Data Viz
  • 7. So Why Would You Want to Visualise Your Data? Work with new data to create new knowledge Explore data to discover things that used to be unknown, unknowable or impractical to know Take a new perspective on the familiar to reveal previously hidden insights
  • 8. Visualising New Information Tourists vs Locals, Eric Fischer, 2010 - Flickr
  • 9. Visualising New Information Flickr Flow, Martin Wattenberg and Fernanda Viegas, 2009
  • 10. The Familiar through New Eyes The Times Atlas
  • 11. How Could You Use Data Analysis “In the Lab” - for your own analysis Online as part of collabourative groups Through dissemination for extension of own work - crowdsourcing Others?
  • 12. The Time Ribbon and the Tree Map
  • 13. Visualisation Objective Exploring the ordinary life of rural pioneers in nineteenth century Ontario
  • 14. Farm Journal William Sunter Farm Diary, 1858
  • 15. Diaries: the raw materials • 100s of pages • Varying hands • Varying quality
  • 16. The Process • Generate word frequency (Voyeur, TAPoR) • Isolate known farm activities (NLP - LanguageWare) • Collocate to link activity references to time, duration, and resources (Voyeur)
  • 17. Example: Medical Diary Medical Diary by BlueChillies
  • 18. Example: History Flow History flow by Martin Wattenberg and Fernanda Viegas
  • 19. The Result/ New Patterns
  • 20. The Result/ New Patterns •Less time haying •The impact of technology •More tasks faster
  • 21. How Else Could this be done?
  • 22. What is the Value of this Visualisation • Easier to compare over intervals • Multiple vectors with greater granularity in a compressed space • The challenge is to find rich enough source materials to yield substantive datasets
  • 26. Case Study: Occupations of Politicians • What are we studying? – Self-declared occupations of politicians • Why? – What bias might they bring to their job? • How? – Visualising past occupation and mapping to political platform of party affiliated with
  • 27. Occupations of TDs in the 30th Dáil
  • 28. Occupations of MPs in the 2nd Parliament
  • 29. Occupations of MPs in the 37th Parliament
  • 30. The Result/ New Patterns • The emergence of the professional politician with no private sector experience • Occupational continuity across changes in governing party
  • 31. How Else Could this be Done?
  • 32. The Value of Data Vis for Analysis • New ways of presenting allow new ways of seeing • Hidden patterns become evident • Suggest other hypothesis to test
  • 33. Basic Terms Datamining Statistics Structured/Unstructured Data Visualisation Modelling
  • 34. Types of Data to Visualise Audio Data Network Data Categorical Data Social Cartographic Data Other Collections Numerical Data Image Data Temporal Data Still Textual Data Moving Narrative Metadata Qualitative Multimedia Data ????
  • 35. General Steps in Data Vis for DH Discovery / Acquisition Cleaning / ‘Munging’ Analysis / Exploratory Vis Presentation
  • 36. Discovery / Acquisition Original Research Scraping Spreadsheets Junar Databases Outwit Hub Digitized Media ScraperWiki Other Downloads Public Data Archives/Libraries Academic Partners Purchase
  • 37. Demo/Hands-On: Junar http://www.junar.com
  • 38. Cleaning / Munging (Normalisation, Format Conversion) Tools: Data Wrangler Google Refine Mr. Data Converter Data Wrangler Does simple, split, clear, fold/unfold transforms on data See example --> Data and Script Google Refine Works with larger datasets
  • 39. Hands-On: Data Wrangler http://vis.stanford.edu/wrangler/app/
  • 40. Hands-On: Google Refine http://code.google.com/p/google-refine/
  • 41. Hands-On: Mr Data Converter http://shancarter.com/data_converter/
  • 42. Analysis / Exploratory Visualisation Web Services Google Fusion Tables Google Spreadsheets IBM ManyEyes TimeFlow Applications Tableau/Tableau Public MS Office OpenOffice Gephi Node XL (plug-in for Excel) Spotfire R Processing
  • 43. Google NGram Viewers Examine word frequency in digitised books Currently about 4% of books ever published In English, Chinese, French, German, Hebrew, Russian, and Spanish Changes in word usage Trends Check out the Cultural Observatory @ Harvard
  • 45. Wordle Visually present word frequency using size, weight, colour Consider Word Clouds Considered Harmful
  • 46. Exercise Choose a dataset from a source such as: The CSO Project Guttenberg or your own material Choose an appropriate Data Visualisation from a webservice we explored in workshop. Explain the process and how you madeyour choice and embed it in your own blog using wordpress.com as we explored last week. Suggest a research question that can be answered by using this data visualisation as a research environment Send the link to me at: days@tcd.ie Maybe: http://politicalreform.ie/2011/12/04/state-of-enda-sunday- business-post-red-c-poll-4th-september-2011/