SlideShare uma empresa Scribd logo
1 de 15
Baixar para ler offline
Discovering & Dealing with Data

                                       Presented by


                           Kimberly Silk, MLS, Data Librarian,
                    Martin Prosperity Institute, University of Toronto




17 September 2012
Agenda
    • The MPI information environment
    • Common data sources & authority
    • Data management, discovery and access
    • What is Open Data? Big Data?
    • Fun with data visualization
    • Q&A



2
About the MPI
• The Martin Prosperity Institute is a economic
  think-tank; we are part of the Rotman School
  within the University of Toronto
• My client group consists of grad students, post-
  docs, visiting faculty and researchers who use
  social-science data to support their research
• To support their research process, I procure,
  curate, preserve and make discoverable data sets.
• The MPI has our own data repository that has
  grown to 4 TB in size.
                                                  3
Data Sources
    • Common & Very authoritative sources
      – StatsCan via the Data Liberation Initiative
      – Bureau of Labor Statistics, Bureau of Economic
        Analysis, American Fact Finder (Census)
      – OECD eLibrary
      – World Bank
      – Int’l sources such as UK Data Archive, Swedish
        National Data Service, etc.
      – Pew Research Center
      – Gallup
4
More data sources
    • Less authoritative??
      – Chinese Data Center
      – Rolling Stone
      – MySpace
      – CrunchBase




5
Data Challenge: Discovery

• Lots of research data
  being collected and
  added, but no method
  to manage it, catalogue
  it, or make it findable
• Demands from various
  clients: faculty,
  students, researchers,
  staff, administration
• The shared network
  drive was no longer
  effective




                            6
Show & Share…
    • We want the world to see our data catalogue
    • But, we don’t want the world to be able to
      copy or change what’s in the catalogue, or the
      catalogue itself
    • We need to manage access to our data; who
      are you? Where are you from? Why do you
      want the data? What are you going to do with
      it? Will you share your results?

7
Data Discovery Platforms
    • I reviewed several platforms that would work in
      an academic environment:
      – Nesstar – developed in Norway by Norwegian Social
        Science Data Services, used by StatsCan, UK Data
        Archive, NORC at UChicago
      – Islandora – Open source system based on Fedora
        developed at UPEI
      – ODESI – proprietary system developed and used by
        Scholars Portal
      – Dataverse – Open source system developed by the
        Institute for Quantitative Social Science at Harvard,
        used by NBER, and many academic think tanks.

8
Dataverse
    • Dataverse was a good choice since we could
      install an iteration at UToronto, in the UToronto
      cloud, and I could manage it myself
    • It was free, and my colleagues at Scholar’s Portal
      was interested in installing it – I was the perfect
      guinea pig
    • Slowly, I am cataloguing my data collection; I
      have set up a lending agreement, and it’s working
      very well.
    • Demo:
      http://dataverse.scholarsportal.info/dvn/dv/mpi

9
Open Data
 • Open data is an idea, that certain data should be
   freely available to everyone to use, reuse, and
   redistribute without restriction.
 • Governments around the world have begun to
   “open up” some of their data: US, UK, New
   Zealand, Norway, Russia, Australia, Morocco,
   Netherlands, Chile, Spain, Uruguay, France, Brazil,
   Estonia, Portugal, etc.
 • State- and municipal-levels of government have
   also created open data sites.

10
Open Data Opportunities…
 • Governments open up their data to foster
   better citizenship and improve transparency
 • Open Data can spur grass-roots innovation:
   citizens access open data to use in software
   programs to solve problems, such as finding a
   local daycare, knowing when the next bus will
   come, reporting crime on-the-fly, or watching
   congress proceedings in real time.

11
… and Challenges
 • Open Data takes commitment. Successful
   implementations have a dedicated team of
   people who decide what data to release
   according to usefulness and demand
 • The data must be anonymized, cleansed and
   in a non-proprietary format
 • Organizations must be prepared to listen to
   the citizens, be responsive, and trouble-shoot.
 • Open data is a public service.

12
Big Data
 • Big Data is a collection of data sets that is too
   large for the average database management tool
   (Access and Excel, for instance).
 • Examples come from meteorology, genomics and
   physics. At MPI we wrestle with large GIS data
   sets (maps and satellite data), and deal with data
   at the terabyte (1 trillion bytes) level.
 • Larger data sets deal with petabytes (1
   quadrillion bytes) and exabytes (1 quintillion
   bytes).

13
Data Visualizations
 • The visual representation of data ---- literally,
   a picture can say a thousand [numbers]
 • Edward Tufte is a key pioneer:
   http://www.edwardtufte.com/tufte/
 • Fantastic examples at Flowing Data:
   http://flowingdata.com/
 • RSA Animate: http://www.thersa.org/


14
Q&A

                                (and, Thank You!)



                           Kimberly Silk, MLS, Data Librarian,
                    Martin Prosperity Institute, University of Toronto
                          kimberly.silk@martinprosperity.org




17 September 2012

Mais conteúdo relacionado

Mais procurados

The HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesThe HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesRobert H. McDonald
 
LOD/LAM Presentation
LOD/LAM PresentationLOD/LAM Presentation
LOD/LAM PresentationHafabe
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchDatapetermurrayrust
 
Linked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen CoyleLinked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen CoyleBiblioteca Nacional de España
 
Efforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesEfforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesLIBER Europe
 
LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey LIBER Europe
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...Alison Hitchens
 
A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter Getaneh Alemu
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...Hazel Hall
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Stefan Dietze
 
DYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the HumanitiesDYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the Humanitiesariadnenetwork
 
Open data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni KaijageOpen data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni KaijageAfrican Open Science Platform
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and librariesAlison Hitchens
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)Alison Hitchens
 
Increase usage of online resources Edina presentation
Increase usage of online resources Edina presentationIncrease usage of online resources Edina presentation
Increase usage of online resources Edina presentationJISC RSC Eastern
 
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP exampleLife of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP exampleArhiv družboslovnih podatkov
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste..."Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...Biblioteca Nacional de España
 

Mais procurados (20)

The HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesThe HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational Services
 
LOD/LAM Presentation
LOD/LAM PresentationLOD/LAM Presentation
LOD/LAM Presentation
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
 
Linked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen CoyleLinked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen Coyle
 
Efforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesEfforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research Libraries
 
LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
 
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
 
A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
 
DYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the HumanitiesDYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the Humanities
 
Open data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni KaijageOpen data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni Kaijage
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and libraries
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)
 
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for KnowledgeNovember 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
 
Increase usage of online resources Edina presentation
Increase usage of online resources Edina presentationIncrease usage of online resources Edina presentation
Increase usage of online resources Edina presentation
 
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP exampleLife of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste..."Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
 

Destaque

Day2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with dataDay2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with datagroundwatercop
 
Dealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to InfinityDealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to InfinityGreat Wide Open
 
Dealing With Data
Dealing With DataDealing With Data
Dealing With Datapsjelinek
 
The Near Future of CSS
The Near Future of CSSThe Near Future of CSS
The Near Future of CSSRachel Andrew
 
Essential things that should always be in your car
Essential things that should always be in your carEssential things that should always be in your car
Essential things that should always be in your carEason Chan
 
Classroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and AdolescentsClassroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and AdolescentsShelly Sanchez Terrell
 

Destaque (7)

Day2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with dataDay2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with data
 
Dealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to InfinityDealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to Infinity
 
Dealing With Data
Dealing With DataDealing With Data
Dealing With Data
 
Back-to-School Survey 2016
Back-to-School Survey 2016Back-to-School Survey 2016
Back-to-School Survey 2016
 
The Near Future of CSS
The Near Future of CSSThe Near Future of CSS
The Near Future of CSS
 
Essential things that should always be in your car
Essential things that should always be in your carEssential things that should always be in your car
Essential things that should always be in your car
 
Classroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and AdolescentsClassroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and Adolescents
 

Semelhante a APLIC 2012: Discovering & Dealing with Data

Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web DataMarieke Guy
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptxAkhirulAminulloh2
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxHASHEMHASH
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypseENUG
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementMarieke Guy
 
ICPSR Data Services
ICPSR Data ServicesICPSR Data Services
ICPSR Data ServicesICPSR
 
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataMartin Kaltenböck
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessAbby Clobridge
 
ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...hsuleslie
 
Open Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and BenefitsOpen Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and Benefitsariadnenetwork
 
RDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a NutshellRDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a NutshellResearch Data Alliance
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Requirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research DataRequirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research Dataariadnenetwork
 

Semelhante a APLIC 2012: Discovering & Dealing with Data (20)

Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
 
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for LibrariansOpen Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Gettind data used
Gettind data usedGettind data used
Gettind data used
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
 
ICPSR Data Services
ICPSR Data ServicesICPSR Data Services
ICPSR Data Services
 
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
 
ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Open Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and BenefitsOpen Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and Benefits
 
RDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a NutshellRDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a Nutshell
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Requirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research DataRequirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research Data
 

Mais de Hamilton Public Library

OLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library PracticeOLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library PracticeHamilton Public Library
 
OLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-upOLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-upHamilton Public Library
 
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library ValueOLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library ValueHamilton Public Library
 
Constructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and ComponentsConstructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and ComponentsHamilton Public Library
 
Library Space Use Study: What we Learned
Library Space Use Study: What we Learned Library Space Use Study: What we Learned
Library Space Use Study: What we Learned Hamilton Public Library
 
Surfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship EcosystemSurfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship EcosystemHamilton Public Library
 
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...Hamilton Public Library
 
L-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in CanadaL-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in CanadaHamilton Public Library
 
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...Hamilton Public Library
 
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...Hamilton Public Library
 
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015Hamilton Public Library
 
CLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of LibrariesCLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of LibrariesHamilton Public Library
 
So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...Hamilton Public Library
 
TRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information ProfessionalTRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information ProfessionalHamilton Public Library
 
Internet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable InnovationInternet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable InnovationHamilton Public Library
 

Mais de Hamilton Public Library (20)

OLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library PracticeOLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library Practice
 
OLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-upOLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-up
 
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library ValueOLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
 
Constructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and ComponentsConstructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and Components
 
Library Space Use Study: What we Learned
Library Space Use Study: What we Learned Library Space Use Study: What we Learned
Library Space Use Study: What we Learned
 
Surfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship EcosystemSurfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship Ecosystem
 
Library Value Projects
Library Value ProjectsLibrary Value Projects
Library Value Projects
 
Trends in Demonstrating Library Value
Trends in Demonstrating Library ValueTrends in Demonstrating Library Value
Trends in Demonstrating Library Value
 
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
 
L-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in CanadaL-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in Canada
 
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
 
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
 
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
 
Evidence-Based Innovation
Evidence-Based InnovationEvidence-Based Innovation
Evidence-Based Innovation
 
Library Impact Studies: Lessons Learned
Library Impact Studies: Lessons LearnedLibrary Impact Studies: Lessons Learned
Library Impact Studies: Lessons Learned
 
Data, Metrics, and our Profession
Data, Metrics, and our ProfessionData, Metrics, and our Profession
Data, Metrics, and our Profession
 
CLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of LibrariesCLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of Libraries
 
So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...
 
TRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information ProfessionalTRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information Professional
 
Internet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable InnovationInternet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable Innovation
 

Último

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Último (20)

Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

APLIC 2012: Discovering & Dealing with Data

  • 1. Discovering & Dealing with Data Presented by Kimberly Silk, MLS, Data Librarian, Martin Prosperity Institute, University of Toronto 17 September 2012
  • 2. Agenda • The MPI information environment • Common data sources & authority • Data management, discovery and access • What is Open Data? Big Data? • Fun with data visualization • Q&A 2
  • 3. About the MPI • The Martin Prosperity Institute is a economic think-tank; we are part of the Rotman School within the University of Toronto • My client group consists of grad students, post- docs, visiting faculty and researchers who use social-science data to support their research • To support their research process, I procure, curate, preserve and make discoverable data sets. • The MPI has our own data repository that has grown to 4 TB in size. 3
  • 4. Data Sources • Common & Very authoritative sources – StatsCan via the Data Liberation Initiative – Bureau of Labor Statistics, Bureau of Economic Analysis, American Fact Finder (Census) – OECD eLibrary – World Bank – Int’l sources such as UK Data Archive, Swedish National Data Service, etc. – Pew Research Center – Gallup 4
  • 5. More data sources • Less authoritative?? – Chinese Data Center – Rolling Stone – MySpace – CrunchBase 5
  • 6. Data Challenge: Discovery • Lots of research data being collected and added, but no method to manage it, catalogue it, or make it findable • Demands from various clients: faculty, students, researchers, staff, administration • The shared network drive was no longer effective 6
  • 7. Show & Share… • We want the world to see our data catalogue • But, we don’t want the world to be able to copy or change what’s in the catalogue, or the catalogue itself • We need to manage access to our data; who are you? Where are you from? Why do you want the data? What are you going to do with it? Will you share your results? 7
  • 8. Data Discovery Platforms • I reviewed several platforms that would work in an academic environment: – Nesstar – developed in Norway by Norwegian Social Science Data Services, used by StatsCan, UK Data Archive, NORC at UChicago – Islandora – Open source system based on Fedora developed at UPEI – ODESI – proprietary system developed and used by Scholars Portal – Dataverse – Open source system developed by the Institute for Quantitative Social Science at Harvard, used by NBER, and many academic think tanks. 8
  • 9. Dataverse • Dataverse was a good choice since we could install an iteration at UToronto, in the UToronto cloud, and I could manage it myself • It was free, and my colleagues at Scholar’s Portal was interested in installing it – I was the perfect guinea pig • Slowly, I am cataloguing my data collection; I have set up a lending agreement, and it’s working very well. • Demo: http://dataverse.scholarsportal.info/dvn/dv/mpi 9
  • 10. Open Data • Open data is an idea, that certain data should be freely available to everyone to use, reuse, and redistribute without restriction. • Governments around the world have begun to “open up” some of their data: US, UK, New Zealand, Norway, Russia, Australia, Morocco, Netherlands, Chile, Spain, Uruguay, France, Brazil, Estonia, Portugal, etc. • State- and municipal-levels of government have also created open data sites. 10
  • 11. Open Data Opportunities… • Governments open up their data to foster better citizenship and improve transparency • Open Data can spur grass-roots innovation: citizens access open data to use in software programs to solve problems, such as finding a local daycare, knowing when the next bus will come, reporting crime on-the-fly, or watching congress proceedings in real time. 11
  • 12. … and Challenges • Open Data takes commitment. Successful implementations have a dedicated team of people who decide what data to release according to usefulness and demand • The data must be anonymized, cleansed and in a non-proprietary format • Organizations must be prepared to listen to the citizens, be responsive, and trouble-shoot. • Open data is a public service. 12
  • 13. Big Data • Big Data is a collection of data sets that is too large for the average database management tool (Access and Excel, for instance). • Examples come from meteorology, genomics and physics. At MPI we wrestle with large GIS data sets (maps and satellite data), and deal with data at the terabyte (1 trillion bytes) level. • Larger data sets deal with petabytes (1 quadrillion bytes) and exabytes (1 quintillion bytes). 13
  • 14. Data Visualizations • The visual representation of data ---- literally, a picture can say a thousand [numbers] • Edward Tufte is a key pioneer: http://www.edwardtufte.com/tufte/ • Fantastic examples at Flowing Data: http://flowingdata.com/ • RSA Animate: http://www.thersa.org/ 14
  • 15. Q&A (and, Thank You!) Kimberly Silk, MLS, Data Librarian, Martin Prosperity Institute, University of Toronto kimberly.silk@martinprosperity.org 17 September 2012