SlideShare a Scribd company logo
1 of 29
Download to read offline
Data crunchers, this
session is for you
FOIAfest 2016 | March 12, 2016
Denise Linn, Program Analyst for the Smart Chicago Collaborative
smartchicagocollaborative.org
@DKLinn | @SmartChicago
Agenda
Introductions
Some Excel Tricks & Time Savers
Quick, Easy Mapping: Chicago Library Branches
Analysis in Excel: Lobbyist Compensation in Chicago
● Filtering, IFs, COUNTIFs, SUMIFs, Pivot Tables….
Resources
@DKLinn | @SmartChicago
Introductions
@DKLinn | @SmartChicago
Tricks & Time Savers
@DKLinn | @SmartChicago
Some basics:
Operations = + * / ^
Order of Operations: BEDMAS
Basic formulas: =SUM(), =AVERAGE(), =COUNT(), =RANK()
Cheatsheets are your friend! The Internet has many to offer. Here is a good
one: http://www.collegedegreesearch.net/excel-cheats/
@DKLinn | @SmartChicago
Shortcuts for Mac:
@DKLinn | @SmartChicago
Time savers
Shift + space → select an entire row
Control + space → selects an entire column
Control + a → selects an entire worksheet
Ctl/Command + up key → jump up to the top of the workbook
Ctl/Command + down key → jump up to the bottom of the workbook
@DKLinn | @SmartChicago
Time Savers (cont.)
Excel will finish patterns in a row or column for you — days, dates, etc.
Double clicking on the bottom right box of a cell with a formula or pattern in
it will continue that calculation or pattern for the entire column of work - no
need to click, hold, and scroll.
@DKLinn | @SmartChicago
Let’s Map Chicago’s
Public Libraries
@DKLinn | @SmartChicago
First - find the dataset
Go to Chicago’s Open Data Portal
Search, “Library Locations Hours Contact”. You’ll find this Dataset: https:
//data.cityofchicago.org/Education/Libraries-Locations-Hours-and-Contact-
Information/x8fc-8rcq
Click “Export.” On the right hand side, choose “CSV for Excel”
Open the file.
@DKLinn | @SmartChicago
We have enough to make a map
Insert a blank to the right of “CYBERNAVIGATOR.”
● In D2 type formula
○ =IF(C2="Yes","CyberNavigator","No CyberNavigator")
● Doubleclick the lower right box to continue the formula for the whole
sheet.
● Label cell D1 “GROUP”
Now go to http://batchgeo.com. Copy the whole worksheet from Excel and
paste it into the webpage. Click “Map Now” and wait.
Note the cool “advanced” options you have for mapping.
@DKLinn | @SmartChicago
The Result:
@DKLinn | @SmartChicago
Let’s Use Excel to
Analyze Chicago’s
Lobbyist Compensation
Data
@DKLinn | @SmartChicago
First, Let’s Find the Data
Let’s go to Chicago’s Open Data Portal https://data.cityofchicago.org
Search, “Lobbyist Compensation”
Click “Export.” On the right hand side, choose “CSV for Excel” (so we can
capture all the rows)
Open the “Lobbyist Data Compensation” file. Save on your machine as as
xlsx file.
@DKLinn | @SmartChicago
Familiarize Yourself with the Data
Note that every row is a unique compensation ID
Note that clients, lobbyists, and lobbyist codes repeat.
Note this data is between 2012 - 2015.
Note the lobbyist codes & names
There are 24,391 compensation IDs!
What are the first questions that spring to your mind when you see this
information? Has anyone worked with this before?
@DKLinn | @SmartChicago
Make the Data Easier to Work With
Preserve the original data before you start working with it! Copy & paste the
whole “Lobbyist_Data_-_Compensation.cs” worksheet onto a different
sheet. Label that second sheet “Analysis”.
Click cell A2 on the second row of the “Analysis” sheet. Go to Window and
select “Freeze Panes”
Click a cell on the top row, then click the filter icon.
What’s the first thing you want to do?
@DKLinn | @SmartChicago
The Extremes
To find the smallest compensation amount:
● =MIN(H2:H24392)
To find the largest compensation amount:
● =MAX(H2:H24392)
Click the filter button next to “COMPENSATION_AMOUNT” and select
“descending.” Who is the client that paid out the largest compensation
amount? Which lobbyist did it go to?
@DKLinn | @SmartChicago
The “Average”
To find the average compensation amount:
● =AVERAGE(H2:H24392)
To find the median compensation amount:
● =MEDIAN(H2:H24392)
To find out how many compensation amounts have been over $100K:
● =COUNTIF(H2:H24392, ">100000")
Takeaway: There are outliers — substantially larger compensation
amounts that pull the average up.
@DKLinn | @SmartChicago
Summing Compensations by Client
On your “Analysis” sheet, filter the COMPENSATION_AMOUNT in
descending order. Look at the client names. Anything strike your
interest?
Calculate the total amount of compensation given by any client by
using =SUMIF(J2:J24392,"NAME",H2:H24392). For example:
● =SUMIF(J2:J24392,"Lucas Museum of Narrative Art",H2:H24392)
● =SUMIF(J2:J24392,"CVS Caremark Corp",H2:H24392)
● =SUMIF(J2:J24392,"Uber Technologies",H2:H24392)
@DKLinn | @SmartChicago
Sort Client names Z → A. Find “Uber” under client name
(CTL/Command + f). What else do you see?
Some clients are listed under several related names. This makes
things difficult for us and our SUMIFs...
One way to tackle this issue: pivot tables!
A pivot table helps us group, regroup, and analyze our data in an
interactive way.
@DKLinn | @SmartChicago
Summing Compensations by Client (cont.)
Prepping Our Data for Pivot Table Making
Go to the original “Lobbyist_Data_-_Compensation.cs” worksheet.
Select all, copy, & paste into a new sheet. Label this, “Pivot Table”
In the sheet “Pivot Table,” select the column
“LOBBYIST_MIDDLE_INITIAL” and delete it.
● FYI, we’re only doing this because, if we want to make a pivot
table, we can’t have blank cells
Create a new column “G” after “LOBBYIST_LAST_NAME.” Label G1
“LOBBYIST_FULL_NAME”. Into G2 type
● =CONCATENATE(“E1”, “ ”, “F1”)
● Double click the right bottom corner!
@DKLinn | @SmartChicago
Making a Pivot Table
On the “Pivot Table” sheet, click
“refresh data” button.
Select any cell, go to the Pivot Table
icon under the “Data” section, and
select “Create Manual Pivot Table…”
This window will pop up. Make sure
you have the following selections
chosen (see left)
@DKLinn | @SmartChicago
Building Pivot Table #1
“Pivot Table Builder” will pop up. It will look
like the image to the left.
● Drag and drop “CLIENT_NAME” under
Row Labels.
● Drag and drop “Sum of Compensations”
under Values.
Click on the B5 cell. Go up to the Sort Icon
and select “descending”.
What do we see?
@DKLinn | @SmartChicago
@DKLinn | @SmartChicago
Building Pivot Table #2
Make a new pivot table in another sheet.
“Pivot Table Builder” will pop up. It will look like
the image to the right.
● Drag and drop “CLIENT_NAME” under
Report Filter.
● Drag and drop “LOBBYIST_LAST_NAME”
under Column Labels.
● Drag and drop “COMPENSATION_ID”
under Row Labels
● Drag and drop “Sum of Compensations”
under Values.
@DKLinn | @SmartChicago
Pivot Tables V
Go to the filter icon in B1, search for “Uber,” and select Uber, Uber
Technologies, and Uber Technologies, Inc. This is the pivot table you will see:
@DKLinn | @SmartChicago
Play with this pivot tables more...
Build a pivot table displaying compensations sums by Lobbyist full
name:
● Drag & drop “LOBBYIST_FULL_NAME” into “Row Labels”
● Drag & drop “COMPENSATION” into the summation box
● Sort descending!
To look up clients and money BY lobbyist….
● Drag & drop “LOBBYIST_FULL_NAME” into “reporter filter”
category
● Drag & drop “CLIENT_NAME” into the “Row Labels” box
● Drag & drop “COMPENSATION” into the summation box
@DKLinn | @SmartChicago
More resources & other free tools:
Visualization tools:
http://timeline.knightlab.com/
http://timemapper.okfnlabs.org/
https://datawrapper.de/
Microsoft Power BI
https://storymap.knightlab.com/
Google sheets/charts
Excel help:
https://support.office.com/en-us/excel
https://support.office.com/en-us/article/Excel-functions-by-category-
5F91F4E9-7B42-46D2-9BD1-63F26A86C0EB (all functions by category)
@DKLinn | @SmartChicago
Questions?
@DKLinn | @SmartChicago

More Related Content

Viewers also liked

Мохамед Мезгани "Peer review of Moscow city transport system and policy"
Мохамед Мезгани "Peer review of Moscow city transport system and policy"Мохамед Мезгани "Peer review of Moscow city transport system and policy"
Мохамед Мезгани "Peer review of Moscow city transport system and policy"mosurban
 
Dogmas /certified fixed orthodontic courses by Indian dental academy
Dogmas /certified fixed orthodontic courses by Indian dental academy Dogmas /certified fixed orthodontic courses by Indian dental academy
Dogmas /certified fixed orthodontic courses by Indian dental academy Indian dental academy
 
Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...
Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...
Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...Terrell Patillo
 
Pilvipalvelut opetuksessa 25.4.16
Pilvipalvelut opetuksessa 25.4.16 Pilvipalvelut opetuksessa 25.4.16
Pilvipalvelut opetuksessa 25.4.16 Matleena Laakso
 

Viewers also liked (7)

Мохамед Мезгани "Peer review of Moscow city transport system and policy"
Мохамед Мезгани "Peer review of Moscow city transport system and policy"Мохамед Мезгани "Peer review of Moscow city transport system and policy"
Мохамед Мезгани "Peer review of Moscow city transport system and policy"
 
Interview template
Interview templateInterview template
Interview template
 
Dogmas /certified fixed orthodontic courses by Indian dental academy
Dogmas /certified fixed orthodontic courses by Indian dental academy Dogmas /certified fixed orthodontic courses by Indian dental academy
Dogmas /certified fixed orthodontic courses by Indian dental academy
 
Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...
Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...
Trinity Kings World Leadership: Recruit People Who Seize Opportunities(*June ...
 
Bronchial ashtma Drugs by Dr Harikrishna
Bronchial ashtma Drugs by Dr HarikrishnaBronchial ashtma Drugs by Dr Harikrishna
Bronchial ashtma Drugs by Dr Harikrishna
 
Yves rocher
Yves rocherYves rocher
Yves rocher
 
Pilvipalvelut opetuksessa 25.4.16
Pilvipalvelut opetuksessa 25.4.16 Pilvipalvelut opetuksessa 25.4.16
Pilvipalvelut opetuksessa 25.4.16
 

Similar to FOIAfest 2016 Session: Data crunchers, this session is for you

Informatica complex transformation ii
Informatica complex transformation iiInformatica complex transformation ii
Informatica complex transformation iiAmit Sharma
 
6 tableau visualization tips to inspire your creative eyes
6 tableau visualization tips to inspire your creative eyes6 tableau visualization tips to inspire your creative eyes
6 tableau visualization tips to inspire your creative eyesGrazitti Interactive
 
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...HostedbyConfluent
 
Intro to Financial Modeling - EI
Intro to Financial Modeling - EIIntro to Financial Modeling - EI
Intro to Financial Modeling - EIMartin Zych
 
Informatica complex transformation i
Informatica complex transformation iInformatica complex transformation i
Informatica complex transformation iAmit Sharma
 
CIS 3100 - Database Design and ImplementationProducts on Sale Da.docx
CIS 3100 - Database Design and ImplementationProducts on Sale Da.docxCIS 3100 - Database Design and ImplementationProducts on Sale Da.docx
CIS 3100 - Database Design and ImplementationProducts on Sale Da.docxmccormicknadine86
 
- Database Design and ImplementationProducts on Sale Database fo.docx
- Database Design and ImplementationProducts on Sale Database fo.docx- Database Design and ImplementationProducts on Sale Database fo.docx
- Database Design and ImplementationProducts on Sale Database fo.docxgertrudebellgrove
 
Product Supplier59,58,8,2.50CIS 3100 - Database Desig.docx
Product Supplier59,58,8,2.50CIS 3100 - Database Desig.docxProduct Supplier59,58,8,2.50CIS 3100 - Database Desig.docx
Product Supplier59,58,8,2.50CIS 3100 - Database Desig.docxstilliegeorgiana
 
CIS 3100 - Database Design and ImplementationScoring Rubric.docx
CIS 3100 - Database Design and ImplementationScoring Rubric.docxCIS 3100 - Database Design and ImplementationScoring Rubric.docx
CIS 3100 - Database Design and ImplementationScoring Rubric.docxmccormicknadine86
 
SAP BPC 10.1 NW Master Data loading
SAP BPC 10.1 NW Master Data loading SAP BPC 10.1 NW Master Data loading
SAP BPC 10.1 NW Master Data loading Manoj Kumar
 
Geo prompt dashboard
Geo prompt dashboardGeo prompt dashboard
Geo prompt dashboardAmit Sharma
 
Get started with Sketch: a fast (and awesome) communication and design tool
Get started with Sketch: a fast (and awesome) communication and design toolGet started with Sketch: a fast (and awesome) communication and design tool
Get started with Sketch: a fast (and awesome) communication and design toolChristopher Azar
 
Dynamic Business Modeling
Dynamic Business ModelingDynamic Business Modeling
Dynamic Business Modelingchadbachman
 
Advanced Remarketing in Google Analytics Using CRM Data
Advanced Remarketing in Google Analytics Using CRM DataAdvanced Remarketing in Google Analytics Using CRM Data
Advanced Remarketing in Google Analytics Using CRM Datametricmogul
 
Porfolio of Setfocus work
Porfolio of Setfocus workPorfolio of Setfocus work
Porfolio of Setfocus workKevinPSF
 
Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave Club
Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave ClubJoining the Club: Using Spark to Accelerate Big Data at Dollar Shave Club
Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave ClubData Con LA
 
Excel Pivot Tables and Graphing for Auditors
Excel Pivot Tables and Graphing for AuditorsExcel Pivot Tables and Graphing for Auditors
Excel Pivot Tables and Graphing for AuditorsJim Kaplan CIA CFE
 

Similar to FOIAfest 2016 Session: Data crunchers, this session is for you (20)

Informatica complex transformation ii
Informatica complex transformation iiInformatica complex transformation ii
Informatica complex transformation ii
 
6 tableau visualization tips to inspire your creative eyes
6 tableau visualization tips to inspire your creative eyes6 tableau visualization tips to inspire your creative eyes
6 tableau visualization tips to inspire your creative eyes
 
SmartCatalog 2010 for Microsoft Dynamics CRM
SmartCatalog 2010 for Microsoft Dynamics CRM SmartCatalog 2010 for Microsoft Dynamics CRM
SmartCatalog 2010 for Microsoft Dynamics CRM
 
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
You Put *What* in Your Stream?! Patterns and Practices for Event Design with ...
 
Intro to Financial Modeling - EI
Intro to Financial Modeling - EIIntro to Financial Modeling - EI
Intro to Financial Modeling - EI
 
Informatica complex transformation i
Informatica complex transformation iInformatica complex transformation i
Informatica complex transformation i
 
CIS 3100 - Database Design and ImplementationProducts on Sale Da.docx
CIS 3100 - Database Design and ImplementationProducts on Sale Da.docxCIS 3100 - Database Design and ImplementationProducts on Sale Da.docx
CIS 3100 - Database Design and ImplementationProducts on Sale Da.docx
 
- Database Design and ImplementationProducts on Sale Database fo.docx
- Database Design and ImplementationProducts on Sale Database fo.docx- Database Design and ImplementationProducts on Sale Database fo.docx
- Database Design and ImplementationProducts on Sale Database fo.docx
 
Product Supplier59,58,8,2.50CIS 3100 - Database Desig.docx
Product Supplier59,58,8,2.50CIS 3100 - Database Desig.docxProduct Supplier59,58,8,2.50CIS 3100 - Database Desig.docx
Product Supplier59,58,8,2.50CIS 3100 - Database Desig.docx
 
CIS 3100 - Database Design and ImplementationScoring Rubric.docx
CIS 3100 - Database Design and ImplementationScoring Rubric.docxCIS 3100 - Database Design and ImplementationScoring Rubric.docx
CIS 3100 - Database Design and ImplementationScoring Rubric.docx
 
SAP BPC 10.1 NW Master Data loading
SAP BPC 10.1 NW Master Data loading SAP BPC 10.1 NW Master Data loading
SAP BPC 10.1 NW Master Data loading
 
Geo prompt dashboard
Geo prompt dashboardGeo prompt dashboard
Geo prompt dashboard
 
Get started with Sketch: a fast (and awesome) communication and design tool
Get started with Sketch: a fast (and awesome) communication and design toolGet started with Sketch: a fast (and awesome) communication and design tool
Get started with Sketch: a fast (and awesome) communication and design tool
 
Dynamic Business Modeling
Dynamic Business ModelingDynamic Business Modeling
Dynamic Business Modeling
 
Advanced Remarketing in Google Analytics Using CRM Data
Advanced Remarketing in Google Analytics Using CRM DataAdvanced Remarketing in Google Analytics Using CRM Data
Advanced Remarketing in Google Analytics Using CRM Data
 
Porfolio of Setfocus work
Porfolio of Setfocus workPorfolio of Setfocus work
Porfolio of Setfocus work
 
Award Setup-Screenshots
Award Setup-ScreenshotsAward Setup-Screenshots
Award Setup-Screenshots
 
Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave Club
Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave ClubJoining the Club: Using Spark to Accelerate Big Data at Dollar Shave Club
Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave Club
 
Excel Pivot Tables and Graphing for Auditors
Excel Pivot Tables and Graphing for AuditorsExcel Pivot Tables and Graphing for Auditors
Excel Pivot Tables and Graphing for Auditors
 
Award Set Up: Research Gifts-screen shots
Award Set Up: Research Gifts-screen shotsAward Set Up: Research Gifts-screen shots
Award Set Up: Research Gifts-screen shots
 

More from Smart Chicago Collaborative

Microsoft DigiSeniors Module: Computing with Confidence
Microsoft DigiSeniors Module: Computing with ConfidenceMicrosoft DigiSeniors Module: Computing with Confidence
Microsoft DigiSeniors Module: Computing with ConfidenceSmart Chicago Collaborative
 
Inclusive Smart Cities - Net Inclusion 2017 Presentation
Inclusive Smart Cities - Net Inclusion 2017 PresentationInclusive Smart Cities - Net Inclusion 2017 Presentation
Inclusive Smart Cities - Net Inclusion 2017 PresentationSmart Chicago Collaborative
 
5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church
5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church
5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical ChurchSmart Chicago Collaborative
 
Final Report for CUTGroup #28 - City of Chicago Open Data Portal Homepage
Final Report for CUTGroup #28 - City of Chicago Open Data Portal HomepageFinal Report for CUTGroup #28 - City of Chicago Open Data Portal Homepage
Final Report for CUTGroup #28 - City of Chicago Open Data Portal HomepageSmart Chicago Collaborative
 
Juvenile Expungement Presentation to The Chicago Community Trust
Juvenile Expungement Presentation to The Chicago Community TrustJuvenile Expungement Presentation to The Chicago Community Trust
Juvenile Expungement Presentation to The Chicago Community TrustSmart Chicago Collaborative
 
CUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective CallCUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective CallSmart Chicago Collaborative
 
Connect Chicago Digital Skills Road Map Working Group #1
Connect Chicago Digital Skills Road Map Working Group #1Connect Chicago Digital Skills Road Map Working Group #1
Connect Chicago Digital Skills Road Map Working Group #1Smart Chicago Collaborative
 
2016 Smart Chicago Collaborative Youth-Led Tech Instructors
2016 Smart Chicago Collaborative Youth-Led Tech Instructors2016 Smart Chicago Collaborative Youth-Led Tech Instructors
2016 Smart Chicago Collaborative Youth-Led Tech InstructorsSmart Chicago Collaborative
 

More from Smart Chicago Collaborative (20)

Chicago School of Data Book
Chicago School of Data BookChicago School of Data Book
Chicago School of Data Book
 
10/18/17 Array of Things Public Meeting Flyer
10/18/17 Array of Things Public Meeting Flyer10/18/17 Array of Things Public Meeting Flyer
10/18/17 Array of Things Public Meeting Flyer
 
Microsoft DigiSeniors Module: Computing with Confidence
Microsoft DigiSeniors Module: Computing with ConfidenceMicrosoft DigiSeniors Module: Computing with Confidence
Microsoft DigiSeniors Module: Computing with Confidence
 
DigiSeniors Curriculum - Leaders Guide
DigiSeniors Curriculum - Leaders GuideDigiSeniors Curriculum - Leaders Guide
DigiSeniors Curriculum - Leaders Guide
 
Inclusive Smart Cities - Net Inclusion 2017 Presentation
Inclusive Smart Cities - Net Inclusion 2017 PresentationInclusive Smart Cities - Net Inclusion 2017 Presentation
Inclusive Smart Cities - Net Inclusion 2017 Presentation
 
5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church
5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church
5.20.17 Community Technology Forum at Windsor Park Lutheran Evangelical Church
 
Juvenile Expungement Help Desk Presentation
Juvenile Expungement Help Desk PresentationJuvenile Expungement Help Desk Presentation
Juvenile Expungement Help Desk Presentation
 
Quick facts about Juvenile Expungement
Quick facts about Juvenile ExpungementQuick facts about Juvenile Expungement
Quick facts about Juvenile Expungement
 
LAF Chicago Juvenile Expungement Clinics
LAF Chicago Juvenile Expungement ClinicsLAF Chicago Juvenile Expungement Clinics
LAF Chicago Juvenile Expungement Clinics
 
Juvenile Expungement Help Desk Flyer
Juvenile Expungement Help Desk FlyerJuvenile Expungement Help Desk Flyer
Juvenile Expungement Help Desk Flyer
 
Final Report for CUTGroup #28 - City of Chicago Open Data Portal Homepage
Final Report for CUTGroup #28 - City of Chicago Open Data Portal HomepageFinal Report for CUTGroup #28 - City of Chicago Open Data Portal Homepage
Final Report for CUTGroup #28 - City of Chicago Open Data Portal Homepage
 
Juvenile Expungement Presentation to The Chicago Community Trust
Juvenile Expungement Presentation to The Chicago Community TrustJuvenile Expungement Presentation to The Chicago Community Trust
Juvenile Expungement Presentation to The Chicago Community Trust
 
CUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective CallCUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective Call
 
Final Report for CUTGroup #24 - OpenGrid
Final Report for CUTGroup #24 - OpenGridFinal Report for CUTGroup #24 - OpenGrid
Final Report for CUTGroup #24 - OpenGrid
 
Connect Chicago Digital Skills Road Map Working Group #1
Connect Chicago Digital Skills Road Map Working Group #1Connect Chicago Digital Skills Road Map Working Group #1
Connect Chicago Digital Skills Road Map Working Group #1
 
2016 Smart Chicago Collaborative Youth-Led Tech Instructors
2016 Smart Chicago Collaborative Youth-Led Tech Instructors2016 Smart Chicago Collaborative Youth-Led Tech Instructors
2016 Smart Chicago Collaborative Youth-Led Tech Instructors
 
Component type
Component typeComponent type
Component type
 
App template
App templateApp template
App template
 
App inventor basics
App inventor basicsApp inventor basics
App inventor basics
 
Goal setting
Goal settingGoal setting
Goal setting
 

Recently uploaded

RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGIThomas Poetter
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 

Recently uploaded (20)

RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 

FOIAfest 2016 Session: Data crunchers, this session is for you

  • 1. Data crunchers, this session is for you FOIAfest 2016 | March 12, 2016 Denise Linn, Program Analyst for the Smart Chicago Collaborative smartchicagocollaborative.org @DKLinn | @SmartChicago
  • 2. Agenda Introductions Some Excel Tricks & Time Savers Quick, Easy Mapping: Chicago Library Branches Analysis in Excel: Lobbyist Compensation in Chicago ● Filtering, IFs, COUNTIFs, SUMIFs, Pivot Tables…. Resources @DKLinn | @SmartChicago
  • 4. Tricks & Time Savers @DKLinn | @SmartChicago
  • 5. Some basics: Operations = + * / ^ Order of Operations: BEDMAS Basic formulas: =SUM(), =AVERAGE(), =COUNT(), =RANK() Cheatsheets are your friend! The Internet has many to offer. Here is a good one: http://www.collegedegreesearch.net/excel-cheats/ @DKLinn | @SmartChicago
  • 6. Shortcuts for Mac: @DKLinn | @SmartChicago
  • 7. Time savers Shift + space → select an entire row Control + space → selects an entire column Control + a → selects an entire worksheet Ctl/Command + up key → jump up to the top of the workbook Ctl/Command + down key → jump up to the bottom of the workbook @DKLinn | @SmartChicago
  • 8. Time Savers (cont.) Excel will finish patterns in a row or column for you — days, dates, etc. Double clicking on the bottom right box of a cell with a formula or pattern in it will continue that calculation or pattern for the entire column of work - no need to click, hold, and scroll. @DKLinn | @SmartChicago
  • 9. Let’s Map Chicago’s Public Libraries @DKLinn | @SmartChicago
  • 10. First - find the dataset Go to Chicago’s Open Data Portal Search, “Library Locations Hours Contact”. You’ll find this Dataset: https: //data.cityofchicago.org/Education/Libraries-Locations-Hours-and-Contact- Information/x8fc-8rcq Click “Export.” On the right hand side, choose “CSV for Excel” Open the file. @DKLinn | @SmartChicago
  • 11. We have enough to make a map Insert a blank to the right of “CYBERNAVIGATOR.” ● In D2 type formula ○ =IF(C2="Yes","CyberNavigator","No CyberNavigator") ● Doubleclick the lower right box to continue the formula for the whole sheet. ● Label cell D1 “GROUP” Now go to http://batchgeo.com. Copy the whole worksheet from Excel and paste it into the webpage. Click “Map Now” and wait. Note the cool “advanced” options you have for mapping. @DKLinn | @SmartChicago
  • 12. The Result: @DKLinn | @SmartChicago
  • 13. Let’s Use Excel to Analyze Chicago’s Lobbyist Compensation Data @DKLinn | @SmartChicago
  • 14. First, Let’s Find the Data Let’s go to Chicago’s Open Data Portal https://data.cityofchicago.org Search, “Lobbyist Compensation” Click “Export.” On the right hand side, choose “CSV for Excel” (so we can capture all the rows) Open the “Lobbyist Data Compensation” file. Save on your machine as as xlsx file. @DKLinn | @SmartChicago
  • 15. Familiarize Yourself with the Data Note that every row is a unique compensation ID Note that clients, lobbyists, and lobbyist codes repeat. Note this data is between 2012 - 2015. Note the lobbyist codes & names There are 24,391 compensation IDs! What are the first questions that spring to your mind when you see this information? Has anyone worked with this before? @DKLinn | @SmartChicago
  • 16. Make the Data Easier to Work With Preserve the original data before you start working with it! Copy & paste the whole “Lobbyist_Data_-_Compensation.cs” worksheet onto a different sheet. Label that second sheet “Analysis”. Click cell A2 on the second row of the “Analysis” sheet. Go to Window and select “Freeze Panes” Click a cell on the top row, then click the filter icon. What’s the first thing you want to do? @DKLinn | @SmartChicago
  • 17. The Extremes To find the smallest compensation amount: ● =MIN(H2:H24392) To find the largest compensation amount: ● =MAX(H2:H24392) Click the filter button next to “COMPENSATION_AMOUNT” and select “descending.” Who is the client that paid out the largest compensation amount? Which lobbyist did it go to? @DKLinn | @SmartChicago
  • 18. The “Average” To find the average compensation amount: ● =AVERAGE(H2:H24392) To find the median compensation amount: ● =MEDIAN(H2:H24392) To find out how many compensation amounts have been over $100K: ● =COUNTIF(H2:H24392, ">100000") Takeaway: There are outliers — substantially larger compensation amounts that pull the average up. @DKLinn | @SmartChicago
  • 19. Summing Compensations by Client On your “Analysis” sheet, filter the COMPENSATION_AMOUNT in descending order. Look at the client names. Anything strike your interest? Calculate the total amount of compensation given by any client by using =SUMIF(J2:J24392,"NAME",H2:H24392). For example: ● =SUMIF(J2:J24392,"Lucas Museum of Narrative Art",H2:H24392) ● =SUMIF(J2:J24392,"CVS Caremark Corp",H2:H24392) ● =SUMIF(J2:J24392,"Uber Technologies",H2:H24392) @DKLinn | @SmartChicago
  • 20. Sort Client names Z → A. Find “Uber” under client name (CTL/Command + f). What else do you see? Some clients are listed under several related names. This makes things difficult for us and our SUMIFs... One way to tackle this issue: pivot tables! A pivot table helps us group, regroup, and analyze our data in an interactive way. @DKLinn | @SmartChicago Summing Compensations by Client (cont.)
  • 21. Prepping Our Data for Pivot Table Making Go to the original “Lobbyist_Data_-_Compensation.cs” worksheet. Select all, copy, & paste into a new sheet. Label this, “Pivot Table” In the sheet “Pivot Table,” select the column “LOBBYIST_MIDDLE_INITIAL” and delete it. ● FYI, we’re only doing this because, if we want to make a pivot table, we can’t have blank cells Create a new column “G” after “LOBBYIST_LAST_NAME.” Label G1 “LOBBYIST_FULL_NAME”. Into G2 type ● =CONCATENATE(“E1”, “ ”, “F1”) ● Double click the right bottom corner! @DKLinn | @SmartChicago
  • 22. Making a Pivot Table On the “Pivot Table” sheet, click “refresh data” button. Select any cell, go to the Pivot Table icon under the “Data” section, and select “Create Manual Pivot Table…” This window will pop up. Make sure you have the following selections chosen (see left) @DKLinn | @SmartChicago
  • 23. Building Pivot Table #1 “Pivot Table Builder” will pop up. It will look like the image to the left. ● Drag and drop “CLIENT_NAME” under Row Labels. ● Drag and drop “Sum of Compensations” under Values. Click on the B5 cell. Go up to the Sort Icon and select “descending”. What do we see? @DKLinn | @SmartChicago
  • 25. Building Pivot Table #2 Make a new pivot table in another sheet. “Pivot Table Builder” will pop up. It will look like the image to the right. ● Drag and drop “CLIENT_NAME” under Report Filter. ● Drag and drop “LOBBYIST_LAST_NAME” under Column Labels. ● Drag and drop “COMPENSATION_ID” under Row Labels ● Drag and drop “Sum of Compensations” under Values. @DKLinn | @SmartChicago
  • 26. Pivot Tables V Go to the filter icon in B1, search for “Uber,” and select Uber, Uber Technologies, and Uber Technologies, Inc. This is the pivot table you will see: @DKLinn | @SmartChicago
  • 27. Play with this pivot tables more... Build a pivot table displaying compensations sums by Lobbyist full name: ● Drag & drop “LOBBYIST_FULL_NAME” into “Row Labels” ● Drag & drop “COMPENSATION” into the summation box ● Sort descending! To look up clients and money BY lobbyist…. ● Drag & drop “LOBBYIST_FULL_NAME” into “reporter filter” category ● Drag & drop “CLIENT_NAME” into the “Row Labels” box ● Drag & drop “COMPENSATION” into the summation box @DKLinn | @SmartChicago
  • 28. More resources & other free tools: Visualization tools: http://timeline.knightlab.com/ http://timemapper.okfnlabs.org/ https://datawrapper.de/ Microsoft Power BI https://storymap.knightlab.com/ Google sheets/charts Excel help: https://support.office.com/en-us/excel https://support.office.com/en-us/article/Excel-functions-by-category- 5F91F4E9-7B42-46D2-9BD1-63F26A86C0EB (all functions by category) @DKLinn | @SmartChicago