11. Quick check: ID the Datatype
Total count of print monograph volumes in the Texas State Library
Top 20 Library Systems in the Southwest ranked by annual budget
Total overdue fines paid at the Albany Public Library in FY 2016-2017
Responses collected from a suggestion box at the circulation desk of the Heermance Public Library
Children’s reading levels by age range
The zip codes encompassing your library’s service area
13. Do you have to collect your own data?
First party data
Collected by the entity doing the analysis
Unique; often a direct relationship to data source
Trustworthy (?)
Smaller datasets (mostly)
14. Second party data
Access from external platform, but you can obtain it
Repositories
Creator of platform has direct relationship to data source
Trustworthy (?)
15. Third party data
Access from another platform
Collected anonymously; without user consent
“data exhaust”
Large, aggregated datasets
Trustworthy (?)
24. What are transformations?
Apply a mathematical formula to correct for skew
Log
Square Root/Cube Root/Square
Inversion
For non-numeric data:
Create frequency tables
Assign a scale to a category
Category dis/aggregation
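A minimal sketch of the transformations on this slide, using only the Python standard library. The data (checkouts per patron, item formats, survey responses) and the names skewness, checkouts, formats, and scale are hypothetical and invented for illustration. The skewness helper is the population third standardized moment, one common way to measure skew, not necessarily the measure your tool reports.

```python
import math
from collections import Counter


def skewness(values):
    """Population skewness: third central moment / variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical right-skewed data: checkouts per patron in one month.
checkouts = [1, 1, 2, 2, 3, 3, 4, 5, 8, 40]

# Numeric transformations from the slide.
log_t = [math.log(v) for v in checkouts]    # log: values must be > 0
sqrt_t = [math.sqrt(v) for v in checkouts]  # square root: values must be >= 0
inv_t = [1 / v for v in checkouts]          # inversion: values must be nonzero

# Non-numeric data: build a frequency table of categories.
formats = ["book", "dvd", "book", "ebook", "book", "dvd"]
freq = Counter(formats)

# Non-numeric data: assign a numeric scale to ordered categories.
scale = {"never": 0, "sometimes": 1, "often": 2}
responses = ["often", "never", "sometimes", "often"]
coded = [scale[r] for r in responses]

if __name__ == "__main__":
    print("skew raw: ", round(skewness(checkouts), 2))
    print("skew sqrt:", round(skewness(sqrt_t), 2))
    print("skew log: ", round(skewness(log_t), 2))
    print("frequency table:", dict(freq))
    print("coded responses:", coded)
```

For this sample, both the square root and the log pull the long right tail in, and the log does so more strongly. Inversion reverses the ordering of the values, so large originals become small, which matters when you interpret the result.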
45. Ethics
How is the data collected?
How is the data used?
How is the data stored and preserved, and what are the implications?
How and when is the data disposed of?
In our line of work, mostly from human error during data entry
Can also occur when a sensor fails or equipment malfunctions
Measurement calibrations are off, etc.