ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor Daniel Martin Katz
1. Complex Systems Models
in the Social Sciences
(Lecture 6)
daniel martin katz
illinois institute of technology
chicago kent college of law
@computationaldanielmartinkatz.com computationallegalstudies.com
2. Today I Would Like to
Sketch (In Part)
Where the World
is Heading
11. This is the Era of “Big Data”
Decreasing Data Storage Costs
Increasing Computing Power
Fundamentally Altering the Scope of Scientific &
Technical Possibility
29. Erik Brynjolfsson is the Schussel Family
Professor at the MIT Sloan School of
Management , Director of the MIT
Center for Digital Business, Chair of
the MIT Sloan Management Review ,
and the Editor of the Information
Systems Network
Andrew McAfee, a principal research
scientist at MIT’s Center for Digital
Business, studies the ways that
information technology (IT) affects
business.
50. Financial Services
Help agents find patterns, understand context and
flag abuse by analyzing customer correspondence in
conjunction with structured information.
Predict consumer behavior by using cross-line of
business details to correlate consumer debt ratios
and transaction patterns.
Automate entity identification and risk profiling based
on SEC and other regulatory filings, with the ability
to assemble, model and stay current on large
volumes of information.
Consume, analyze and act on real-time market data
while maintaining sub-millisecond response times, even
under extreme data loads.
51. Retail
Analyze social media data in conjunction with
customer buying data to predict customer behavior,
and track sentiment and brand perception.
Monitor social media sources for rumors,
deliberate false information, and impersonation of
employees to more quickly understand and correct
misinformation.
Issue marketing promotions, analyze their success
in real time, and adapt promotions to optimize
outcome.
Analyze linkages between specific online
advertising and recommendations to buying
behavior so that you can strategically merchandise
and place effective advertising.
Run highly complex pattern detection algorithms
on years, rather than months of transaction data,
allowing the organization to rapidly detect and
respond to new fraud scenarios and exposures.
(Also True of Financial services)
52. Healthcare
Perform complex real-time analytics on
physiological streams of data in ICU
environments to detect life-threatening
conditions in time to proactively intervene.
Manage and analyze real-time sensor data to
assist chronic disease patients.
Capture and analyze clinical information from
electronic health records to speed the creation
and diffusion of medical knowledge.
Track a wide variety of data streams and
leverage prior benchmarks to potentially help
to expose the early signs of an epidemic
53. Telecom
Perform real-time mediation with the ability to
handle billions of call detail records per day.
Process real-time call data to predict customer
churn and remediate customer satisfaction issues
(ex: dropped calls) as soon as they happen.
Enable real-time geo-mapping and marketing
promotions.
Analyze social media data with customer buying
data to predict customer behavior and track
sentiment and brand perception.
Unlock the insights embedded in call center voice
recordings by doing voice-to-text conversions,
then performing advanced text analytics on the
converted recordings.
75. A Brief Word About Data
Driven Theory Building
One Billion Clicks Can Be the
Basis for a Theory
Generation
The Hypothesis Testing Frame
is not the only way to do
Science
76. Validation Can Be Achieved
Through Either:
(1) Out of Sample Testing
(2) Forward Prediction
A Brief Word About Data
Driven Theory Building
http://en.wikipedia.org/wiki/Netflix_Prize
http://www.wired.com/epicenter/2009/09/
how-the-netflix-prize-was-won/
Example:
77.
78. This is the Age of
Aspirational Spelling
(Spelling is 1.0 Thinking)
Additional Examples
80. The Science of Similarity
What Makes a Set of High Dimensional
Objects ‘Similar”?
Movies People Words Music Books
Crowd Sourced Human Reasoning is Helping
Develop a ‘New’ Science of Similarity
81. Computer
Forensics
Drawn from Jesse Kornblum
1. Feature Extraction
2. Feature Selection
3. Comparison
4. Clustering
5. Classification
6. ???
The ‘???’ means:
– Which features to extract
– Which similarity measure to use
– Which classification algorithm
The Science of Similarity
125. Computer
Forensics
Drawn from Jesse Kornblum
1. Feature Extraction
2. Feature Selection
3. Comparison
4. Clustering
5. Classification
6. ???
The ‘???’ means:
– Which features to extract
– Which similarity measure to use
– Which classification algorithm
126. My hope is that the AI Revolution
will allow us to better understand
analogical reasoning as it is a critical
for a large number questions we
care about .....