3. Review
• The New Epistemology
– Rise of Big Data: massive, available, social
– Shifts our relationship to primary sources
– From reading to quantitative methods and visualizations
– Example of media determinism
• Manovich
– Consistent with database logic
– Applies spirit of Big Data methods to art
5. Review
• Rationalization Effects
– What are we looking at?
– What is theory?
– What are models?
– What is culture?
– What are the humanities?
6. Overview
• Combined Studio and Lecture
• Lecture
– Google’s NGram Viewer
– Culturomics
• Studio
– Collaborative Topic Index
8. Google NGrams
• Google Books comprises 11% of the corpus of
published books, about 2 trillion words
• NGrams uses 5.2 million books (4% of the
corpus)
• 500 billion words
• Published between 1500 and 2008
• In English, French, Spanish, German, Chinese
and Russian (Hebrew too)
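To make the idea of an n-gram concrete, here is a minimal sketch of counting n-grams and computing a phrase's relative frequency per year. It is not Google's implementation: the function names, the whitespace tokenization, the toy corpus, and the choice to normalize by the number of same-length n-grams in each year are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(text, n):
    """Return the n-grams in text as tuples of lowercase words.

    Assumption: words are split on whitespace only; real corpora need
    more careful tokenization.
    """
    words = text.lower().split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def relative_frequency(phrase, texts_by_year):
    """Map each year to (count of phrase) / (total n-grams of that length)."""
    target = tuple(phrase.lower().split())
    n = len(target)
    result = {}
    for year, text in texts_by_year.items():
        counts = Counter(ngrams(text, n))
        total = sum(counts.values())
        result[year] = counts[target] / total if total else 0.0
    return result


# Toy corpus, invented for illustration only (not real Google Books data).
toy_corpus = {
    1900: "the war ended and the war began",
    1950: "the peace held and the peace grew",
}
freq = relative_frequency("the war", toy_corpus)
```

Plotting `freq` against the year gives the kind of curve the viewer draws, here for a two-word phrase (a 2-gram).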
21. Studio
• We are now at the point where we have all the
pieces in place
– HTML markup, CSS, JavaScript
– Structured data (table in Google Docs)
– Visualization tools
• Create Character Index
– We will use everything we have done so far: notes,
network visualizations, etc.
– Today we begin to collaboratively create the Character
Index (a subset of a full topic index)
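As a rough sketch of how structured table data can become an index, the example below groups rows by character and renders a simple HTML list. The column names `character` and `location`, the sample rows, and both function names are hypothetical; the studio itself works with HTML, CSS, JavaScript, and a table in Google Docs, and Python is used here only to show the logic.

```python
from collections import defaultdict
from html import escape


def build_index(rows):
    """Group locations by character name, dropping duplicate entries.

    Assumption: each row is a dict with 'character' and 'location' keys,
    as it might be after exporting the shared table.
    """
    index = defaultdict(set)
    for row in rows:
        name = row["character"].strip()
        if name:
            index[name].add(row["location"].strip())
    # Sort names and locations alphabetically (plain string order).
    return {name: sorted(locs) for name, locs in sorted(index.items())}


def render_html(index):
    """Render the index as an unordered HTML list, one item per character."""
    items = [
        f"  <li>{escape(name)}: {escape(', '.join(locs))}</li>"
        for name, locs in index.items()
    ]
    return "<ul>\n" + "\n".join(items) + "\n</ul>"


# Sample rows, invented for illustration only.
rows = [
    {"character": "Character B", "location": "Chapter 2"},
    {"character": "Character A", "location": "Chapter 1"},
    {"character": "Character A", "location": "Chapter 3"},
    {"character": "Character A", "location": "Chapter 1"},
]
index = build_index(rows)
page = render_html(index)
```

Because every contributor adds rows to the same table, rebuilding the index from the rows keeps the collaborative entries merged and free of duplicates.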