COCO’s Memory Palace:
A Strange Fantasia
Lynn Cherny
@arnicas
Images
Poetry
Memory
Project 1: Color
Poems
Color analysis & Google Vision API, Unsplash API
Martin Krzywinski’s color summarizer API
http://mkweb.bcgsc.ca/colorsummarizer/
“My list uses the following sources: bang-v2, beer SRM chart,
colorhexa, crayola, n3, ntc, pantone, raveling, resene, rgb.txt
(X11), steve.hollasch, wikipedia, and xkcd.”
9284 Named Colors
http://mkweb.bcgsc.ca/colornames/
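One way to picture how a pixel color gets a name from a list like this is a nearest-neighbor lookup. The sketch below is only an illustration under simple assumptions: it uses a six-color palette (values taken from the X11 names) and plain squared RGB distance, which is not necessarily how the Color Summarizer matches names. The function name `nearest_color_name` is made up for this example.

```python
# Tiny illustrative palette; the real list has 9284 entries.
NAMED_COLORS = {
    "black": (0, 0, 0),
    "white": (255, 255, 255),
    "red": (255, 0, 0),
    "navy": (0, 0, 128),
    "gold": (255, 215, 0),
    "forest green": (34, 139, 34),
}

def squared_distance(a, b):
    """Squared Euclidean distance between two RGB triples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest_color_name(rgb, palette=NAMED_COLORS):
    """Return the palette name whose RGB value is closest to rgb."""
    return min(palette, key=lambda name: squared_distance(rgb, palette[name]))

print(nearest_color_name((250, 210, 10)))  # a warm yellow
print(nearest_color_name((10, 10, 120)))   # a dark blue
```

A perceptual color space would probably give more natural matches than raw RGB; RGB just keeps the sketch short.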
Some More Design Details
• BigText.js for text layout (space-filling width/size)
• Complicated drop shadows to make text stand
out against the image background
• CSS image effects to grayscale, blur, opacity
• (Flexbox saved my ass)
–John Berger, Understanding a Photograph
“I have decided that seeing this is worth
recording. … Photography is the process of
rendering observation self-conscious.”
https://www.nyu.edu/classes/bkg/methods/harper.pdf
“Talking about pictures: a case for photo elicitation” (Harper, 2002)
https://www.ncbi.nlm.nih.gov/pubmed/28889976
Fludd’s
Memory Palace
of Music
–Frances Yates, The Art of Memory
“There are two kinds of memory… one natural,
the other artificial. The natural memory is that
which is engrafted in our minds, born
simultaneously with thought. The artificial
memory is a memory strengthened or
confirmed by training.”
Romberch Memory,
1553
–Frances Yates, The Art of Memory
“The artificial memory is established from
places and images…. A locus is a place easily
grasped by the memory, such as a house, an
intercolumnar space, a corner, an arch, or the
like. Images are forms, marks or simulacra.. of
what we wish to remember. For instance if we
wish to recall the genus of a horse, or a lion, of
an eagle, we must place their images on
definite loci.”
@TheGlenlivet
amazon rekognition says…
Rekognition works on either an S3 bucket URL
or an image blob.
Using Google Vision API is similar but
can take any url
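A rough sketch of the difference between the two request shapes, in Python. The helper names are invented for this example. The Rekognition call assumes `boto3` is installed and AWS credentials are configured; the Google body is meant for the REST `images:annotate` endpoint and still needs an API key or token to send.

```python
def rekognition_image(bucket=None, key=None, blob=None):
    """Build the Image argument for Rekognition's detect_labels call."""
    if blob is not None:
        return {"Bytes": blob}
    if bucket and key:
        return {"S3Object": {"Bucket": bucket, "Name": key}}
    raise ValueError("need an S3 bucket/key pair or image bytes")

def vision_label_request(image_url, max_results=10):
    """Build a JSON body for Google Vision's images:annotate endpoint."""
    return {
        "requests": [{
            "image": {"source": {"imageUri": image_url}},
            "features": [{"type": "LABEL_DETECTION", "maxResults": max_results}],
        }]
    }

def rekognition_labels(image, max_labels=10):
    """Call Rekognition; needs boto3 and configured AWS credentials."""
    import boto3  # imported here so the builders above work without it
    client = boto3.client("rekognition")
    response = client.detect_labels(Image=image, MaxLabels=max_labels)
    return [(label["Name"], label["Confidence"]) for label in response["Labels"]]
```

For example, `rekognition_labels(rekognition_image(bucket="my-bucket", key="photo.jpg"))` would label an image already in S3; the bucket and key here are placeholders.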
Project 2: COCO’s
poems, & 3 voices
Fourteen
Mary Doria Russell
All bone, no meat
Growing, not grown.
Desire, dreams,
Nothing to show.
Diving, but still dry.
Photo courtesy of Accidental Mysteries, collection of John Foster.
“Poems About Photography”
MS COCO Dataset
“Common Objects in Context” image training dataset:
“Our dataset contains photos of [80 objects]* types that would
be easily recognizable by a 4 year old.”
http://cocodataset.org/#home
*The 2014 release had 80 categories, not 91.
Useful articles
How to Freeze a Model and Serve It With A Python API
Dat Tran’s post: How to Train Your Own Object Detector With
Tensorflow
Undergraduates said…. “people, event”
What is the model trained on
==
what can it talk about?
COCO’s world of 80 things.
"person"
"bicycle"
"car"
"motorcycle"
"airplane"
"bus"
"train"
"truck"
"boat"
"traffic light"
"fire hydrant"
"stop sign"
"parking meter"
"bench"
"bird"
"cat"
"dog"
"horse"
"sheep"
"cow"
"elephant"
"bear"
"zebra"
"giraffe"
"backpack"
"umbrella"
"handbag"
"tie"
"suitcase"
"frisbee"
"skis"
"snowboard"
"sports ball"
"kite"
"baseball bat"
"baseball glove"
"skateboard"
"surfboard"
"tennis racket"
"bottle"
"wine glass"
"cup"
"fork"
"knife"
"spoon"
"bowl"
"banana"
"apple"
"orange"
"broccoli"
"carrot"
"hot dog"
"pizza"
"donut"
"cake"
"sandwich"
"chair"
"couch"
"potted plant"
"bed"
"dining table"
"toilet"
"tv"
"laptop"
"mouse"
"remote"
"keyboard"
"cell phone"
"microwave"
"oven"
"toaster"
"sink"
"refrigerator"
"book"
"clock"
"vase"
"scissors"
"teddy bear"
"hair drier"
"toothbrush"
“potted plant” but not “tree.”
COCO knows about computers, but
there is no sky, no grass, no mountains.
There is a Jupyter notebook there that will run the
TensorFlow demo for you.
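Once a detector has run, turning its output into COCO's words is mostly a lookup. The sketch below assumes the model returns parallel lists of class ids and scores; the sample values are invented, and only a few entries of the label map are included. `name_detections` and the 0.5 threshold are choices made for this example.

```python
# A few entries from the COCO label map (class id -> name).
COCO_LABELS = {1: "person", 2: "bicycle", 3: "car", 17: "cat", 18: "dog", 88: "teddy bear"}

def name_detections(classes, scores, min_score=0.5):
    """Pair class ids with scores and keep the confident ones, best first."""
    named = [
        (COCO_LABELS.get(int(c), "unknown"), float(s))
        for c, s in zip(classes, scores)
        if s >= min_score
    ]
    return sorted(named, key=lambda pair: pair[1], reverse=True)

# Hypothetical output for one image, not real model results.
classes = [88, 1, 17]
scores = [0.62, 0.91, 0.20]
print(name_detections(classes, scores))
```

The score that survives the filter is also what a text generator could use to decide how sure COCO should sound.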
How poetic is COCO?
eh, whoops. Not legit: “traffic light” not “light”, “fire hydrant”
not “fire.”
“couch” > “cat”
“toilet” > “umbrella”
“traffic” > “bus”
“spoon” == “fork” but “knife” beats both
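The "light" versus "traffic light" slip is easy to avoid by matching each label as a whole phrase. A minimal sketch, assuming the poems are available as plain text; the sample sentence is made up and `count_labels` is a name invented here.

```python
import re
from collections import Counter

def count_labels(text, labels):
    """Count whole-phrase, whole-word occurrences of each label in text."""
    counts = Counter()
    lowered = text.lower()
    for label in labels:
        pattern = r"\b" + re.escape(label.lower()) + r"\b"
        counts[label] = len(re.findall(pattern, lowered))
    return counts

# Made-up sample lines, standing in for the poetry corpus.
sample = "The fire burned low. A cat slept by the traffic light; light fell on the couch."
counts = count_labels(sample, ["traffic light", "fire hydrant", "cat", "couch"])
print(counts)
```

Here "fire" and the second "light" are not counted, because neither is a full COCO label. Plurals such as "cats" are also skipped by this pattern, which may or may not be what you want.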
> grep sandwich poetry-json
“in microwave fashion”
“donut-sized scales”
“even if ice-cream and
pizza attach dead-pan
humor”
COCO’s grammar*
• Includes words related to her degree of
uncertainty at the object recognition
• nouns mixed in that are NOT in her vocabulary,
from a word2vec lookup of related terms in the
Gigaword corpus served by API (part-of-speech
tagged)
• Language style reflecting naivety and youth
*tracery.js from Kate Compton
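The idea behind a Tracery-style grammar can be sketched in a few lines of Python: each symbol maps to a list of rules, and `#tags#` inside a rule are expanded recursively. This is not tracery.js itself and not the grammar used in the talk; the rules below are hypothetical.

```python
import random
import re

# Hypothetical rules, not the grammar used in the talk.
GRAMMAR = {
    "origin": ["#hedge# I see a #thing#.", "Is that a #thing#? #hedge#, yes."],
    "hedge": ["Maybe", "I think", "Probably"],
    "thing": ["teddy bear", "potted plant", "couch"],
}

def expand(symbol, grammar, rng):
    """Pick one rule for symbol and recursively expand any #tags# in it."""
    rule = rng.choice(grammar[symbol])
    return re.sub(r"#(\w+)#", lambda m: expand(m.group(1), grammar, rng), rule)

rng = random.Random(0)  # seeded so repeated runs give the same line
print(expand("origin", GRAMMAR, rng))
```

To reflect uncertainty, the `hedge` rules could be swapped per image depending on the detector's score, and the `thing` rules filled from the detected labels plus related nouns.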
https://github.com/zonination/perceptions
Part-of-speech tagged Gigaword corpus word2vec model
Words similar to “castle_NOUN”:
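A similarity lookup of this kind boils down to ranking vectors by cosine similarity. The sketch below uses invented three-dimensional vectors rather than the Gigaword model, and filters neighbors by the part-of-speech suffix so that only nouns come back. `most_similar` here is a hand-written helper for illustration.

```python
import math

# Toy 3-d vectors invented for illustration; a real model has hundreds of dimensions.
VECTORS = {
    "castle_NOUN": [0.9, 0.1, 0.0],
    "fortress_NOUN": [0.8, 0.2, 0.1],
    "palace_NOUN": [0.7, 0.3, 0.0],
    "sandwich_NOUN": [0.0, 0.1, 0.9],
    "storm_VERB": [0.6, 0.0, 0.5],
}

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def most_similar(word, vectors=VECTORS, pos="NOUN", topn=3):
    """Rank other words with the given part-of-speech tag by cosine similarity."""
    target = vectors[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word and other.endswith("_" + pos)
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:topn]

for neighbor, score in most_similar("castle_NOUN"):
    print(neighbor, round(score, 3))
```

With a real pretrained model, a library such as gensim offers the same kind of query through `KeyedVectors.most_similar`.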
–John Berger
“A photograph, whilst recording what has been
seen, always and by its nature refers to what is
not seen. … the quantum of truth in an
'impersonal' photograph must still depend upon
the general categories already in the
spectator's mind.”
Naho
Matsudo
http://everythingeverytime.net/about.html
Adding more voices
• Google Vision API — more “objective” and
scientific language
• Amazon Rekognition — more colorful, more
poetic
Google Vision API
& Amazon Rekognition
(the nouns)
Using COCO, Amazon, and Google APIs all together…
WH Auden on “The Fall of Icarus”
by Peter Brueghel
(Google said “FU” to the URL)
“the old guitarist,” Picasso
“The Man with the Blue Guitar”
by Wallace Stevens
“The man bent over his guitar,
A shearsman of sorts. The day was green.
They said, “You have a blue guitar,
You do not play things as they are.”
The man replied, ‘Things as they are
Are changed upon the blue guitar.’”
https://www.amazon.com/PERSONAL-Recordable-Plush-Talking-Teddy/dp/B001RZE0E4
COCO says, man with a teddy bear.
https://www.eff.org/ai/metrics
–Wallace Stevens, Man With the Blue Guitar
“We shall forget by day, except
The moments when we choose to play
The imagined pine, the imagined jay.”
–Yates, the Art of Memory
“an artificial memory gone out of control into
wild imaginative indulgence…”
“the dream archaeology of a humanist mingles with dream
memory systems to form the strange fantasia.”
Thank you.
-@arnicas
Slides will go on SlideShare and in a blog post.
