Overview of Human and Computer Vision

•Transferir como PPT, PDF•

0 gostou•516 visualizações

techmonkey4u

Tecnologia

"The eye doesn't see any shapes, it sees
only what is differentiated through light
and dark or through colors."
-Johann Wolfgang Von Goethe
(1749–1832), German poet.

An Overview of
Human and
Computer Vision
BarCamp Omaha 2010
Corey A. Spitzer

The Eye
http://en.wikipedia.org/wiki/File:Diagram_of_eye_evolution.svg

The Retina
http://www.colorado.edu/intphys/Class/IPHY3730/07vision.html

The Retina
http://openwetware.org/wiki/Image:Ch11f12.gif

Beyond the Retina
http://en.wikipedia.org/wiki/File:ERP_-_optic_cabling.jpg (Attribution: Ratznium at en.wikipedia )

Beyond the Retina
http://langabi.name/blog/2005/09/26/optical-illusions-and-visual-phenomena

Computer Vision
http://en.wikipedia.org/wiki/File:Studijskifotoaparat.JPG

Low-level Image Processing
http://en.wikipedia.org/wiki/File:Aliasing_a.png
http://en.wikipedia.org/wiki/Histogram_equalization

Image Segmentation
http://en.wikipedia.org/wiki/File:EdgeDetectionMathematica.png
Edge Detection

Image Segmentation
http://people.cs.uchicago.edu/~pff/segment/
Region-based Segmentation

High-level Image Processing
brosnan et. al.*

Movement / Object Tracking
http://www.youtube.com/watch?v=OjLlZJTahUw&t=1m12s

Depth Perception using Structured Light
http://www.youtube.com/watch?v=rYD6L1X1GUI
http://www.youtube.com/watch?v=854ZTvs8UoU&t=6m42s

Sources and Further Information
Brain and Behavior course website
University of Colorado at Boulder
http://www.colorado.edu/intphys/Class/IPHY3730/07vision.html
* Improving quality inspection of food products by computer vision––a review
Tadhg Brosnan, Da-Wen Sun
** Shape and the stereo correspondence problem
Abhijit S. Ogale and Yiannis Aloimonos

Mais conteúdo relacionado

Último

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

Destaque

2024 State of Marketing Report – by HubspotMarius Sescu

Everything You Need To Know About ChatGPTExpeed Software

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

Skeleton Culture CodeSkeleton Technologies

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

Content Methodology: A Best Practices Report (Webinar)contently

How to Prepare For a Successful Job Search for 2024Albert Qian

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

5 Public speaking tips from TED - Visualized summarySpeakerHub

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

Getting into the tech field. what next Tessa Mero

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

Introduction to Data ScienceChristy Abraham Joy

Time Management & Productivity - Best PracticesVit Horky

The six step guide to practical project managementMindGenius

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36

Destaque (20)

2024 State of Marketing Report – by Hubspot

Everything You Need To Know About ChatGPT

Product Design Trends in 2024 | Teenage Engineerings

How Race, Age and Gender Shape Attitudes Towards Mental Health

AI Trends in Creative Operations 2024 by Artwork Flow.pdf

Skeleton Culture Code

PEPSICO Presentation to CAGNY Conference Feb 2024

Content Methodology: A Best Practices Report (Webinar)

How to Prepare For a Successful Job Search for 2024

Social Media Marketing Trends 2024 // The Global Indie Insights

Trends In Paid Search: Navigating The Digital Landscape In 2024

5 Public speaking tips from TED - Visualized summary

ChatGPT and the Future of Work - Clark Boyd

Getting into the tech field. what next

Google's Just Not That Into You: Understanding Core Updates & Search Intent

How to have difficult conversations

Introduction to Data Science

Time Management & Productivity - Best Practices

The six step guide to practical project management

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...

Overview of Human and Computer Vision

1. "The eye doesn't see any shapes, it sees only what is differentiated through light and dark or through colors." -Johann Wolfgang Von Goethe (1749–1832), German poet.

2. An Overview of Human and Computer Vision BarCamp Omaha 2010 Corey A. Spitzer

3. Hi.

4. The Eye http://en.wikipedia.org/wiki/File:Diagram_of_eye_evolution.svg

5. The Retina http://www.colorado.edu/intphys/Class/IPHY3730/07vision.html

6. The Retina http://openwetware.org/wiki/Image:Ch11f12.gif

7. Beyond the Retina http://en.wikipedia.org/wiki/File:ERP_-_optic_cabling.jpg (Attribution: Ratznium at en.wikipedia )

8. Beyond the Retina http://langabi.name/blog/2005/09/26/optical-illusions-and-visual-phenomena

9. Beyond the Retina http://langabi.name/blog/2005/09/26/optical-illusions-and-visual-phenomena

10. Computer Vision http://en.wikipedia.org/wiki/File:Studijskifotoaparat.JPG

11. Low-level Image Processing http://en.wikipedia.org/wiki/File:Aliasing_a.png http://en.wikipedia.org/wiki/Histogram_equalization

12. Image Segmentation http://en.wikipedia.org/wiki/File:EdgeDetectionMathematica.png Edge Detection

13. Image Segmentation http://people.cs.uchicago.edu/~pff/segment/ Region-based Segmentation

14. Object and Facial Recognition

15. High-level Image Processing brosnan et. al.*

16. Movement / Object Tracking http://www.youtube.com/watch?v=OjLlZJTahUw&t=1m12s

17. Depth Perception using Structured Light http://www.youtube.com/watch?v=rYD6L1X1GUI http://www.youtube.com/watch?v=854ZTvs8UoU&t=6m42s

18. Stereopsis

19. Stereopsis Ogale and Aloimonos**

20. Stereopsis

21. Stereopsis

22. Stereopsis ~ 37.13 cm

23. Stereopsis ~ 23.52 cm

24. Sources and Further Information Brain and Behavior course website University of Colorado at Boulder http://www.colorado.edu/intphys/Class/IPHY3730/07vision.html * Improving quality inspection of food products by computer vision––a review Tadhg Brosnan, Da-Wen Sun ** Shape and the stereo correspondence problem Abhijit S. Ogale and Yiannis Aloimonos

25. Sources and Further Information

Notas do Editor

fovea - pit that has provides the greatest focus rods and cones turn light into electrochemical signals that are sent to the brain
cones are dedicated to bright light and colors 3 kinds of cones rods are active in processing dim light hard to see color in dim light
left and right fields get processed together and in parallel Doesn&apos;t show path to the superior colliculus -- SC serves to generate quick and usually unconscious movements of the eye (saccades); -- often purely reflexive -- focuses attention onto regions of interest such as areas where a texture or color is different from its surroundings or where movement has been detected by higher areas of the brain LGN -- each LGN has 6 layers of processing, 3 for each eye; -- exact function is not known, but information is sent to and from V1 Visual Cortex -- 5 major layers processing: orientation, position, size; _; form and shape; color; motion -- two paths: &quot;where&quot; and &quot;what&quot; processed in parallel No one completely knows how vision works in a mathematical / algorithmic sense
Processing involves more than just working with the image coming from the retina Retinal images don&apos;t tell us the difference between a hole in the ground and a shadow. The brain adds hints to the image for correct interpretation based on probability, past experience, and knowledge. Brain tweaks the image; may add additional shading or changing perceived colors to synthetically add features like depth cues What the brain allows you to see isn&apos;t always the image that&apos;s actually coming in. Not raw data
Huge area of study full of huge sub-areas of study Things are more objective and discrete with pixels versus fuzzy biological signals
Huge area. Deals with preprocessing an image to aid higher-level analysis High/low pass filtering (e.g. sharpening, blurring) aliasing / antialiasing Histogram equalization - redistributes the gray-level intensities amongst the pixels, shifting all pixels with a given intensity together (i.e. all pixels that had the same intensity before have the same intensity now, it&apos;s just a different value), thus increasing the global contrast cumulative distribution function - at point X, how many pixels have intensity at or below X
after image is prepared to be analyzed at a high level.. Need to be able to tell difference between the foreground and background, areas / objects of interest in the image, etc. subproblem of a lot of different high-level problems such as -- object recognition/detection -- image classification discontinuities - adjacent pixel regions where local contrast exceeds some threshold local contrasts - define edges - define boundaries of shapes - define objects Canny edge detector problems: -- can create edges that don&apos;t actually exist -- can ignore edges that do exist -- no inherent way to tell if an edge is part of an object or is an object boundary; e.g. textures
looking for homogeneity wrt certain features (e.g. color, texture, etc); think of the paint bucket tool in photoshop; spread out in all directions looking for contiguous pixels that are similar can be used in conjunction with edge detection
High level image processing feature detection
Images from a system that classifies pizzas as good or bad based on the pattern and distribution of toppings
Disparity - thumb exercise Brain offers hints and cues for distance -- parallax - moving head, closer objects move across field of view faster; moon follows you wherever you go -- shadows -- knowledge of what things look like Correspondence problem -- some pixels don&apos;t correspond at all due to occlusions; can see more AROUND the left side with left eye, right side with right eye -- some areas are going to appear as different widths in the different images (e.g. slanted)
Smoothed image to eliminate noise Segmented based mostly on color and contrast. -- colors weren&apos;t the same due to different cameras, different lighting from different angles, noise, etc.
Triangulation using distance between each camera, focal points, and relative positions of the corresponding segments

Overview of Human and Computer Vision

Recomendados

Recomendados

Mais conteúdo relacionado

Último

Último (20)

Destaque

Destaque (20)

Overview of Human and Computer Vision

Notas do Editor