SlideShare uma empresa Scribd logo
1 de 126
Baixar para ler offline
Find your passion and make a
difference in your career!
Albert Y. C. Chen, Ph.D.
Vice President, R&D
Viscovery
Albert Y. C. Chen, Ph.D.
陳彥呈博⼠士
albert@viscovery.com
http://slideshare.net/albertycchen
• Experience
2017-present: Vice President of R&D @ Viscovery
2015-2017: Chief Scientist @ Viscovery
2015-2015: Principal Scientist @ Nervve Technologies
2013-2014 Senior Scientist @ Tandent Vision Science
2011-2012 @ GE Global Research, Computer Vision Lab
• Education
Ph.D. in Computer Science, SUNY-Buffalo
M.S. in Computer Science, NTNU
B.S. in Computer Science, NTHU
• Face it. You'll be working the majority of your
time, for the rest of your life.
• Do something meaningful, do something you are
passionate about.
Life
24 36 48 60 72
• What are the things that you can't wait to get up
every morning and do it all over again?
What are you passionate about?
I've got a dream!
My crush on CV, HCI, MM, ML...
Can't wait to figure out the world!
Freestyle Sketching Stage
AirTouch waits in background
for the initialization signal
Initialize
Terminate
Output
image
database
Start:
Results
CBIR
query
Airtouch HCI interface for Content-based Image Retrieval
Interactive Segmentation & Classification
• Segmentation then classification:
• computationally more efficient,
• results in much higher classification accuracy.
• Pioneered the “pixel label propagation” field.
• First to utilize superpixels and supervoxels for the task.
FG
Traditional Spatial
Propagation
Pixel label map
Label a subset of pixels
BG
Spatio-temporal Propagation
time
Image/Video Object Recognition
and Content Understanding
approaches
person carries
gives
recieves
Ontology
object
Person 1
Person 1Person 2
High-Level
Mid-Level
approach
activity
receives gives
carries
activity
activity activity
Time
Reasoning
x
x
x
Low-Level
x x
x
x
Learning and Adapting Optimal
Classifier Parameters
subspace B
subspace
A
subspace
C
Image-level feature space
priors
Patch-level feature space
posterior
probability
suggest optimal
parameter configuration
Graphical Models and
Stochastic Optimization
A
(a) The space-time volume of a
video showing the objects
(A--F) and their appearing
time-span.
space
time
A
B
C
D
E
F
B E
F
C
D
(b) The temporal relationship
graph. An edge between
two vertices mean that the
two objects overlap in time.
(c) The goal is: cover all objects
with the smallest number of
"ground truth key frames".
space
time
A
B
C
D
E
F
key 1 key 2
A
B E
F
C
D
(d) This translates to: iteratively
solving the max clique
problem until all vertices
belong to a clique.
A
B E
F
C
D
key 2
key 1
frame t-1 frame t
layer n layer n
layer n+1 layer n+1
Temporal
Shift
Shift
µ
Medical Imaging and
Geospatial Imaging
GNN detection and
segmentation
in Lung CT geospatial imaging:
building detection
Brain tumor detection and
segmentation in MR images.
1. W.Wu,A.Y. C. Chen, L. Zhao, and J. J. Corso. Brain tumor detection and segmentation in a CRF
framework with pixel-wise affinity and superpixel-level features. International Journal of Computer
Assisted Radiology and Surgery, 2015.
2. S. N. Lim,A.Y. C. Chen and X.Yang. Parameter Inference Engine (PIE) on the Pareto Front. In
Proceedings of International Conference of Machine Learning,Auto ML Workshop, 2014.
3. A.Y.C. Chen and J.J. Corso.Temporally consistent multi-class video-object segmentation with the
video graph-shifts algorithm. In Proceedings of IEEE Workshop on Applications of ComputerVision,
2011.
4. D.R. Schlegel,A.Y.C. Chen, C. Xiong, J.A. Delmerico, and J.J. Corso. Airtouch: Interacting with
computer systems at a distance. In Proceedings of IEEE Workshop on Applications of Computer
Vision, 2011.
5. A.Y.C. Chen and J.J. Corso. On the effects of normalization in adaptive MRF Hierarchies. In
Proceedings of International Symposium CompIMAGE, 2010.
6. A.Y.C. Chen and J.J. Corso. Propagating multi-class pixel labels throughout video frames. In
Proceedings of IEEE Western NewYork Image Processing Workshop, 2010.
7. A.Y. C. Chen and J. J. Corso. On the effects of normalization in adaptive MRF Hierarchies.
Computational Modeling of Objects Represented in Images, pages 275–286, 2010.
8. Y.Tao, L. Lu, M. Dewan,A.Y. C. Chen, J. J. Corso, J. Xuan, M. Salganicoff, and A. Krishnan. Multi-level
ground glass nodule detection and segmentation in ct lung images. Medical Image Computing and
Computer-Assisted Intervention, 2009.
9. A.Y.C. Chen, J.J. Corso, and L.Wang. Hops: Efficient region labeling using higher order proxy
neighborhoods. In Proceedings of IEEE International Conference on Pattern Recognition, 2008.
• I gained the following capabilities throughout my
graduate studies:
• problem solving skills (research/engineering),
• the art of teaching,
• entrepreneurship.
• Furthermore, I'm really into my field of study, AI.
• I asked myself, how can I balance between
these things for my career?
Upon getting my Ph.D.
• Realized being Principal Investigator (PI) is not
that much different from being an entrepreneur!
• Entrepreneur has more control over where the
money is spent, how quickly it is spent, and get
more out of fruit of hard labor!
• Also, I get to do RD, BD, and mentoring, all at
once!
• However, I still have many things to learn before
I fly solo.
Inflection point
• How century-old multi-national corporation
remain competitive by reinventing itself.
• How wave after wave of innovations become the
new pillars of the US economy.
• How Taiwan failed to gain significance in the
Internet-based economy in the past 20 years.
• How I might be able to help make a difference
with my limited capability, in my limited lifetime.
Throughout the years, I observed...
Why Risk Innovating?
• Good business model NEVER last forever.
• Average “shelf life” on S&P 500: 20 years.
• 100-year old companies constantly reinvent
themselves every 10-20 years
• Startups contribute to 20% of USA’s GDP.
Change is the only constant
-Heraclitus (535 BC - 475 BC)
Change is the only constant
-Heraclitus (535 BC - 475 BC)
The Death of a Good
Business Model
• Foxconn 20 year revenue v.s. net profit (now at 5%)
How 100 year old corporations
remain innovative & competitive
GE Schenectady, 1896
History of change at GE
• 1886: one of the 12 original companies on the Dow
Jone Industrial Average (also the only one remaining).
• 1889: lightbulbs
• 1919: radios
• 1927: TV
• 1941: jet engine
• 1960: nuclear power
• 1971: room AC units
• 1995: MRI
History of change at IBM
• 1960s: mainframe computer
• 1980s: personal computer
• 2000s: integrated solutions
• 2020s: AI, Watson
How about the leading
Semiconductor companies?
NVidia reinventing itself
—2 times in 20 years
“Bad money drives out good”
in the desktop GPU market
The rise of mobile computing,
and how NVidia missed the boat!
NVidia’s Tegra mobile
processors never took off
then, the market
saturated…
NVidia not just survived.
NVidia is thriving!
Meet the new NVidia: Deep Learning,
Deep Learning, and still, Deep Learning
The king is dead,
long live the king!
Why startups have to be
innovative to stay competitive
• Startup: a company in pursuit of a bold new
business model that can scale.
Must take risks to gain edge, being small
The Venture Capital Funnel is
extremely efficient at filtering ideas
Don't romanticize startup life;
it's a cruel struggle for survival...
Before making the leap, 

you must ask yourself
• Are you passionate about your idea?
• Is your idea disruptive enough?
• What is your business plan?
• What is it?
• Can it make money?
• What is the future of the idea?
• What is your competitive advantage?
• How do you build up your entry barrier?
Furthermore, be prepared to face
constant, repeated rejections
NO
Ha Ha
Ha
NO
NO
It doesn't really matter
how you start
nor does it matter
when you start
It's important though, to
fail early, fail often, fail forward!
Build
Measure
Learn
The lean cycle
FAIL?
Ideate
pivot leap of faith
MVP
Build
Measure
Learn
Start all over again!
MVP
It's also important to quickly
build up your entry barrier!
• Speed
• Team
• Technology
• Market (users)
niche, niche, and niche!
The Goldilocks zone of innovation
The Goldilocks zone of innovation
Business
Relevance
Academic
Relevance
plentiful resources; hierarchical organization
lack of resources; responsive organization
traditional corporations
talking “innovation”
corporate research
startups struggling to survive
academic spinoffs
MSR
Niche market/technology for AI startups
(BCG AI Report, 2016/10)
appl.
layer
tech
layer
infra
layer
solution
platform
libraries
modules
data
machine computing power
data accumulation via open API
AI/DNN library AI/DNN library
general purpose
platforms
general purpose
platforms
app-specific
platforms
app-specific
platforms
app app app app app
HW
co.
VerticalAIStartups
agri. manu. med. fin. retail trans.
E.g., 1: Google, Amazon, FB, 2: IBM, 3: Walmart, 5: NVidia
Vertical AI Startups
Solving industry-specific problems by combining
AI and Subject Matter Expertise.
• Full Stack Products
• Subject Matter Expertise
• Proprietary Data
• AI delivers core value
(Bradford Cross, 2017/06/14)
Why Go Vertical?
• Don’t get ripped off—don’t get disconnected
from key customer/providers.
• Tasks get commoditized
• Software is eating the world—every company in
every industry needs to be a tech company.
• Enterprise exits come in cohorts
(Bradford Cross, 2017/06/14)
So, which vertical?
(Bradford Cross, 2017/06/14)
• choose large TAM
with high margin
• avoid 1% fallacy
• avoid confirmation
bias
So, which vertical?
(Bradford Cross, 2017/06/14)
AI startups total investmentUnicorn tally
Big
Innovation
Large
Market
Wide
Exit
Beware of the
"Technical Founder's Trap"
cannot
exit
no
edge
another
SME
Easy trap
for technical
founders to
fall into.
Not necessarily
bad for a first step.
Dominate the
small market and
then expand to
neighboring one.
Big Innovations with small
markets or narrow exits
Image Panoramas
(1980s - now)
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Input:
Solving Panorama Problem
with Markov Random Fields
Solving Panorama Problem
with Markov Random Fields
ICM (Iterated Conditional Modes), 1986
Solving Panorama Problem
with Markov Random Fields
Belief Propagation (1980-2000)
Solving Panorama Problem
with Markov Random Fields
Graph-Cuts (alpha expansion), 2001
Photosynthesis
Solving Photosynthesis Problems
with Alpha-matting (2000s-now)
Self-driving cars (2000s-now)
DARPA Grand Challenge
(2005)
2005 winner, Stanley (Stanford),
3mph through desert
DARPA Urban Challenge
(2007)
2007 winner, Boss (CMU),
13mpg through the city
Self Driving Cadillac, US
congressman to airport, 2013
Google Self Driving Car,
2015
Google Self-Driving Car,
2016
Google Self-Driving Car, 2016
NVidia Self Driving Car, 2016
How did we come this far?
Race car drivers know the trick
Focus on Free Space /
Drivable Area, not Obstacles!
Not so major innovations
with big monetary gains
Sports—NFL first down line
(1995-now)
Industrial Inspection: turbofan
jet engine blade maintenance
• Some seemingly daunting
machine vision tasks actually
works with relatively simple
image processing algorithms.
Industrial Inspection: Cognex Omniview
Industrial Inspection: Cognex Omniview
Object Recognition
Blue River Technology
Retail Insights
Source:
Prism Skylabs
Other Applications in
Business Intelligence
• Measure brand exposure.
• Measure sponsorship effectiveness.
• Loss prevention and retail layout optimization.
Some AI verticals that didn't
quite work out for me...
• Medical Imaging
• Multimedia
• Advanced Driver Assistance Systems (ADAS)
• Geospatial imaging for Energy Sector
• Surveillance
• Business Intelligence
• Advertisements
Intrinsic Imaging at Tandent
Vision Science
Computer Vision would be half-solved without shadows!
LightOriginal Image Surface
Tandent Lightbrush
Video Tutorial for Tandent Lightbrush: https://vimeo.com/47009123
Issues
• Highly anticipated, highly acclaimed, but small
crowd at $500 a license.
• Adobe Photoshop monopoly and the “not
invented here” syndrome.
• Adobe’s arch-rival, Corel (Corel Draw, Paint
Shop Pro, Ulead PhotoImpact) was DYING and
asked too much from the botched deal.
Have fun scribbling out your
shadows in photoshop!
Poor Bob from Adobe wasted 9 minutes removing just 1 shadow
Intrinsic Imaging for improving the
RGB signal in autonomous driving
Intrinsic Imaging’s other
applications
Retrospect
• 20 researchers burned 25 million in 8 years;
investors got 50 patents in return, period.
• Overestimated the total addressable market
size, in a market with existing monopoly.
• Many missed opportunities. Counterexample of
the lean startup model.
Some SfM, SLAM startups
Satellite/Aerial Imagery Analysis
• 40cm resolution at 30fps for 90 sec for any location on earth.
• One LEO satellite revisits any place on Earth every 3 days.
• Need 24 satellites to revisit any place on Earth every 3 hours.
Challenges for Single satellite depth
estimation and 3D reconstruction
• At 30fps, a LEO satellite
travels 250m between two
consecutive frames —>
theoretically sufficient for
cm-level depth estimation.
• Sources of Noise:
• Camera distortions
• Atmospheric Disturbance
• Ground vegetation
• Sub-pixel sampling noise
1
2
What happened?
• B2B customers takes too long to strike deals.
• Google ate us alive in just 3 months, while we
were still pitching for VC-funding with our
prototype.
Visual Search at Nervve
Retrospect
• Growth pains expanding from intelligence
community clients to advertisement clients.
• Forming the right team of engineers and
researchers and moving at the right pace.
• For any Computer Vision/Machine Learning
company:
• Researchers that cannot program—> OUT
• Engineers that don’t know math —> OUT
Viscovery = Video Discovery
Optical Character
Recognition
Offline
Recognition
2013
2014
Product Recognition
2015
Video Content related
Advertisements
2017
Wearable Devices
Video Content Discovery &
Interaction
2016
Leading provider of visual search technology in APAC
Once in a lifetime opportunity in
China’s video streaming market
Traditional
Media
Free Content,
Ad Revenue
Subscription
Revenue
Do Nothing?
Sitting Duck!
Improve Ad
Revenue
Content-
related video
ads
Own platform
Shared/
congregated
platform
customized
recommendatio
ns
ad measure /
brand safety
Image/Video
content owners
struggling to
monetize
Can AI solve it all?
Our initial focus:

Content-related advertisements
Why not go ahead and analyze all videos? COST!
0600120018002400
2014 2015 2016 2017 2018 2019
Revenue Hardware Cost Data Collection Cost Model Training Cost
Technology Inflection Point:
• Computer vision cost has dropped significantly
while market demand has increased beyond
exploratory stages.
• Overall cost reduction resulted from accumulated
domain expertise, efficient machines learning
techniques, and lower computing cost.
Case study on leading e-commerce site in APAC, with 1M SKU and >20M calls/month.
• Business model vs videos analyzed
Need to deliver value while analyzing!
o2o marketing
ad effectiveness
analysis
campaign
effectiveness
analytics
direct-sell
content-related ads
programmatic
content-related ads
Big Data Insights
0 2.5 5 7.5 10
videos
analyzed
(10^N) • Extremely costly to
analyze video content.
Google charges
$1.5/1000 images. with
3600 seconds/hour, it
will cost $5.4 / hour
video / single single
recognition task.
• Costly to store videos
and their recognition
results.
Hindsight on "right approach" for media
Increase user’s time spent & stickiness
Multiply traffic w/o subsidizing growth.
(4) Video content-related ads & e-commerce
(3) Ad exposure measure
(2) Content recommendation
Grow user & advertiser
budget organically with
new services
(1) Image/Video Content Recognition as a Service
Focus on
Revenue
Growth
(5)DataMiningandAnalyticsasaService
Goal
Media
Size
Value proposition to media:
Start making money the first day!
user/time/ad/budgetincrease(times)
0
1
2
3
4
TotalSiteRevenueIncrease(times)
04.5913.518
current Better recommendation
Improve user stickiness
Better measurement
Grow user & ad budget
Content-related ad, EC
Focus on Revenue
# users time / user ad / user*time budget / ad total site revenue
Viscovery's Complete Solution

for Media and Beyond
Media
AdRetail
Archive & Search
Content
Recommendation
Video Generator*
Brand Insight
Ad Effectiveness
Measure
Brand Safety
Assurance
Visual
Content
Recognition
Smart Checkout
Video to EC
Consumer Insights
Media AI: Video content archive & search
Media AI: Content Recommendation
Article-to-Article, Article-to-Video, Video-to-Video
The 31 Tiniest Cat Breeds
Cats are notoriously small
household pets, but some are
even tinier than the norm. If you
have tight living quarters or
simply can’t get enough of cats
little enough to resemble kittens
forever, there are plenty of small
cat breeds to suit you. When it
comes to cats, bigger isn’t
always better, as some of the
most charming felines come in
small packages.
Meaningful
Terms Re-weighting
High-dimension
Mapping
Terms (Attribute)
Extraction
Para
Media AI: Video Generator
powering the "pivot to video" movement!
Text Analysis
(NLP)
Image

Recognition
Key Phrases
Sentiment
Audio
Subtitles
F/O/S
Concept
Media
Database
Generation Engine
Ad AI: ad exposure measure
# Appearances 68 times
# Appearances with
Duration > 5 sec
2 times
# Appearances with
Duration > 15 sec
0
# Appearances with
Duration > 30 sec
0
Longest Duration of
Appearance
9 sec
Average Duration of
Appearance
2.12 sec
Average Appearance
Interval
37.79 sec
Total Duration of
Appearance
144 sec
Ad Exposure for Audi @ China GT (Shanghai) 2017
# times of logo
occupying >5%
screen size
8 times
Duration of logo
occupying > 5%
screen size
16 sec
# times of logo
occupying > 10%
screen size
0 times
Duration of logo
occupying > 10%
screen size
0 sec
Average logo size
(relative to screen)
2.12%
Maximum logo size
(relative to screen)
11.05%
Ad AI: Content-related ads
Ad AI: Brand safety assurance
Terroris Violence
DrugPolitical Explicit Content
Retail AI: o2o redirection
Retail AI: smart checkout
Oreo cookie ……………
$4.95
Bounty Paper Towel … $3.45
Cal. Red wine………… $12.44
--------------------------
Total
$20.84
Scan payment QR Code:
wechat Alipay
Smart checkout demo
Retail AI: merchandise management
•2D+3D cameras for head and eye tracking
•Detect Point-Of-Interest
•Consumer profiling and data collection
•Integrate with kiosk big screen devices
•ESL Integration
Smart POS
• Smart check out

• Client-facing POS screen
with 3D camera

• POS-on-the-run

• Customer Big Data

• ESL integration

Smart Shelf
Retail AI: Cunsumer Insights
Female, 30-35
Ms. Mary Rice
Status: VIP
Housewife
Brand: P&G,
LV, Nike,
Tiffany
Already
holding
product A
Reaching for
product B on
the shelf
E.g., People who appear interested in milk powder will
get an eDM containing relevant promo info.
Recognize customer appearance and behavior characteristics
to establish a buying recommendation database.
• Clear segment focus: Any corporation smaller
than Google, Facebook, Baidu, Alibaba,
Tencent.
• Need to go from 0-80 fast. Then, help them go
from 80-90 and catch up quickly.
• How can we make the "0-80" part scalable?
Building Technology Barrier
Sample-less Face Recognition
Merchandise Recognition
Some notes
• Managing innovation and delivering promises.
• Visionary leader v.s. stubborn fool v.s. big liar.
When something is important enough,
you do it even if the odds are not in your favor.
Elon Musk
Falcon 9
takeoff
Falcon 9
decelerate
Falcon 9
vertical
touchdown
Thank You!
albert@viscovery.com

Mais conteúdo relacionado

Mais procurados

Mais procurados (16)

Patent: Presentation on Patent Mining
Patent: Presentation on Patent MiningPatent: Presentation on Patent Mining
Patent: Presentation on Patent Mining
 
Lean innovation
Lean innovationLean innovation
Lean innovation
 
JLL-V3
JLL-V3JLL-V3
JLL-V3
 
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
 
Working model to patentability
Working model to patentabilityWorking model to patentability
Working model to patentability
 
Lean Startup, Hardware Edition
Lean Startup, Hardware EditionLean Startup, Hardware Edition
Lean Startup, Hardware Edition
 
Nutanix Event - Watson AI Presentation
Nutanix Event - Watson AI PresentationNutanix Event - Watson AI Presentation
Nutanix Event - Watson AI Presentation
 
10 bezcennych lekcji dla software developera stającego się szefem firmy
10 bezcennych lekcji dla software developera stającego się szefem firmy10 bezcennych lekcji dla software developera stającego się szefem firmy
10 bezcennych lekcji dla software developera stającego się szefem firmy
 
Ten lessons I painfully learnt while moving from software developer to entrep...
Ten lessons I painfully learnt while moving from software developer to entrep...Ten lessons I painfully learnt while moving from software developer to entrep...
Ten lessons I painfully learnt while moving from software developer to entrep...
 
Ten lessons I painfully learnt while moving from software developer
to entrep...
Ten lessons I painfully learnt while moving from software developer
to entrep...Ten lessons I painfully learnt while moving from software developer
to entrep...
Ten lessons I painfully learnt while moving from software developer
to entrep...
 
Agilelessons scanagile-final 2013
Agilelessons scanagile-final 2013Agilelessons scanagile-final 2013
Agilelessons scanagile-final 2013
 
Google: Rapid Prototyping for AR
Google: Rapid Prototyping for ARGoogle: Rapid Prototyping for AR
Google: Rapid Prototyping for AR
 
Software Development Innovation in Practice - 33rd Degree 2014
Software Development Innovation in Practice - 33rd Degree 2014Software Development Innovation in Practice - 33rd Degree 2014
Software Development Innovation in Practice - 33rd Degree 2014
 
Pretotyping - Ayla Matalon
Pretotyping - Ayla MatalonPretotyping - Ayla Matalon
Pretotyping - Ayla Matalon
 
How Companies Innovate
How Companies InnovateHow Companies Innovate
How Companies Innovate
 
Lessons learned about entrepreneurship
Lessons learned about entrepreneurshipLessons learned about entrepreneurship
Lessons learned about entrepreneurship
 

Semelhante a Find Your Passion and Make a Difference in Your Career

ICSE 2011 Panel - Wolfram Schulte
ICSE 2011 Panel - Wolfram SchulteICSE 2011 Panel - Wolfram Schulte
ICSE 2011 Panel - Wolfram Schulte
jorgearanda
 
AI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptxAI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptx
ISSIP
 
Brno-IESS 20240205 v9 service-science ai.pptx
Brno-IESS 20240205 v9 service-science ai.pptxBrno-IESS 20240205 v9 service-science ai.pptx
Brno-IESS 20240205 v9 service-science ai.pptx
ISSIP
 
PDL Distinguished Alumni Talk
PDL Distinguished Alumni TalkPDL Distinguished Alumni Talk
PDL Distinguished Alumni Talk
Erik Riedel
 
2020 Future Work Skills" by Institute of the Future (pdf)
2020 Future Work Skills" by Institute of the Future (pdf)2020 Future Work Skills" by Institute of the Future (pdf)
2020 Future Work Skills" by Institute of the Future (pdf)
Errol A. Adams, J.D., M.L.S.
 

Semelhante a Find Your Passion and Make a Difference in Your Career (20)

Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
 
The Modern Software Engineer - Nuno Job
The Modern Software Engineer - Nuno JobThe Modern Software Engineer - Nuno Job
The Modern Software Engineer - Nuno Job
 
What your employees need to learn to work with data in the 21 st century
What your employees need to learn to work with data in the 21 st century What your employees need to learn to work with data in the 21 st century
What your employees need to learn to work with data in the 21 st century
 
London Futurists - The Future of AI & Sustainability
London Futurists - The Future of AI & SustainabilityLondon Futurists - The Future of AI & Sustainability
London Futurists - The Future of AI & Sustainability
 
ICSE 2011 Panel - Wolfram Schulte
ICSE 2011 Panel - Wolfram SchulteICSE 2011 Panel - Wolfram Schulte
ICSE 2011 Panel - Wolfram Schulte
 
AUGMENTED AND VIRTUAL REALITY MEDICAL AND HEALTH
AUGMENTED AND VIRTUAL REALITY MEDICAL AND HEALTHAUGMENTED AND VIRTUAL REALITY MEDICAL AND HEALTH
AUGMENTED AND VIRTUAL REALITY MEDICAL AND HEALTH
 
AI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptxAI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptx
 
Brno-IESS 20240205 v9 service-science ai.pptx
Brno-IESS 20240205 v9 service-science ai.pptxBrno-IESS 20240205 v9 service-science ai.pptx
Brno-IESS 20240205 v9 service-science ai.pptx
 
Malaysian Higher Ed: C-ing the Future
Malaysian Higher Ed: C-ing the FutureMalaysian Higher Ed: C-ing the Future
Malaysian Higher Ed: C-ing the Future
 
Peter Shanley, Principal & Evangelist at Neo
Peter Shanley, Principal & Evangelist at Neo Peter Shanley, Principal & Evangelist at Neo
Peter Shanley, Principal & Evangelist at Neo
 
AI for SDGs and International Development - Basics of AI
AI for SDGs and International Development - Basics of AIAI for SDGs and International Development - Basics of AI
AI for SDGs and International Development - Basics of AI
 
Augmented Reality and Virtual Reality in Aviation
Augmented Reality and Virtual Reality in AviationAugmented Reality and Virtual Reality in Aviation
Augmented Reality and Virtual Reality in Aviation
 
Snowforce 2017 Keynote - Peter Coffee
Snowforce 2017 Keynote - Peter CoffeeSnowforce 2017 Keynote - Peter Coffee
Snowforce 2017 Keynote - Peter Coffee
 
20210908 jim spohrer naples forum_2021 v1
20210908 jim spohrer naples forum_2021 v120210908 jim spohrer naples forum_2021 v1
20210908 jim spohrer naples forum_2021 v1
 
Visuals in business consulting
Visuals in business consultingVisuals in business consulting
Visuals in business consulting
 
Visualizing the Agency of the Future
Visualizing the Agency of the FutureVisualizing the Agency of the Future
Visualizing the Agency of the Future
 
Lec 02
Lec 02Lec 02
Lec 02
 
PDL Distinguished Alumni Talk
PDL Distinguished Alumni TalkPDL Distinguished Alumni Talk
PDL Distinguished Alumni Talk
 
Spohrer GAMP 20230628 v17.pptx
Spohrer GAMP 20230628 v17.pptxSpohrer GAMP 20230628 v17.pptx
Spohrer GAMP 20230628 v17.pptx
 
2020 Future Work Skills" by Institute of the Future (pdf)
2020 Future Work Skills" by Institute of the Future (pdf)2020 Future Work Skills" by Institute of the Future (pdf)
2020 Future Work Skills" by Institute of the Future (pdf)
 

Mais de Albert Y. C. Chen

Mais de Albert Y. C. Chen (14)

Building ML models for smart retail
Building ML models for smart retailBuilding ML models for smart retail
Building ML models for smart retail
 
為何VC不投資我的AI新創?
為何VC不投資我的AI新創?為何VC不投資我的AI新創?
為何VC不投資我的AI新創?
 
數據特性 vs AI產品設計與實作
數據特性 vs AI產品設計與實作數據特性 vs AI產品設計與實作
數據特性 vs AI產品設計與實作
 
AI創新創業的商業模式與專案風險管理
AI創新創業的商業模式與專案風險管理AI創新創業的商業模式與專案風險管理
AI創新創業的商業模式與專案風險管理
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
 
Machine Learning Foundations
Machine Learning FoundationsMachine Learning Foundations
Machine Learning Foundations
 
AI智慧服務推動經驗分享
AI智慧服務推動經驗分享AI智慧服務推動經驗分享
AI智慧服務推動經驗分享
 
AI gold rush, tool vendors and the next big thing
AI gold rush, tool vendors and the next big thingAI gold rush, tool vendors and the next big thing
AI gold rush, tool vendors and the next big thing
 
影音大數據商機挖掘
影音大數據商機挖掘影音大數據商機挖掘
影音大數據商機挖掘
 
媒體、影視產業、AI新創
媒體、影視產業、AI新創媒體、影視產業、AI新創
媒體、影視產業、AI新創
 
Improving Spatiotemporal Stability for Object Detection and Classification
Improving Spatiotemporal Stability for Object Detection and ClassificationImproving Spatiotemporal Stability for Object Detection and Classification
Improving Spatiotemporal Stability for Object Detection and Classification
 
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
 
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Find Your Passion and Make a Difference in Your Career

  • 1. Find your passion and make a difference in your career! Albert Y. C. Chen, Ph.D. Vice President, R&D Viscovery
  • 2. Albert Y. C. Chen, Ph.D. 陳彥呈博⼠士 albert@viscovery.com http://slideshare.net/albertycchen • Experience 2017-present: Vice President of R&D @ Viscovery 2015-2017: Chief Scientist @ Viscovery 2015-2015: Principal Scientist @ Nervve Technologies 2013-2014 Senior Scientist @ Tandent Vision Science 2011-2012 @ GE Global Research, Computer Vision Lab • Education Ph.D. in Computer Science, SUNY-Buffalo M.S. in Computer Science, NTNU B.S. in Computer Science, NTHU
  • 3. • Face it. You'll be working the majority of your time, for the rest of your life. • Do something meaningful, do something you are passionate about. Life 24 36 48 60 72
  • 4. • What are the things that you can't wait to get up every morning and do it all over again? What are you passionate about? I've got a dream!
  • 5. My crush on CV, HCI, MM, ML... Can't wait to figure out the world! Freestyle Sketching Stage AirTouch waits in background for the initialization signal Initialize Terminate Output image database Start: Results CBIR query Airtouch HCI interface for Content-based Image Retrieval
  • 6. Interactive Segmentation & Classification • Segmentation then classification: • computationally more efficient, • results in much higher classification accuracy. • Pioneered the “pixel label propagation” field. • First to utilize superpixels and supervoxels for the task. FG Traditional Spatial Propagation Pixel label map Label a subset of pixels BG Spatio-temporal Propagation time
  • 7. Image/Video Object Recognition and Content Understanding approaches person carries gives recieves Ontology object Person 1 Person 1Person 2 High-Level Mid-Level approach activity receives gives carries activity activity activity Time Reasoning x x x Low-Level x x x x
  • 8. Learning and Adapting Optimal Classifier Parameters subspace B subspace A subspace C Image-level feature space priors Patch-level feature space posterior probability suggest optimal parameter configuration
  • 9. Graphical Models and Stochastic Optimization A (a) The space-time volume of a video showing the objects (A--F) and their appearing time-span. space time A B C D E F B E F C D (b) The temporal relationship graph. An edge between two vertices mean that the two objects overlap in time. (c) The goal is: cover all objects with the smallest number of "ground truth key frames". space time A B C D E F key 1 key 2 A B E F C D (d) This translates to: iteratively solving the max clique problem until all vertices belong to a clique. A B E F C D key 2 key 1 frame t-1 frame t layer n layer n layer n+1 layer n+1 Temporal Shift Shift µ
  • 10. Medical Imaging and Geospatial Imaging GNN detection and segmentation in Lung CT geospatial imaging: building detection Brain tumor detection and segmentation in MR images.
  • 11. 1. W.Wu,A.Y. C. Chen, L. Zhao, and J. J. Corso. Brain tumor detection and segmentation in a CRF framework with pixel-wise affinity and superpixel-level features. International Journal of Computer Assisted Radiology and Surgery, 2015. 2. S. N. Lim,A.Y. C. Chen and X.Yang. Parameter Inference Engine (PIE) on the Pareto Front. In Proceedings of International Conference of Machine Learning,Auto ML Workshop, 2014. 3. A.Y.C. Chen and J.J. Corso.Temporally consistent multi-class video-object segmentation with the video graph-shifts algorithm. In Proceedings of IEEE Workshop on Applications of ComputerVision, 2011. 4. D.R. Schlegel,A.Y.C. Chen, C. Xiong, J.A. Delmerico, and J.J. Corso. Airtouch: Interacting with computer systems at a distance. In Proceedings of IEEE Workshop on Applications of Computer Vision, 2011. 5. A.Y.C. Chen and J.J. Corso. On the effects of normalization in adaptive MRF Hierarchies. In Proceedings of International Symposium CompIMAGE, 2010. 6. A.Y.C. Chen and J.J. Corso. Propagating multi-class pixel labels throughout video frames. In Proceedings of IEEE Western NewYork Image Processing Workshop, 2010. 7. A.Y. C. Chen and J. J. Corso. On the effects of normalization in adaptive MRF Hierarchies. Computational Modeling of Objects Represented in Images, pages 275–286, 2010. 8. Y.Tao, L. Lu, M. Dewan,A.Y. C. Chen, J. J. Corso, J. Xuan, M. Salganicoff, and A. Krishnan. Multi-level ground glass nodule detection and segmentation in ct lung images. Medical Image Computing and Computer-Assisted Intervention, 2009. 9. A.Y.C. Chen, J.J. Corso, and L.Wang. Hops: Efficient region labeling using higher order proxy neighborhoods. In Proceedings of IEEE International Conference on Pattern Recognition, 2008.
  • 12. • I gained the following capabilities throughout my graduate studies: • problem solving skills (research/engineering), • the art of teaching, • entrepreneurship. • Furthermore, I'm really into my field of study, AI. • I asked myself, how can I balance between these things for my career? Upon getting my Ph.D.
  • 13. • Realized being Principal Investigator (PI) is not that much different from being an entrepreneur! • Entrepreneur has more control over where the money is spent, how quickly it is spent, and get more out of fruit of hard labor! • Also, I get to do RD, BD, and mentoring, all at once! • However, I still have many things to learn before I fly solo. Inflection point
  • 14. • How century-old multi-national corporation remain competitive by reinventing itself. • How wave after wave of innovations become the new pillars of the US economy. • How Taiwan failed to gain significance in the Internet-based economy in the past 20 years. • How I might be able to help make a difference with my limited capability, in my limited lifetime. Throughout the years, I observed...
  • 15. Why Risk Innovating? • Good business model NEVER last forever. • Average “shelf life” on S&P 500: 20 years. • 100-year old companies constantly reinvent themselves every 10-20 years • Startups contribute to 20% of USA’s GDP.
  • 16. Change is the only constant -Heraclitus (535 BC - 475 BC)
  • 17. Change is the only constant -Heraclitus (535 BC - 475 BC)
  • 18. The Death of a Good Business Model • Foxconn 20 year revenue v.s. net profit (now at 5%)
  • 19. How 100 year old corporations remain innovative & competitive GE Schenectady, 1896
  • 20. History of change at GE • 1886: one of the 12 original companies on the Dow Jone Industrial Average (also the only one remaining). • 1889: lightbulbs • 1919: radios • 1927: TV • 1941: jet engine • 1960: nuclear power • 1971: room AC units • 1995: MRI
  • 21. History of change at IBM • 1960s: mainframe computer • 1980s: personal computer • 2000s: integrated solutions • 2020s: AI, Watson
  • 22. How about the leading Semiconductor companies?
  • 23. NVidia reinventing itself —2 times in 20 years
  • 24. “Bad money drives out good” in the desktop GPU market
  • 25. The rise of mobile computing, and how NVidia missed the boat!
  • 26. NVidia’s Tegra mobile processors never took off then, the market saturated…
  • 27. NVidia not just survived. NVidia is thriving!
  • 28. Meet the new NVidia: Deep Learning, Deep Learning, and still, Deep Learning
  • 29.
  • 30. The king is dead, long live the king!
  • 31. Why startups have to be innovative to stay competitive • Startup: a company in pursuit of a bold new business model that can scale.
  • 32. Must take risks to gain edge, being small
  • 33. The Venture Capital Funnel is extremely efficient at filtering ideas
  • 34. Don't romanticize startup life; it's a cruel struggle for survival...
  • 35. Before making the leap, 
 you must ask yourself • Are you passionate about your idea? • Is your idea disruptive enough? • What is your business plan? • What is it? • Can it make money? • What is the future of the idea? • What is your competitive advantage? • How do you build up your entry barrier?
  • 36. Furthermore, be prepared to face constant, repeated rejections NO Ha Ha Ha NO NO
  • 37. It doesn't really matter how you start
  • 38. nor does it matter when you start
  • 39. It's important though, to fail early, fail often, fail forward! Build Measure Learn The lean cycle FAIL? Ideate pivot leap of faith MVP Build Measure Learn Start all over again! MVP
  • 40. It's also important to quickly build up your entry barrier! • Speed • Team • Technology • Market (users) niche, niche, and niche!
  • 41. The Goldilocks zone of innovation
  • 42. The Goldilocks zone of innovation Business Relevance Academic Relevance plentiful resources; hierarchical organization lack of resources; responsive organization traditional corporations talking “innovation” corporate research startups struggling to survive academic spinoffs MSR
  • 43. Niche market/technology for AI startups (BCG AI Report, 2016/10) appl. layer tech layer infra layer solution platform libraries modules data machine computing power data accumulation via open API AI/DNN library AI/DNN library general purpose platforms general purpose platforms app-specific platforms app-specific platforms app app app app app HW co. VerticalAIStartups agri. manu. med. fin. retail trans. E.g., 1: Google, Amazon, FB, 2: IBM, 3: Walmart, 5: NVidia
  • 44. Vertical AI Startups Solving industry-specific problems by combining AI and Subject Matter Expertise. • Full Stack Products • Subject Matter Expertise • Proprietary Data • AI delivers core value (Bradford Cross, 2017/06/14)
  • 45. Why Go Vertical? • Don’t get ripped off—don’t get disconnected from key customer/providers. • Tasks get commoditized • Software is eating the world—every company in every industry needs to be a tech company. • Enterprise exits come in cohorts (Bradford Cross, 2017/06/14)
  • 47. (Bradford Cross, 2017/06/14) • choose large TAM with high margin • avoid 1% fallacy • avoid confirmation bias So, which vertical?
  • 48. (Bradford Cross, 2017/06/14) AI startups total investmentUnicorn tally
  • 49.
  • 50. Big Innovation Large Market Wide Exit Beware of the "Technical Founder's Trap" cannot exit no edge another SME Easy trap for technical founders to fall into. Not necessarily bad for a first step. Dominate the small market and then expand to neighboring one.
  • 51. Big Innovations with small markets or narrow exits
  • 53. Solving Panorama Problem with Markov Random Fields Input:
  • 54. Solving Panorama Problem with Markov Random Fields Input:
  • 55. Solving Panorama Problem with Markov Random Fields Input:
  • 56. Solving Panorama Problem with Markov Random Fields Input:
  • 57. Solving Panorama Problem with Markov Random Fields Input:
  • 58. Solving Panorama Problem with Markov Random Fields Input:
  • 59. Solving Panorama Problem with Markov Random Fields Input:
  • 60. Solving Panorama Problem with Markov Random Fields
  • 61. Solving Panorama Problem with Markov Random Fields ICM (Iterated Conditional Modes), 1986
  • 62. Solving Panorama Problem with Markov Random Fields Belief Propagation (1980-2000)
  • 63. Solving Panorama Problem with Markov Random Fields Graph-Cuts (alpha expansion), 2001
  • 65. Solving Photosynthesis Problems with Alpha-matting (2000s-now)
  • 68. 2005 winner, Stanley (Stanford), 3mph through desert
  • 69.
  • 71. 2007 winner, Boss (CMU), 13mpg through the city
  • 72. Self Driving Cadillac, US congressman to airport, 2013
  • 73. Google Self Driving Car, 2015
  • 76. NVidia Self Driving Car, 2016
  • 77. How did we come this far? Race car drivers know the trick
  • 78. Focus on Free Space / Drivable Area, not Obstacles!
  • 79. Not so major innovations with big monetary gains
  • 80. Sports—NFL first down line (1995-now)
  • 81. Industrial Inspection: turbofan jet engine blade maintenance • Some seemingly daunting machine vision tasks actually works with relatively simple image processing algorithms.
  • 86. Other Applications in Business Intelligence • Measure brand exposure. • Measure sponsorship effectiveness. • Loss prevention and retail layout optimization.
  • 87. Some AI verticals that didn't quite work out for me... • Medical Imaging • Multimedia • Advanced Driver Assistance Systems (ADAS) • Geospatial imaging for Energy Sector • Surveillance • Business Intelligence • Advertisements
  • 88. Intrinsic Imaging at Tandent Vision Science Computer Vision would be half-solved without shadows! LightOriginal Image Surface
  • 89. Tandent Lightbrush Video Tutorial for Tandent Lightbrush: https://vimeo.com/47009123
  • 90.
  • 91. Issues • Highly anticipated, highly acclaimed, but small crowd at $500 a license. • Adobe Photoshop monopoly and the “not invented here” syndrome. • Adobe’s arch-rival, Corel (Corel Draw, Paint Shop Pro, Ulead PhotoImpact) was DYING and asked too much from the botched deal.
  • 92. Have fun scribbling out your shadows in photoshop! Poor Bob from Adobe wasted 9 minutes removing just 1 shadow
  • 93. Intrinsic Imaging for improving the RGB signal in autonomous driving
  • 95. Retrospect • 20 researchers burned 25 million in 8 years; investors got 50 patents in return, period. • Overestimated the total addressable market size, in a market with existing monopoly. • Many missed opportunities. Counterexample of the lean startup model.
  • 96. Some SfM, SLAM startups
  • 97. Satellite/Aerial Imagery Analysis • 40cm resolution at 30fps for 90 sec for any location on earth. • One LEO satellite revisits any place on Earth every 3 days. • Need 24 satellites to revisit any place on Earth every 3 hours.
  • 98. Challenges for Single satellite depth estimation and 3D reconstruction • At 30fps, a LEO satellite travels 250m between two consecutive frames —> theoretically sufficient for cm-level depth estimation. • Sources of Noise: • Camera distortions • Atmospheric Disturbance • Ground vegetation • Sub-pixel sampling noise 1 2
  • 99. What happened? • B2B customers takes too long to strike deals. • Google ate us alive in just 3 months, while we were still pitching for VC-funding with our prototype.
  • 100. Visual Search at Nervve
  • 101. Retrospect • Growth pains expanding from intelligence community clients to advertisement clients. • Forming the right team of engineers and researchers and moving at the right pace. • For any Computer Vision/Machine Learning company: • Researchers that cannot program—> OUT • Engineers that don’t know math —> OUT
  • 102. Viscovery = Video Discovery Optical Character Recognition Offline Recognition 2013 2014 Product Recognition 2015 Video Content related Advertisements 2017 Wearable Devices Video Content Discovery & Interaction 2016 Leading provider of visual search technology in APAC
  • 103. Once in a lifetime opportunity in China’s video streaming market Traditional Media Free Content, Ad Revenue Subscription Revenue Do Nothing? Sitting Duck! Improve Ad Revenue Content- related video ads Own platform Shared/ congregated platform customized recommendatio ns ad measure / brand safety Image/Video content owners struggling to monetize Can AI solve it all?
  • 105. Why not go ahead and analyze all videos? COST! 0600120018002400 2014 2015 2016 2017 2018 2019 Revenue Hardware Cost Data Collection Cost Model Training Cost Technology Inflection Point: • Computer vision cost has dropped significantly while market demand has increased beyond exploratory stages. • Overall cost reduction resulted from accumulated domain expertise, efficient machines learning techniques, and lower computing cost. Case study on leading e-commerce site in APAC, with 1M SKU and >20M calls/month.
  • 106. • Business model vs videos analyzed Need to deliver value while analyzing! o2o marketing ad effectiveness analysis campaign effectiveness analytics direct-sell content-related ads programmatic content-related ads Big Data Insights 0 2.5 5 7.5 10 videos analyzed (10^N) • Extremely costly to analyze video content. Google charges $1.5/1000 images. with 3600 seconds/hour, it will cost $5.4 / hour video / single single recognition task. • Costly to store videos and their recognition results.
  • 107. Hindsight on "right approach" for media Increase user’s time spent & stickiness Multiply traffic w/o subsidizing growth. (4) Video content-related ads & e-commerce (3) Ad exposure measure (2) Content recommendation Grow user & advertiser budget organically with new services (1) Image/Video Content Recognition as a Service Focus on Revenue Growth (5)DataMiningandAnalyticsasaService Goal Media Size
  • 108. Value proposition to media: Start making money the first day! user/time/ad/budgetincrease(times) 0 1 2 3 4 TotalSiteRevenueIncrease(times) 04.5913.518 current Better recommendation Improve user stickiness Better measurement Grow user & ad budget Content-related ad, EC Focus on Revenue # users time / user ad / user*time budget / ad total site revenue
  • 109. Viscovery's Complete Solution
 for Media and Beyond Media AdRetail Archive & Search Content Recommendation Video Generator* Brand Insight Ad Effectiveness Measure Brand Safety Assurance Visual Content Recognition Smart Checkout Video to EC Consumer Insights
  • 110. Media AI: Video content archive & search
  • 111. Media AI: Content Recommendation Article-to-Article, Article-to-Video, Video-to-Video The 31 Tiniest Cat Breeds Cats are notoriously small household pets, but some are even tinier than the norm. If you have tight living quarters or simply can’t get enough of cats little enough to resemble kittens forever, there are plenty of small cat breeds to suit you. When it comes to cats, bigger isn’t always better, as some of the most charming felines come in small packages. Meaningful Terms Re-weighting High-dimension Mapping Terms (Attribute) Extraction Para
  • 112. Media AI: Video Generator powering the "pivot to video" movement! Text Analysis (NLP) Image
 Recognition Key Phrases Sentiment Audio Subtitles F/O/S Concept Media Database Generation Engine
  • 113. Ad AI: ad exposure measure # Appearances 68 times # Appearances with Duration > 5 sec 2 times # Appearances with Duration > 15 sec 0 # Appearances with Duration > 30 sec 0 Longest Duration of Appearance 9 sec Average Duration of Appearance 2.12 sec Average Appearance Interval 37.79 sec Total Duration of Appearance 144 sec Ad Exposure for Audi @ China GT (Shanghai) 2017 # times of logo occupying >5% screen size 8 times Duration of logo occupying > 5% screen size 16 sec # times of logo occupying > 10% screen size 0 times Duration of logo occupying > 10% screen size 0 sec Average logo size (relative to screen) 2.12% Maximum logo size (relative to screen) 11.05%
  • 115. Ad AI: Brand safety assurance Terroris Violence DrugPolitical Explicit Content
  • 116. Retail AI: o2o redirection
  • 117. Retail AI: smart checkout Oreo cookie …………… $4.95 Bounty Paper Towel … $3.45 Cal. Red wine………… $12.44 -------------------------- Total $20.84 Scan payment QR Code: wechat Alipay
  • 119. Retail AI: merchandise management •2D+3D cameras for head and eye tracking •Detect Point-Of-Interest •Consumer profiling and data collection •Integrate with kiosk big screen devices •ESL Integration Smart POS • Smart check out • Client-facing POS screen with 3D camera • POS-on-the-run • Customer Big Data • ESL integration Smart Shelf
  • 120. Retail AI: Cunsumer Insights Female, 30-35 Ms. Mary Rice Status: VIP Housewife Brand: P&G, LV, Nike, Tiffany Already holding product A Reaching for product B on the shelf E.g., People who appear interested in milk powder will get an eDM containing relevant promo info. Recognize customer appearance and behavior characteristics to establish a buying recommendation database.
  • 121. • Clear segment focus: Any corporation smaller than Google, Facebook, Baidu, Alibaba, Tencent. • Need to go from 0-80 fast. Then, help them go from 80-90 and catch up quickly. • How can we make the "0-80" part scalable? Building Technology Barrier
  • 124. Some notes • Managing innovation and delivering promises. • Visionary leader v.s. stubborn fool v.s. big liar.
  • 125. When something is important enough, you do it even if the odds are not in your favor. Elon Musk Falcon 9 takeoff Falcon 9 decelerate Falcon 9 vertical touchdown