Soumith Chintala, AI research engineer, Facebook, at MLconf NYC 2017

•

3 gostaram•953 visualizações

Soumith Chintala is a Researcher at Facebook AI Research, where he works on deep learning, reinforcement learning, generative image models, agents for video games and large-scale high-performance deep learning. Prior to joining Facebook in August 2014, he worked at MuseAmi, where he built deep learning models for music and vision targeted at mobile devices. He holds a Masters in CS from NYU, and spent time in Yann LeCun’s NYU lab building deep learning models for pedestrian detection, natural image OCR, depth-images among others. Abstract Summary: Dynamic Deep Learning: a paradigm shift in AI research and tools: AI research has seen many shifts in the last few years. We’ve seen research go from using static datasets such as Imagenet to being more dynamic and online in self-driving cars, robots and game-playing.Many dynamic environments such as Universe and Starcraft are being used in AI research to solve problems pertaining to reinforcement learning and online learning. In this talk, I shall discuss these shifts in research. Tools such as PyTorch, DyNet and Chainer have popped up to cope up with the paradigm shift, enabling cutting-edge AI, and I shall discuss these as well.

Tecnologia

Dynamic Deep Learning
A paradigm shift in AI research and Tools
Soumith Chintala
Facebook AI Research

Today's AI
Active Research &
Future AI
Tools for AI
keeping up with change
Overview

Today's AI Future AI Tools for AI
Today's AI DenseCap by Justin Johnson & group
https://github.com/jcjohnson/densecap

Today's AI Future AI Tools for AI
Today's AI
DeepMask by Pedro Pinhero & group

Today's AI Future AI Tools for AI
Today's AI
Machine Translation

Today's AI Future AI Tools for AI
Today's AI
Text Classiﬁcation (sentiment analysis etc.)
Text Embeddings
Graph embeddings
Machine Translation
Ads ranking

Data
BatchNorm
ReLU
Conv2d
Model
ObjectiveTrain Model
Today's AI Future AI Tools for AI
Today's AI

Data
BatchNorm
ReLU
Conv2d
Model
ObjectiveTrain Model
BatchNorm
ReLU
Conv2d
Deploy & Use New
Data Prediction
Today's AI Future AI Tools for AI
Today's AI

Data
BatchNorm
ReLU
Conv2d
ObjectiveTrain Model
BatchNorm
ReLU
Conv2d
Deploy & Use New
Data Prediction
Today's AI Future AI Tools for AI
Static datasets + Static model structure
Today's AI

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Self-driving Cars

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Agents trained in many environments
Cars Video games
Internet

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Dynamic Neural Networks
self-adding new memory or layers
changing evaluation path based on inputs

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Live
data
BatchNorm
ReLU
Conv2d
Prediction
Continued Online Learning

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Sample-1
BatchNorm
ReLU
Conv2d
Prediction
Data-dependent change in model structure
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Sample-2
BatchNorm
ReLU
Conv2d
Prediction
Data-dependent change in model structure
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d

Today's AI Future AI Tools for AI
Current AI Research / Future AI
Sample
BatchNorm
ReLU
Conv2d
Prediction
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
Change in model-capacity at runtime
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d
BatchNorm
ReLU
Conv2d

Today's AI Future AI Tools for AI
A next-gen framework for AI
•Interop with many dynamic environments
- Connecting to car sensors should be as easy as training on a dataset
- Connect to environments such as OpenAI Universe
•Dynamic Neural Networks
- Change behavior and structure of neural network at runtime
•Minimal Abstractions
- more complex AI systems means harder to debug without a simple API

Today's AI Future AI Tools for AI
Tools for AI research and deployment
Many machine learning tools and deep learning frameworks

Today's AI Future AI Tools for AI
Tools for AI research and deployment
Static graph frameworks Dynamic graph frameworks

Today's AI Future AI Tools for AI
Dynamic graph Frameworks
•Model is constructed on the ﬂy at runtime
•Change behavior, structure of model
•Imperative style of programming

PyTorch Autograd
from torch.autograd import Variable

from torch.autograd import Variable
x = Variable(torch.randn(1, 10))
prev_h = Variable(torch.randn(1, 20))
W_h = Variable(torch.randn(20, 20))
W_x = Variable(torch.randn(20, 10))
PyTorch Autograd

http://pytorch.org
Released Jan 18th
20,000+ downloads
200+ community repos
3000+ user posts
With ❤ from

Mais conteúdo relacionado

Destaque

Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...MLconf

Alex Smola, Director of Machine Learning, AWS/Amazon, at MLconf SF 2016MLconf

Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016MLconf

Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016MLconf

Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...MLconf

Corinna Cortes, Head of Research, Google, at MLconf NYC 2017MLconf

Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...MLconf

Virginia Smith, Researcher, UC Berkeley at MLconf SF 2016MLconf

Yi Wang, Tech Lead of AI Platform, Baidu, at MLconf 2017MLconf

Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016MLconf

Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016MLconf

Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017MLconf

Luna Dong, Principal Scientist, Amazon at MLconf Seattle 2017MLconf

Serena Yeung, PHD, Stanford, at MLconf Seattle 2017 MLconf

Ross Goodwin, Technologist, Sunspring, MLconf NYC 2017MLconf

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016MLconf

Anjuli Kannan, Software Engineer, Google at MLconf SF 2016MLconf

Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017MLconf

Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017MLconf

Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017MLconf

Destaque (20)

Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...

Alex Smola, Director of Machine Learning, AWS/Amazon, at MLconf SF 2016

Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016

Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016

Caroline Sinders, Online Harassment Researcher, Wikimedia at The AI Conferenc...

Corinna Cortes, Head of Research, Google, at MLconf NYC 2017

Hanie Sedghi, Research Scientist at Allen Institute for Artificial Intelligen...

Virginia Smith, Researcher, UC Berkeley at MLconf SF 2016

Yi Wang, Tech Lead of AI Platform, Baidu, at MLconf 2017

Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016

Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016

Ashirth Barthur, Security Scientist, H2O, at MLconf Seattle 2017

Luna Dong, Principal Scientist, Amazon at MLconf Seattle 2017

Serena Yeung, PHD, Stanford, at MLconf Seattle 2017

Ross Goodwin, Technologist, Sunspring, MLconf NYC 2017

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016

Anjuli Kannan, Software Engineer, Google at MLconf SF 2016

Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017

Mayur Thakur, Managing Director, Goldman Sachs, at MLconf NYC 2017

Ben Lau, Quantitative Researcher, Hobbyist, at MLconf NYC 2017

Mais de MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf

Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf

Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf

Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf

Josh Wills - Data Labeling as Religious ExperienceMLconf

Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf

Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf

Meghana Ravikumar - Optimized Image Classification on the CheapMLconf

Noam Finkelstein - The Importance of Modeling Data CollectionMLconf

June Andrews - The Uncanny Valley of MLMLconf

Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf

Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf

Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf

Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf

Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf

Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf

Neel Sundaresan - Teaching a machine to codeMLconf

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf

Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf

Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf

Mais de MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...

Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding

Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...

Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush

Josh Wills - Data Labeling as Religious Experience

Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...

Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...

Meghana Ravikumar - Optimized Image Classification on the Cheap

Noam Finkelstein - The Importance of Modeling Data Collection

June Andrews - The Uncanny Valley of ML

Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks

Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...

Vito Ostuni - The Voice: New Challenges in a Zero UI World

Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...

Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...

Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...

Neel Sundaresan - Teaching a machine to code

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...

Soumith Chintala - Increasing the Impact of AI Through Better Software

Roy Lowrance - Predicting Bond Prices: Regime Changes

Último

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

A Call to Action for Generative AI in 2024Results

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Slack Application Development 101 Slidespraypatel2

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

A Domino Admins Adventures (Engage 2024)Gabriella Davis

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Soumith Chintala, AI research engineer, Facebook, at MLconf NYC 2017

2. Dynamic Deep Learning A paradigm shift in AI research and Tools Soumith Chintala Facebook AI Research

3. Today's AI Active Research & Future AI Tools for AI keeping up with change Overview

4. Today's AI Future AI Tools for AI Today's AI DenseCap by Justin Johnson & group https://github.com/jcjohnson/densecap

5. Today's AI Future AI Tools for AI Today's AI DeepMask by Pedro Pinhero & group

6. Today's AI Future AI Tools for AI Today's AI Machine Translation

7. Today's AI Future AI Tools for AI Today's AI Text Classiﬁcation (sentiment analysis etc.) Text Embeddings Graph embeddings Machine Translation Ads ranking

8. Data BatchNorm ReLU Conv2d Model ObjectiveTrain Model Today's AI Future AI Tools for AI Today's AI

9. Data BatchNorm ReLU Conv2d Model ObjectiveTrain Model BatchNorm ReLU Conv2d Deploy & Use New Data Prediction Today's AI Future AI Tools for AI Today's AI

10. Data BatchNorm ReLU Conv2d ObjectiveTrain Model BatchNorm ReLU Conv2d Deploy & Use New Data Prediction Today's AI Future AI Tools for AI Static datasets + Static model structure Today's AI

11. Data BatchNorm ReLU Conv2d ObjectiveTrain Model BatchNorm ReLU Conv2d Deploy & Use New Data Prediction Today's AI Future AI Tools for AI Static datasets + Static model structure Oﬄine Learning Today's AI

12. Today's AI Future AI Tools for AI Current AI Research / Future AI Self-driving Cars

13. Today's AI Future AI Tools for AI Current AI Research / Future AI Agents trained in many environments Cars Video games Internet

14. Today's AI Future AI Tools for AI Current AI Research / Future AI Dynamic Neural Networks self-adding new memory or layers changing evaluation path based on inputs

15. Today's AI Future AI Tools for AI Current AI Research / Future AI Live data BatchNorm ReLU Conv2d Prediction Continued Online Learning

16. Today's AI Future AI Tools for AI Current AI Research / Future AI Sample-1 BatchNorm ReLU Conv2d Prediction Data-dependent change in model structure BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d

17. Today's AI Future AI Tools for AI Current AI Research / Future AI Sample-2 BatchNorm ReLU Conv2d Prediction Data-dependent change in model structure BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d

18. Today's AI Future AI Tools for AI Current AI Research / Future AI Sample BatchNorm ReLU Conv2d Prediction BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d Change in model-capacity at runtime BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d

19. Today's AI Future AI Tools for AI Current AI Research / Future AI Sample BatchNorm ReLU Conv2d Prediction BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d Change in model-capacity at runtime BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d BatchNorm ReLU Conv2d

20. Today's AI Future AI Tools for AI A next-gen framework for AI •Interop with many dynamic environments - Connecting to car sensors should be as easy as training on a dataset - Connect to environments such as OpenAI Universe •Dynamic Neural Networks - Change behavior and structure of neural network at runtime •Minimal Abstractions - more complex AI systems means harder to debug without a simple API

21. Today's AI Future AI Tools for AI Tools for AI research and deployment Many machine learning tools and deep learning frameworks

22. Today's AI Future AI Tools for AI Tools for AI research and deployment Static graph frameworks Dynamic graph frameworks

23. Today's AI Future AI Tools for AI Dynamic graph Frameworks •Model is constructed on the ﬂy at runtime •Change behavior, structure of model •Imperative style of programming

24. PyTorch Autograd from torch.autograd import Variable

25. from torch.autograd import Variable x = Variable(torch.randn(1, 10)) prev_h = Variable(torch.randn(1, 20)) W_h = Variable(torch.randn(20, 20)) W_x = Variable(torch.randn(20, 10)) PyTorch Autograd

26. from torch.autograd import Variable x = Variable(torch.randn(1, 10)) prev_h = Variable(torch.randn(1, 20)) W_h = Variable(torch.randn(20, 20)) W_x = Variable(torch.randn(20, 10)) i2h = torch.mm(W_x, x.t()) h2h = torch.mm(W_h, prev_h.t()) MMMM PyTorch Autograd

27. from torch.autograd import Variable x = Variable(torch.randn(1, 10)) prev_h = Variable(torch.randn(1, 20)) W_h = Variable(torch.randn(20, 20)) W_x = Variable(torch.randn(20, 10)) i2h = torch.mm(W_x, x.t()) h2h = torch.mm(W_h, prev_h.t()) next_h = i2h + h2h MMMM PyTorch Autograd

28. from torch.autograd import Variable x = Variable(torch.randn(1, 10)) prev_h = Variable(torch.randn(1, 20)) W_h = Variable(torch.randn(20, 20)) W_x = Variable(torch.randn(20, 10)) i2h = torch.mm(W_x, x.t()) h2h = torch.mm(W_h, prev_h.t()) next_h = i2h + h2h Add MMMM PyTorch Autograd

29. from torch.autograd import Variable x = Variable(torch.randn(1, 10)) prev_h = Variable(torch.randn(1, 20)) W_h = Variable(torch.randn(20, 20)) W_x = Variable(torch.randn(20, 10)) i2h = torch.mm(W_x, x.t()) h2h = torch.mm(W_h, prev_h.t()) next_h = i2h + h2h next_h = next_h.tanh() Add MMMM Tanh PyTorch Autograd

30. from torch.autograd import Variable x = Variable(torch.randn(1, 10)) prev_h = Variable(torch.randn(1, 20)) W_h = Variable(torch.randn(20, 20)) W_x = Variable(torch.randn(20, 10)) i2h = torch.mm(W_x, x.t()) h2h = torch.mm(W_h, prev_h.t()) next_h = i2h + h2h next_h = next_h.tanh() next_h.backward(torch.ones(1, 20)) Add MMMM Tanh PyTorch Autograd

31. http://pytorch.org Released Jan 18th 20,000+ downloads 200+ community repos 3000+ user posts With ❤ from

Soumith Chintala, AI research engineer, Facebook, at MLconf NYC 2017

Recomendados

Recomendados

Mais conteúdo relacionado

Destaque

Destaque (20)

Mais de MLconf

Mais de MLconf (20)

Último

Último (20)

Soumith Chintala, AI research engineer, Facebook, at MLconf NYC 2017