DL Reading Group (DL輪読会) Lightning Talk: Reading "Embodied Question Answering" and "World Models"

Tatsuya Matsushima

2018/03/30 DL Reading Group (DL輪読会) lightning talk: a reading of "Embodied Question Answering" and "World Models"

“Embodied Question Answering”
“World Models”
2018.03.30
Tatsuya Matsushima @__tmats__

David Ha, Jürgen Schmidhuber
https://arxiv.org/abs/1803.10122
“World Models” (arXiv, 2018)
Overview
This paper proposes learning the dynamics of the environment and the control of the agent separately in RL settings.
- the dynamics of the environment are modeled with a VAE and a Gaussian mixture RNN
- the controller can then be made simpler (with fewer parameters)
Once the model of the environment has been learned, the agent can learn policies without interacting with the real environment (inside a "hallucinated dream"), and the learned policy can even be transferred back to the real setting.
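The "hallucinated dream" idea can be sketched as a rollout loop in which the controller only ever interacts with the learned model, never with the real environment. This is a minimal sketch, not the authors' implementation: the weights are random stand-ins for a trained model, and the names `model_step`, `controller`, `dream_rollout` and the toy dimensions are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
Z_DIM, H_DIM, A_DIM = 4, 8, 2  # toy sizes, chosen only for illustration

# Hypothetical stand-ins for trained weights (random here).
W_h = rng.normal(scale=0.1, size=(H_DIM, H_DIM + Z_DIM + A_DIM))
W_z = rng.normal(scale=0.1, size=(Z_DIM, H_DIM))
W_c = rng.normal(scale=0.1, size=(A_DIM, Z_DIM + H_DIM))


def model_step(z, a, h):
    """Toy recurrent dynamics: update the hidden state, then predict the next latent."""
    h_next = np.tanh(W_h @ np.concatenate([h, z, a]))
    z_next = W_z @ h_next + 0.01 * rng.normal(size=Z_DIM)
    return z_next, h_next


def controller(z, h):
    """Small policy that maps (z, h) to an action in [-1, 1]."""
    return np.tanh(W_c @ np.concatenate([z, h]))


def dream_rollout(z0, steps):
    """Roll the policy forward using only the learned model."""
    z, h = z0, np.zeros(H_DIM)
    trajectory = [z]
    for _ in range(steps):
        a = controller(z, h)
        z, h = model_step(z, a, h)
        trajectory.append(z)
    return np.stack(trajectory)


traj = dream_rollout(np.zeros(Z_DIM), steps=10)
print(traj.shape)
```

In a real setup, `model_step` would be the trained RNN and the initial latent would come from encoding a real observation with the VAE.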
Key Point of Proposed Method
The controller is kept simple by splitting the agent into a "World Model" built around an RNN and a controller with a small number of parameters
- dimensionality reduction with a VAE
- prediction of the latent representation z with a Gaussian mixture RNN
- a simple controller based on a linear model
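To make "simple controller with a linear model" concrete, the sketch below builds a single linear layer that maps the latent z and the RNN hidden state h to an action. The sizes (32-dimensional z, 256-dimensional h, 3 action values) are assumptions not stated on the slide; they are the sizes the paper reports for its CarRacing experiment. The helper names are hypothetical.

```python
import numpy as np


def make_linear_controller(z_dim, h_dim, action_dim, seed=0):
    """Create the weights of a controller a = W [z; h] + b."""
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=0.1, size=(action_dim, z_dim + h_dim))
    b = np.zeros(action_dim)
    return W, b


def act(W, b, z, h):
    """Compute the action from the latent z and the hidden state h."""
    return W @ np.concatenate([z, h]) + b


# Assumed sizes for illustration (not given on the slide).
W, b = make_linear_controller(z_dim=32, h_dim=256, action_dim=3)
n_params = W.size + b.size  # (32 + 256) * 3 + 3
action = act(W, b, np.zeros(32), np.zeros(256))
print(n_params, action.shape)
```

With these sizes the controller has only 867 parameters, while the VAE and the RNN carry most of the model capacity.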
Difference from Previous Work
Large RNNs have high capacity, but in RL settings the credit assignment problem is a bottleneck, so existing methods tended to use smaller RNNs.
In the proposed method, the agent is divided into a model of the environment and a controller, so a large RNN can be used for the environment model.
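The slide does not say how the small controller is trained. One option that avoids per-step credit assignment is a gradient-free evolution strategy, which only needs the total return of each rollout. The paper reports using CMA-ES for this; the sketch below is a much simpler truncation-selection strategy, and `toy_return` is a hypothetical stand-in for an episode's cumulative reward.

```python
import numpy as np


def toy_return(params):
    """Hypothetical stand-in for the cumulative reward of one rollout."""
    target = np.linspace(-1.0, 1.0, params.size)
    return -np.sum((params - target) ** 2)


def evolve(n_params=8, population=32, elite=8, generations=50, sigma=0.3, seed=0):
    """Simple evolution strategy: sample around the mean, keep the best, re-center."""
    rng = np.random.default_rng(seed)
    mean = np.zeros(n_params)
    for _ in range(generations):
        candidates = mean + sigma * rng.normal(size=(population, n_params))
        scores = np.array([toy_return(c) for c in candidates])
        best = candidates[np.argsort(scores)[-elite:]]  # highest-scoring candidates
        mean = best.mean(axis=0)
    return mean


best_params = evolve()
print(toy_return(np.zeros(8)), toy_return(best_params))
```

Because each candidate is scored by a single scalar, this kind of search is practical only when the controller has few parameters, which is what the split into world model and controller provides.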
Main Insights
- First model to achieve the required score on the CarRacing-v0 task
- A task can be solved by training only inside the learned environment model

Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra (Facebook Research)
https://arxiv.org/abs/1711.11543
“Embodied Question Answering” (arXiv, 2017)
Overview
This paper proposes the Embodied Question Answering (EmbodiedQA) task.
The simulator is available on GitHub:
https://github.com/facebookresearch/house3d
Key Point of Proposed Method
Differences from existing QA tasks:
1) The state is presented as a first-person view
2) The agent needs to take actions in order to answer correctly
In the experiments, they use hierarchical RL consisting of a planner and a controller
- The navigation and QA modules are trained separately, then the two modules are joined
Main Insights
Design concept of the task: “Long term objective is to make intelligent agents that can perceive, communicate and act”
- needs active perception
- needs inference with “common sense”
ex) If asked about a car, the agent tries going to the garage
- needs grounding of symbols in the real world
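The planner/controller split used in the experiments can be sketched as two nested loops: the planner chooses which action to take, and the controller decides whether to keep repeating it or hand control back. This is a minimal sketch under that reading of the paper; the hand-written `planner` and `controller` functions and the action names are hypothetical stand-ins for the learned networks.

```python
from typing import Callable, List

Action = str  # e.g. "forward", "turn-left", "turn-right", "stop"


def run_hierarchical_policy(
    planner: Callable[[int], Action],
    controller: Callable[[Action, int], bool],
    max_steps: int = 20,
) -> List[Action]:
    """Planner picks an action; controller decides whether to keep repeating it."""
    executed: List[Action] = []
    decision = 0
    while len(executed) < max_steps:
        action = planner(decision)
        if action == "stop":
            break
        repeats = 0
        while True:
            executed.append(action)
            repeats += 1
            # Return control to the planner when the controller says so.
            if len(executed) >= max_steps or not controller(action, repeats):
                break
        decision += 1
    return executed


# Hypothetical hand-written policies standing in for learned networks.
plan = ["forward", "turn-left", "forward", "stop"]
planner = lambda i: plan[min(i, len(plan) - 1)]
controller = lambda action, repeats: action == "forward" and repeats < 3

steps = run_hierarchical_policy(planner, controller)
print(steps)
```

Here the planner makes only four decisions while the agent executes seven primitive steps, which illustrates how the hierarchy shortens the planner's decision horizon.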
