Prompt部の詳細
● 1. Stastical Prompt
○ いくつactionが動画中にあるか
○ The video has {num} actions.
● 2. Ordinal Prompt
○ 何番目のactionか
○ This is the {ord_i} action in the video.
● 3. Semantic Prompt
○ “{ord_i}, the person is performing the
action step of {vp_i}”
● 3+1. Integrated Prompt
○ 全部
○ Semanticを全て文として並べる
評価用データセット
● 50Salads: 50 top view 30-fps instructional videos regarding salad preparation
○ 19 kind of actions
● Georgia Tech Egocentric Activities(GTEA): 28 egocentric 15-fps instructional
videos daily kitchen activities
○ 74 class of actions
● Breakfast: 1,712 third person 15-fps videos of breakfast preparation activities.
○ 48 type of different actions
○