SlideShare a Scribd company logo
1 of 10
Download to read offline
The AudioVisual Description Profile
    a.k.a. ISO/IEC 15938-9:2005/Amd.1
    a.k.a. MPEG-7 AVDP profile
    a.k.a. the EBU MPEG-7 profile
    a.k.a. the ultimate metadata profile
    a.k.a. …


                                    Dr. Alberto Messina
                                   R&D Area Coordinator
                             Multimedia Information Engineering
               RAI – Centre for Research and Technological Innovation
                                    Turin (ITALY)




Centro Ricerche e Innovazione Tecnologica
Why a new MPEG-7 profile?

     Existing profiles insufficient for a number of reasons




Centro Ricerche e Innovazione Tecnologica
MPEG-7 - AVDP

     Origin
         AVDP was originated in EBU in the context of the technical
         group MIM/SCAIE concerned with the study of automatic
         techniques in media production
     Requirements
         Target application analysis was the starting point to define AVDP
         requirements
              Content Summarisation
              Text Recognition
              Semantic Segmentation
              Copy/Repetition detection
              Personality Identification
              Keywords Extraction
              Subject Classification


Centro Ricerche e Innovazione Tecnologica
Sources of the work and standardisation
              process

     JOANNEUM Research’s Detailed AudioVisual Profile
     NHK’s Metadata Production Framework
     Process
         Proposed to MPEG in July 2010
         Went through standardisation process in 2011
              PDAM
              DAM
              FDAM
         Officialised as a standard in April 2012
         Part 11 (Schema) is now at its final stages too




Centro Ricerche e Innovazione Tecnologica
AVDP General Requirements

 Number      Requirement
 1           The metadata model must have the ability to identify a feature extraction tool (e.g., name),
                 version and institute (e.g. name of a company or university/affiliation).

 2           The metadata model must have the ability to identify contributors who have participated in the
                 test and their respective roles.
 3           The metadata model should allow results from different extractions being combined if related to
                 a common timeline event.
 4           The metadata model must have provisions for the date and time on which the results of the
                 feature extraction tool were generated.
 5           The metadata model must be able to identify and describe (e.g. “title”, “genre”, “language”) one
                 or more assets or parts of assets (e.g. using a standard identification format), the associated
                 type (e.g. MIME type) and location (e.g. URL), to which the feature extraction relates to.

 6           The metadata model should have the ability to describe the content on multiple timeline (such as
                 video and audio timeline).
 7           The metadata model should be have the ability to add confidence levels attached to the results
                 for each feature extracted by any feature extraction tool at the appropriate level of
                 granularity


Centro Ricerche e Innovazione Tecnologica
AVDP semantics
                                                         TD : TemporalDecomposition            AVS : AudioVisualSegment
          Mpeg7                                          STD: SpatioTemporalDecomposition      AS : AudioSegment
                                                         MSD : MediaSourceDecomposition        VS: VideoSegment
             Description type=“ContentEntityType”        SD: spatialDecomposition              SR : StillRegion
                                                                                               MR : MovingRegion
               MultimediaContent type=“AudioVisualType”
                 AudioVisual
                       TD                   AVS          AVS             AVS            ( An experiment/ criteria=shot )

                       TD                   AVS          AVS             AVS            ( An experiment/criteria=ASR )

                       TD                   AVS          AVS             AVS            ( An experiment/criteria=Face)

                             ( Container)
                                                                          T
                                  T               TD
                                                   TD          AVS-2nd            TD
                                                                                   TD             AVS-3rd T
                        AVS-1st
                                                                          V                                    V
                                                  MSD
                                                   MSD           AVS
                                                                  VS             STD
                                                                                  STD                MR
                                                                                                           VideoText
                                                                 VS       A      TD         VS V
                                Same duration                     AS              TD         VS
           T Text
                                                                AS
                                                               VS-key            TD         AS A
           V Video feature + Text                                                 TD         AS
           A Audio feature + Text                               AS
                                                               AS-key

                                                                 SR
                                                                  SR     V       SD                  SR
                                                                                  SD                           V
                                                               A frame                             ImageText




Centro Ricerche e Innovazione Tecnologica
Implementation example




Centro Ricerche e Innovazione Tecnologica
Conclusions

     AVDP is the new standard reference for low level automatically
     extracted metadata
     Grounded on a thorough requirements analysis made by
     experts of the media domain
         EBU
     Several strategic impacts foreseen
         EBU members Internal projects
         FIMS (Framework for Interoperable Media Services)
     XML Schema of AVDP going to be standardised soon
     Guidelines being prepared by MIM/SCAIE about usage of AVDP
         Stay tuned!




Centro Ricerche e Innovazione Tecnologica
Acknowledgements

     Masanori (Masa) Sano (NHK)
         Excellent skills and knowledge of the MPEG procedural rules
         Continued passion for AVDP
         Very nice person
     Jean-Pierre Evain (EBU)
         Continued support from within EBU Technical
         Co-chaired the MPEG AhG with Masa
         Dissemination and support of AVDP throughout the world
     Werner Bailer (JOANNEUM Research)
         Top-level expertise of MPEG-7 technicalities and definitions
     All MIM/SCAIE members for having supported AVDP
     in its infancy


Centro Ricerche e Innovazione Tecnologica
a.messina@rai.it




                                                                                         Dr. Alberto Messina
                                                                                      R&D Area Coordinator
                                                                         Multimedia Information Engineering
                                                     RAI – Centre for Research and Technological Innovation
                                                                                                Turin (ITALY)




Centro Ricerche e Innovazione Tecnologica

More Related Content

Viewers also liked

SDN API & Unified Coomunications
SDN API & Unified CoomunicationsSDN API & Unified Coomunications
SDN API & Unified CoomunicationsIMTC
 
The Cloud: Enabling Real-time Video Services
The Cloud: Enabling Real-time Video ServicesThe Cloud: Enabling Real-time Video Services
The Cloud: Enabling Real-time Video ServicesIMTC
 
Customer Perspective, Pfizer
Customer Perspective, PfizerCustomer Perspective, Pfizer
Customer Perspective, PfizerIMTC
 
EARLY DAYS OF VIDEO CODING STANDARDIZATION
EARLY DAYS OF VIDEO CODING STANDARDIZATIONEARLY DAYS OF VIDEO CODING STANDARDIZATION
EARLY DAYS OF VIDEO CODING STANDARDIZATIONIMTC
 
IBM - Video Communications - An Enterprise Perspective
IBM - Video Communications - An Enterprise PerspectiveIBM - Video Communications - An Enterprise Perspective
IBM - Video Communications - An Enterprise PerspectiveIMTC
 
Radvision CTO - Yair Wiener, on Interoperability
Radvision CTO - Yair Wiener, on InteroperabilityRadvision CTO - Yair Wiener, on Interoperability
Radvision CTO - Yair Wiener, on InteroperabilityIMTC
 
Spatial Conferencing
Spatial ConferencingSpatial Conferencing
Spatial ConferencingIMTC
 
Telepresence Testing Approach by Shenick
Telepresence Testing Approach by ShenickTelepresence Testing Approach by Shenick
Telepresence Testing Approach by ShenickIMTC
 
IMTC - CTO Roundtable 2011
IMTC - CTO Roundtable 2011IMTC - CTO Roundtable 2011
IMTC - CTO Roundtable 2011IMTC
 
Stefan slivinski lifesize video coding
Stefan slivinski lifesize video coding Stefan slivinski lifesize video coding
Stefan slivinski lifesize video coding IMTC
 
Alicia Abella - Social TV Collaboration Reaserch
Alicia Abella - Social TV Collaboration ReaserchAlicia Abella - Social TV Collaboration Reaserch
Alicia Abella - Social TV Collaboration ReaserchIMTC
 
Video Coding - Past, Present & Future
Video Coding - Past, Present & FutureVideo Coding - Past, Present & Future
Video Coding - Past, Present & FutureIMTC
 

Viewers also liked (12)

SDN API & Unified Coomunications
SDN API & Unified CoomunicationsSDN API & Unified Coomunications
SDN API & Unified Coomunications
 
The Cloud: Enabling Real-time Video Services
The Cloud: Enabling Real-time Video ServicesThe Cloud: Enabling Real-time Video Services
The Cloud: Enabling Real-time Video Services
 
Customer Perspective, Pfizer
Customer Perspective, PfizerCustomer Perspective, Pfizer
Customer Perspective, Pfizer
 
EARLY DAYS OF VIDEO CODING STANDARDIZATION
EARLY DAYS OF VIDEO CODING STANDARDIZATIONEARLY DAYS OF VIDEO CODING STANDARDIZATION
EARLY DAYS OF VIDEO CODING STANDARDIZATION
 
IBM - Video Communications - An Enterprise Perspective
IBM - Video Communications - An Enterprise PerspectiveIBM - Video Communications - An Enterprise Perspective
IBM - Video Communications - An Enterprise Perspective
 
Radvision CTO - Yair Wiener, on Interoperability
Radvision CTO - Yair Wiener, on InteroperabilityRadvision CTO - Yair Wiener, on Interoperability
Radvision CTO - Yair Wiener, on Interoperability
 
Spatial Conferencing
Spatial ConferencingSpatial Conferencing
Spatial Conferencing
 
Telepresence Testing Approach by Shenick
Telepresence Testing Approach by ShenickTelepresence Testing Approach by Shenick
Telepresence Testing Approach by Shenick
 
IMTC - CTO Roundtable 2011
IMTC - CTO Roundtable 2011IMTC - CTO Roundtable 2011
IMTC - CTO Roundtable 2011
 
Stefan slivinski lifesize video coding
Stefan slivinski lifesize video coding Stefan slivinski lifesize video coding
Stefan slivinski lifesize video coding
 
Alicia Abella - Social TV Collaboration Reaserch
Alicia Abella - Social TV Collaboration ReaserchAlicia Abella - Social TV Collaboration Reaserch
Alicia Abella - Social TV Collaboration Reaserch
 
Video Coding - Past, Present & Future
Video Coding - Past, Present & FutureVideo Coding - Past, Present & Future
Video Coding - Past, Present & Future
 

Similar to The AudioVisual Description Profile

Software defined radio
Software defined radioSoftware defined radio
Software defined radioDevesh Samaiya
 
Verification of Graphics ASICs (Part II)
Verification of Graphics ASICs (Part II)Verification of Graphics ASICs (Part II)
Verification of Graphics ASICs (Part II)DVClub
 
08 android multimedia_framework_overview
08 android multimedia_framework_overview08 android multimedia_framework_overview
08 android multimedia_framework_overviewArjun Reddy
 
BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012
BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012
BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012MediaEval2012
 
Roslyn compiler as a service
Roslyn compiler as a serviceRoslyn compiler as a service
Roslyn compiler as a serviceEugene Zharkov
 

Similar to The AudioVisual Description Profile (11)

Software defined radio
Software defined radioSoftware defined radio
Software defined radio
 
Yang greenstein part_2
Yang greenstein part_2Yang greenstein part_2
Yang greenstein part_2
 
Verification of Graphics ASICs (Part II)
Verification of Graphics ASICs (Part II)Verification of Graphics ASICs (Part II)
Verification of Graphics ASICs (Part II)
 
Music workflow4
Music workflow4Music workflow4
Music workflow4
 
DDS vs AMQP
DDS vs AMQPDDS vs AMQP
DDS vs AMQP
 
08 android multimedia_framework_overview
08 android multimedia_framework_overview08 android multimedia_framework_overview
08 android multimedia_framework_overview
 
BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012
BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012
BUT2012 APPROACHES FOR SPOKEN WEB SEARCH - MEDIAEVAL 2012
 
Observer ts
Observer tsObserver ts
Observer ts
 
Observer ts
Observer tsObserver ts
Observer ts
 
Observer ts
Observer tsObserver ts
Observer ts
 
Roslyn compiler as a service
Roslyn compiler as a serviceRoslyn compiler as a service
Roslyn compiler as a service
 

More from IMTC

UC SDN
UC SDNUC SDN
UC SDNIMTC
 
VoLTE Testing at IMTC SuperOP 2015 - Open Invitation
VoLTE Testing at IMTC SuperOP 2015 -  Open InvitationVoLTE Testing at IMTC SuperOP 2015 -  Open Invitation
VoLTE Testing at IMTC SuperOP 2015 - Open InvitationIMTC
 
Unified Communications and Software Defined Networks (UC SDN)
Unified Communications and Software Defined Networks (UC SDN)Unified Communications and Software Defined Networks (UC SDN)
Unified Communications and Software Defined Networks (UC SDN)IMTC
 
SIPv6 Test Program
SIPv6 Test ProgramSIPv6 Test Program
SIPv6 Test ProgramIMTC
 
EVS Advances in VoLTE Networks
EVS Advances in VoLTE NetworksEVS Advances in VoLTE Networks
EVS Advances in VoLTE NetworksIMTC
 
WebRTC - Bridging Web and SIP Worlds
WebRTC - Bridging Web and SIP WorldsWebRTC - Bridging Web and SIP Worlds
WebRTC - Bridging Web and SIP WorldsIMTC
 
Predictable Experience for Lync - Meru Networks
Predictable Experience for Lync - Meru NetworksPredictable Experience for Lync - Meru Networks
Predictable Experience for Lync - Meru NetworksIMTC
 
VoLTE & VoMBB The New Era in Voice Services
VoLTE & VoMBB The New Era in Voice ServicesVoLTE & VoMBB The New Era in Voice Services
VoLTE & VoMBB The New Era in Voice ServicesIMTC
 
Test & Certification WG Review, 2014 Member Meeting
Test & Certification WG Review, 2014 Member MeetingTest & Certification WG Review, 2014 Member Meeting
Test & Certification WG Review, 2014 Member MeetingIMTC
 
UC SDN AG Review
UC SDN AG ReviewUC SDN AG Review
UC SDN AG ReviewIMTC
 
Video on the Web is Changing ... massively! VP9 and beyond
Video on the Web is Changing ... massively! VP9 and beyondVideo on the Web is Changing ... massively! VP9 and beyond
Video on the Web is Changing ... massively! VP9 and beyondIMTC
 
What’s Next for Mobile Video
What’s Next for Mobile VideoWhat’s Next for Mobile Video
What’s Next for Mobile VideoIMTC
 
Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...
Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...
Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...IMTC
 
New Video Technologies Defining the Workspace of the Future
New Video Technologies Defining the Workspace of the FutureNew Video Technologies Defining the Workspace of the Future
New Video Technologies Defining the Workspace of the FutureIMTC
 
The Ecosystem A driver for natural collaboration
The Ecosystem A driver for natural collaborationThe Ecosystem A driver for natural collaboration
The Ecosystem A driver for natural collaborationIMTC
 
Optimizing Real Time Interactive Video Delivery from the Cloud
Optimizing Real Time Interactive Video Delivery from the CloudOptimizing Real Time Interactive Video Delivery from the Cloud
Optimizing Real Time Interactive Video Delivery from the CloudIMTC
 
UC SDN Use Case
UC SDN Use CaseUC SDN Use Case
UC SDN Use CaseIMTC
 
SIP Parity Actvity Group & Video Interoperability Review
SIP Parity Actvity Group & Video Interoperability ReviewSIP Parity Actvity Group & Video Interoperability Review
SIP Parity Actvity Group & Video Interoperability ReviewIMTC
 
Wearables
WearablesWearables
WearablesIMTC
 
The MDCT and its Applications in Audio Coding
The MDCT and its Applications in Audio CodingThe MDCT and its Applications in Audio Coding
The MDCT and its Applications in Audio CodingIMTC
 

More from IMTC (20)

UC SDN
UC SDNUC SDN
UC SDN
 
VoLTE Testing at IMTC SuperOP 2015 - Open Invitation
VoLTE Testing at IMTC SuperOP 2015 -  Open InvitationVoLTE Testing at IMTC SuperOP 2015 -  Open Invitation
VoLTE Testing at IMTC SuperOP 2015 - Open Invitation
 
Unified Communications and Software Defined Networks (UC SDN)
Unified Communications and Software Defined Networks (UC SDN)Unified Communications and Software Defined Networks (UC SDN)
Unified Communications and Software Defined Networks (UC SDN)
 
SIPv6 Test Program
SIPv6 Test ProgramSIPv6 Test Program
SIPv6 Test Program
 
EVS Advances in VoLTE Networks
EVS Advances in VoLTE NetworksEVS Advances in VoLTE Networks
EVS Advances in VoLTE Networks
 
WebRTC - Bridging Web and SIP Worlds
WebRTC - Bridging Web and SIP WorldsWebRTC - Bridging Web and SIP Worlds
WebRTC - Bridging Web and SIP Worlds
 
Predictable Experience for Lync - Meru Networks
Predictable Experience for Lync - Meru NetworksPredictable Experience for Lync - Meru Networks
Predictable Experience for Lync - Meru Networks
 
VoLTE & VoMBB The New Era in Voice Services
VoLTE & VoMBB The New Era in Voice ServicesVoLTE & VoMBB The New Era in Voice Services
VoLTE & VoMBB The New Era in Voice Services
 
Test & Certification WG Review, 2014 Member Meeting
Test & Certification WG Review, 2014 Member MeetingTest & Certification WG Review, 2014 Member Meeting
Test & Certification WG Review, 2014 Member Meeting
 
UC SDN AG Review
UC SDN AG ReviewUC SDN AG Review
UC SDN AG Review
 
Video on the Web is Changing ... massively! VP9 and beyond
Video on the Web is Changing ... massively! VP9 and beyondVideo on the Web is Changing ... massively! VP9 and beyond
Video on the Web is Changing ... massively! VP9 and beyond
 
What’s Next for Mobile Video
What’s Next for Mobile VideoWhat’s Next for Mobile Video
What’s Next for Mobile Video
 
Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...
Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...
Development of a 4K Main 10 Profile HEVC Encoder for Great Improvements in Co...
 
New Video Technologies Defining the Workspace of the Future
New Video Technologies Defining the Workspace of the FutureNew Video Technologies Defining the Workspace of the Future
New Video Technologies Defining the Workspace of the Future
 
The Ecosystem A driver for natural collaboration
The Ecosystem A driver for natural collaborationThe Ecosystem A driver for natural collaboration
The Ecosystem A driver for natural collaboration
 
Optimizing Real Time Interactive Video Delivery from the Cloud
Optimizing Real Time Interactive Video Delivery from the CloudOptimizing Real Time Interactive Video Delivery from the Cloud
Optimizing Real Time Interactive Video Delivery from the Cloud
 
UC SDN Use Case
UC SDN Use CaseUC SDN Use Case
UC SDN Use Case
 
SIP Parity Actvity Group & Video Interoperability Review
SIP Parity Actvity Group & Video Interoperability ReviewSIP Parity Actvity Group & Video Interoperability Review
SIP Parity Actvity Group & Video Interoperability Review
 
Wearables
WearablesWearables
Wearables
 
The MDCT and its Applications in Audio Coding
The MDCT and its Applications in Audio CodingThe MDCT and its Applications in Audio Coding
The MDCT and its Applications in Audio Coding
 

Recently uploaded

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 

Recently uploaded (20)

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 

The AudioVisual Description Profile

  • 1. The AudioVisual Description Profile a.k.a. ISO/IEC 15938-9:2005/Amd.1 a.k.a. MPEG-7 AVDP profile a.k.a. the EBU MPEG-7 profile a.k.a. the ultimate metadata profile a.k.a. … Dr. Alberto Messina R&D Area Coordinator Multimedia Information Engineering RAI – Centre for Research and Technological Innovation Turin (ITALY) Centro Ricerche e Innovazione Tecnologica
  • 2. Why a new MPEG-7 profile? Existing profiles insufficient for a number of reasons Centro Ricerche e Innovazione Tecnologica
  • 3. MPEG-7 - AVDP Origin AVDP was originated in EBU in the context of the technical group MIM/SCAIE concerned with the study of automatic techniques in media production Requirements Target application analysis was the starting point to define AVDP requirements Content Summarisation Text Recognition Semantic Segmentation Copy/Repetition detection Personality Identification Keywords Extraction Subject Classification Centro Ricerche e Innovazione Tecnologica
  • 4. Sources of the work and standardisation process JOANNEUM Research’s Detailed AudioVisual Profile NHK’s Metadata Production Framework Process Proposed to MPEG in July 2010 Went through standardisation process in 2011 PDAM DAM FDAM Officialised as a standard in April 2012 Part 11 (Schema) is now at its final stages too Centro Ricerche e Innovazione Tecnologica
  • 5. AVDP General Requirements Number Requirement 1 The metadata model must have the ability to identify a feature extraction tool (e.g., name), version and institute (e.g. name of a company or university/affiliation). 2 The metadata model must have the ability to identify contributors who have participated in the test and their respective roles. 3 The metadata model should allow results from different extractions being combined if related to a common timeline event. 4 The metadata model must have provisions for the date and time on which the results of the feature extraction tool were generated. 5 The metadata model must be able to identify and describe (e.g. “title”, “genre”, “language”) one or more assets or parts of assets (e.g. using a standard identification format), the associated type (e.g. MIME type) and location (e.g. URL), to which the feature extraction relates to. 6 The metadata model should have the ability to describe the content on multiple timeline (such as video and audio timeline). 7 The metadata model should be have the ability to add confidence levels attached to the results for each feature extracted by any feature extraction tool at the appropriate level of granularity Centro Ricerche e Innovazione Tecnologica
  • 6. AVDP semantics TD : TemporalDecomposition AVS : AudioVisualSegment Mpeg7 STD: SpatioTemporalDecomposition AS : AudioSegment MSD : MediaSourceDecomposition VS: VideoSegment Description type=“ContentEntityType” SD: spatialDecomposition SR : StillRegion MR : MovingRegion MultimediaContent type=“AudioVisualType” AudioVisual TD AVS AVS AVS ( An experiment/ criteria=shot ) TD AVS AVS AVS ( An experiment/criteria=ASR ) TD AVS AVS AVS ( An experiment/criteria=Face) ( Container) T T TD TD AVS-2nd TD TD AVS-3rd T AVS-1st V V MSD MSD AVS VS STD STD MR VideoText VS A TD VS V Same duration AS TD VS T Text AS VS-key TD AS A V Video feature + Text TD AS A Audio feature + Text AS AS-key SR SR V SD SR SD V A frame ImageText Centro Ricerche e Innovazione Tecnologica
  • 7. Implementation example Centro Ricerche e Innovazione Tecnologica
  • 8. Conclusions AVDP is the new standard reference for low level automatically extracted metadata Grounded on a thorough requirements analysis made by experts of the media domain EBU Several strategic impacts foreseen EBU members Internal projects FIMS (Framework for Interoperable Media Services) XML Schema of AVDP going to be standardised soon Guidelines being prepared by MIM/SCAIE about usage of AVDP Stay tuned! Centro Ricerche e Innovazione Tecnologica
  • 9. Acknowledgements Masanori (Masa) Sano (NHK) Excellent skills and knowledge of the MPEG procedural rules Continued passion for AVDP Very nice person Jean-Pierre Evain (EBU) Continued support from within EBU Technical Co-chaired the MPEG AhG with Masa Dissemination and support of AVDP throughout the world Werner Bailer (JOANNEUM Research) Top-level expertise of MPEG-7 technicalities and definitions All MIM/SCAIE members for having supported AVDP in its infancy Centro Ricerche e Innovazione Tecnologica
  • 10. a.messina@rai.it Dr. Alberto Messina R&D Area Coordinator Multimedia Information Engineering RAI – Centre for Research and Technological Innovation Turin (ITALY) Centro Ricerche e Innovazione Tecnologica