SlideShare uma empresa Scribd logo
1 de 12
Baixar para ler offline
FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene
Detection and Machine Learning
Anatoliy Zabrovskiy, Prateek Agrawal, Christian Timmerer, Radu Prodan
The 30th IEEE International Conference
of the Open Innovations Association
FRUCT
October 27-29, 2021 Oulu, Finland.
FAUST Approach. Goal
Goal:
- Develop Fast Approach for Per-Scene Encoding using Scene Detection and
Machine Learning (FAUST)
FAUST approach is based on four phases:
1) Fast Entropy based scene detection;
2) ANN based YPSNR quality prediction;
3) Convex Hull calculation and interpolation;
4) Per-scene encoding optimization.
2
1. Scene
detection
2. Quality
prediction
3. Convex Hull,
Interpolation
4. Per-scene
optimization
3
Fast Entropy based scene detection (phase 1)
The fast entropy based scene detection method includes the following steps:
1. The input video is encoded to low bitrate and low resolution (100 Kbps, 144p) using FFmpeg and x264
ultrafast encoding preset.
2. The Temporal Information (TI) and Spatial Information (SI) metrics are calculated for the encoded video
sequence.
3. Then the FAUST approach detects the scenes using the TI metrics calculated on the previous step.
Input
Video
- encoding preset: ultrafast
- encoding bitrate: 100kbps
- resolution: 144p
FFmpeg
Encoded
Video
Calculating TI
and SI metrics
SITI program
- avg. TI entropy (per second)
- avg. TI (per second)
- scene detection
FAUST program
Scenes
Difference
threshold
4
ANN based YPSNR quality prediction (phase 2)
Scene 1
Scene 2
Scene 3
Scene n
Middle
segment
Middle
segment
Middle
segment
Middle
segment
● video height;
● video width;
● encoding bitrate;
● encoding preset;
● SI (for 144p video);
● TI (for 144p video);
● input_segment_size (in bytes);
● 144p_segment_size (in bytes);
● segment duration;
● fps.
Predicting YPSNR
using ANN
Segment
no.
Bitrate Resolution YPSNR
1 1000 432p 35
1 2300 720p 37
1 4300 1080p 38
1 7000 1440p 40
1 12000 2160p 43
... ... ... ...
For each middle segment for all scenes, the ANN
predicts YPSNR values for all possible
combinations of resolutions and bitrates from the
static bitrate ladder.
2 sec.
2 sec.
2 sec.
2 sec.
Static/classic
bitrate ladder
5
ANN based YPSNR quality prediction. Results
Based on the results:
The developed ANN model on testing data is able to predict the
YPSNR metric with low mean absolute (MAE) and mean square
errors (MSE) of 0.15 and 0.08, respectively.
The results with various possible combinations of ANN input
parameters are presented in the table.
6
Convex Hull calculation and interpolation (phase 3)
Fig. 1. Convex Hull. Tears of Steel video, scene 4. Fig. 2. Interpolated Convex Hull. Tears of
Steel video, scene 4.
7
Per-scene encoding optimization (phase 4)
Steps:
● For each scene the FAUST approach selects all
points on the interpolated Convex Hull which
belong to YPSNR range [30 dB, 45 dB].
● It uses the 1.5 YPSNR spacing to calculate the
number of bitrate/resolution pairs for each
scene.
● For the selected YPSNR points, the FAUST
approach finds the appropriate bitrates and
resolutions using interpolated Convex Hulls.
Results and analysis. Fast scene detection
8
The total scene detection time shows that the FAUST approach detects video scenes
almost three times faster than FFmpeg and more than three times faster than
PySceneDetect tool.
Performance analysis with classic bitrate ladder
9
The proposed FAUST approach
improves both the bitrate reduction
and the overall video quality.
State-of-the-art comparison
10
● The video scene detection time using
our FAUST approach (9.9 s) is more
than three times faster than the
MiPSO framework algorithm, i.e. 34 s.
● With our FAUST approach, there is no
need to run multiple tests (or trial)
encoding to build a convex hull.
11
Conclusions
The key advantages of the FAUST approach:
● A fast scene detection using entropy based method.
● No need to run test encodings to build Convex Hulls. The developed
ANN model on testing data is able to predict the YPSNR metric with
low mean absolute (MAE) and mean square errors (MSE) of 0.15 and
0.08, respectively.
● Per-scene bitrate ladders.
Thank you!
12
Anatoliy Zabrovskiy
anatoliy.zabrovskiy@aau.at

Mais conteúdo relacionado

Mais procurados

ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video StreamingES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video StreamingAlpen-Adria-Universität
 
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video StreamingCSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video StreamingAlpen-Adria-Universität
 
Quality impact of scalable video coding tunneling for media aware content del...
Quality impact of scalable video coding tunneling for media aware content del...Quality impact of scalable video coding tunneling for media aware content del...
Quality impact of scalable video coding tunneling for media aware content del...Alpen-Adria-Universität
 
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...Alpen-Adria-Universität
 
LwTE: Light-weight Transcoding at the Edge
LwTE: Light-weight Transcoding at the EdgeLwTE: Light-weight Transcoding at the Edge
LwTE: Light-weight Transcoding at the EdgeAlpen-Adria-Universität
 
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive StreamingMiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive StreamingAlpen-Adria-Universität
 
H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...
H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...
H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...Alpen-Adria-Universität
 
Policy-driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-driven Dynamic HTTP Adaptive Streaming Player EnvironmentPolicy-driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-driven Dynamic HTTP Adaptive Streaming Player EnvironmentMinh Nguyen
 
Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming
Machine Learning Based Video Coding Enhancements for HTTP Adaptive StreamingMachine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming
Machine Learning Based Video Coding Enhancements for HTTP Adaptive StreamingAlpen-Adria-Universität
 
On the Impact of Viewing Distance on Perceived Video Quality
On the Impact of Viewing Distance on Perceived Video QualityOn the Impact of Viewing Distance on Perceived Video Quality
On the Impact of Viewing Distance on Perceived Video QualityAlpen-Adria-Universität
 
A Distributed Delivery Architecture for User Generated Content Live Streaming...
A Distributed Delivery Architecture for User Generated Content Live Streaming...A Distributed Delivery Architecture for User Generated Content Live Streaming...
A Distributed Delivery Architecture for User Generated Content Live Streaming...Alpen-Adria-Universität
 
Bandwidth Prediction in Low-Latency Chunked Streaming
Bandwidth Prediction in Low-Latency Chunked StreamingBandwidth Prediction in Low-Latency Chunked Streaming
Bandwidth Prediction in Low-Latency Chunked StreamingAlpen-Adria-Universität
 
CAdViSE or how to find the Sweet Spots of ABR Systems
CAdViSE or how to find the Sweet Spots of ABR SystemsCAdViSE or how to find the Sweet Spots of ABR Systems
CAdViSE or how to find the Sweet Spots of ABR SystemsAlpen-Adria-Universität
 
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...Alpen-Adria-Universität
 
Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...
Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...
Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...Syuan Wang
 
On Optimizing Resource Utilization in AVC-based Real-time Video Streaming
On Optimizing Resource Utilization in AVC-based Real-time Video StreamingOn Optimizing Resource Utilization in AVC-based Real-time Video Streaming
On Optimizing Resource Utilization in AVC-based Real-time Video StreamingAlpen-Adria-Universität
 
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...Alpen-Adria-Universität
 
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...Alpen-Adria-Universität
 

Mais procurados (20)

ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video StreamingES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
 
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video StreamingCSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
 
Quality impact of scalable video coding tunneling for media aware content del...
Quality impact of scalable video coding tunneling for media aware content del...Quality impact of scalable video coding tunneling for media aware content del...
Quality impact of scalable video coding tunneling for media aware content del...
 
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
 
20 Years of Streaming in 20 Minutes
20 Years of Streaming in 20 Minutes20 Years of Streaming in 20 Minutes
20 Years of Streaming in 20 Minutes
 
LwTE: Light-weight Transcoding at the Edge
LwTE: Light-weight Transcoding at the EdgeLwTE: Light-weight Transcoding at the Edge
LwTE: Light-weight Transcoding at the Edge
 
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive StreamingMiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
MiPSO: Multi-Period Per-Scene Optimization For HTTP Adaptive Streaming
 
H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...
H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...
H2BR: An HTTP/2-based Retransmission Technique to Improve the QoE of Adaptive...
 
Policy-driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-driven Dynamic HTTP Adaptive Streaming Player EnvironmentPolicy-driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-driven Dynamic HTTP Adaptive Streaming Player Environment
 
Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming
Machine Learning Based Video Coding Enhancements for HTTP Adaptive StreamingMachine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming
Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming
 
On the Impact of Viewing Distance on Perceived Video Quality
On the Impact of Viewing Distance on Perceived Video QualityOn the Impact of Viewing Distance on Perceived Video Quality
On the Impact of Viewing Distance on Perceived Video Quality
 
A Distributed Delivery Architecture for User Generated Content Live Streaming...
A Distributed Delivery Architecture for User Generated Content Live Streaming...A Distributed Delivery Architecture for User Generated Content Live Streaming...
A Distributed Delivery Architecture for User Generated Content Live Streaming...
 
Bandwidth Prediction in Low-Latency Chunked Streaming
Bandwidth Prediction in Low-Latency Chunked StreamingBandwidth Prediction in Low-Latency Chunked Streaming
Bandwidth Prediction in Low-Latency Chunked Streaming
 
CAdViSE or how to find the Sweet Spots of ABR Systems
CAdViSE or how to find the Sweet Spots of ABR SystemsCAdViSE or how to find the Sweet Spots of ABR Systems
CAdViSE or how to find the Sweet Spots of ABR Systems
 
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
 
PEMWN'21 - ANGELA
PEMWN'21 - ANGELAPEMWN'21 - ANGELA
PEMWN'21 - ANGELA
 
Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...
Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...
Labmeeting - 20150831 - Overhead and Performance of Low Latency Live Streamin...
 
On Optimizing Resource Utilization in AVC-based Real-time Video Streaming
On Optimizing Resource Utilization in AVC-based Real-time Video StreamingOn Optimizing Resource Utilization in AVC-based Real-time Video Streaming
On Optimizing Resource Utilization in AVC-based Real-time Video Streaming
 
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
 
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
 

Semelhante a FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning

Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...Alpen-Adria-Universität
 
Video capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraintVideo capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraintShivaditya Jatar
 
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...Vignesh V Menon
 
Machine Learning approaches at video compression
Machine Learning approaches at video compression Machine Learning approaches at video compression
Machine Learning approaches at video compression Roberto Iacoviello
 
Green Variable framerate encoding for Adaptive Live Streaming
Green Variable framerate encoding  for Adaptive Live StreamingGreen Variable framerate encoding  for Adaptive Live Streaming
Green Variable framerate encoding for Adaptive Live StreamingVignesh V Menon
 
H04011 04 5361
H04011 04 5361H04011 04 5361
H04011 04 5361IJMER
 
AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...
AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...
AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...VLSICS Design
 
Machine learning-based energy consumption modeling and comparing of H.264 and...
Machine learning-based energy consumption modeling and comparing of H.264 and...Machine learning-based energy consumption modeling and comparing of H.264 and...
Machine learning-based energy consumption modeling and comparing of H.264 and...IJECEIAES
 
Objective Evaluation of Video Quality
Objective Evaluation of Video QualityObjective Evaluation of Video Quality
Objective Evaluation of Video QualityAnton Venema
 
Iec 62676 5 standardized spezification of image quality
Iec 62676 5 standardized spezification of image qualityIec 62676 5 standardized spezification of image quality
Iec 62676 5 standardized spezification of image qualityHenry Rutjes
 
Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...MD Abdullah Al Nasim
 
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss RateEvaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss RateJorge E. López de Vergara Méndez
 
Difference between Interlaced & progressive scanning
Difference between Interlaced & progressive scanningDifference between Interlaced & progressive scanning
Difference between Interlaced & progressive scanningaibad ahmed
 
LiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdfLiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdfVignesh V Menon
 
Compression presentation 415 (1)
Compression presentation 415 (1)Compression presentation 415 (1)
Compression presentation 415 (1)Godo Dodo
 
Information hiding in edge location of video using amalgamate fft and cubic s...
Information hiding in edge location of video using amalgamate fft and cubic s...Information hiding in edge location of video using amalgamate fft and cubic s...
Information hiding in edge location of video using amalgamate fft and cubic s...IAEME Publication
 

Semelhante a FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning (20)

Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
 
Video capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraintVideo capacity of WLANs with a multiuser perceptual quality constraint
Video capacity of WLANs with a multiuser perceptual quality constraint
 
A novel approach to record sound
A novel approach to record soundA novel approach to record sound
A novel approach to record sound
 
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
 
Machine Learning approaches at video compression
Machine Learning approaches at video compression Machine Learning approaches at video compression
Machine Learning approaches at video compression
 
JASLA_presentation.pdf
JASLA_presentation.pdfJASLA_presentation.pdf
JASLA_presentation.pdf
 
Green Variable framerate encoding for Adaptive Live Streaming
Green Variable framerate encoding  for Adaptive Live StreamingGreen Variable framerate encoding  for Adaptive Live Streaming
Green Variable framerate encoding for Adaptive Live Streaming
 
H04011 04 5361
H04011 04 5361H04011 04 5361
H04011 04 5361
 
AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...
AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...
AREA OPTIMIZED FPGA IMPLEMENTATION FOR GENERATION OF RADAR PULSE COM-PRESSION...
 
Machine learning-based energy consumption modeling and comparing of H.264 and...
Machine learning-based energy consumption modeling and comparing of H.264 and...Machine learning-based energy consumption modeling and comparing of H.264 and...
Machine learning-based energy consumption modeling and comparing of H.264 and...
 
High contrast
High contrastHigh contrast
High contrast
 
40120140504006
4012014050400640120140504006
40120140504006
 
Objective Evaluation of Video Quality
Objective Evaluation of Video QualityObjective Evaluation of Video Quality
Objective Evaluation of Video Quality
 
Iec 62676 5 standardized spezification of image quality
Iec 62676 5 standardized spezification of image qualityIec 62676 5 standardized spezification of image quality
Iec 62676 5 standardized spezification of image quality
 
Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...
 
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss RateEvaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
Evaluating Quality of Experience in IPTV Services Using MPEG Frame Loss Rate
 
Difference between Interlaced & progressive scanning
Difference between Interlaced & progressive scanningDifference between Interlaced & progressive scanning
Difference between Interlaced & progressive scanning
 
LiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdfLiveVBR presentation at VQEG NORM.pdf
LiveVBR presentation at VQEG NORM.pdf
 
Compression presentation 415 (1)
Compression presentation 415 (1)Compression presentation 415 (1)
Compression presentation 415 (1)
 
Information hiding in edge location of video using amalgamate fft and cubic s...
Information hiding in edge location of video using amalgamate fft and cubic s...Information hiding in edge location of video using amalgamate fft and cubic s...
Information hiding in edge location of video using amalgamate fft and cubic s...
 

Mais de Alpen-Adria-Universität

VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instancesVEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instancesAlpen-Adria-Universität
 
GREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video ProcessingGREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video ProcessingAlpen-Adria-Universität
 
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...Alpen-Adria-Universität
 
VEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission PredictionVEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission PredictionAlpen-Adria-Universität
 
Content-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive StreamingContent-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive StreamingAlpen-Adria-Universität
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...Alpen-Adria-Universität
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...Alpen-Adria-Universität
 
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...Alpen-Adria-Universität
 
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Alpen-Adria-Universität
 
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...Alpen-Adria-Universität
 
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamEvaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamAlpen-Adria-Universität
 
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...Alpen-Adria-Universität
 
Multi-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video StreamingMulti-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video StreamingAlpen-Adria-Universität
 
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player EnvironmentPolicy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player EnvironmentAlpen-Adria-Universität
 
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...Alpen-Adria-Universität
 
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and StrategiesEnergy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and StrategiesAlpen-Adria-Universität
 
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...Alpen-Adria-Universität
 
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine LearningVideo Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine LearningAlpen-Adria-Universität
 
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...Alpen-Adria-Universität
 
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming ApplicationsSARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming ApplicationsAlpen-Adria-Universität
 

Mais de Alpen-Adria-Universität (20)

VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instancesVEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
 
GREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video ProcessingGREEM: An Open-Source Energy Measurement Tool for Video Processing
GREEM: An Open-Source Energy Measurement Tool for Video Processing
 
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
 
VEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission PredictionVEEP: Video Encoding Energy and CO₂ Emission Prediction
VEEP: Video Encoding Energy and CO₂ Emission Prediction
 
Content-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive StreamingContent-adaptive Video Coding for HTTP Adaptive Streaming
Content-adaptive Video Coding for HTTP Adaptive Streaming
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...Empowerment of Atypical Viewers  via Low-Effort Personalized Modeling  of Vid...
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
 
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...Optimizing Video Streaming  for Sustainability and Quality: The Role of Prese...
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
 
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
 
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
 
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming StreamEvaluation of Quality of Experience of ABR Schemes in Gaming Stream
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
 
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
 
Multi-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video StreamingMulti-access Edge Computing for Adaptive Video Streaming
Multi-access Edge Computing for Adaptive Video Streaming
 
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player EnvironmentPolicy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
 
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
 
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and StrategiesEnergy Consumption in Video Streaming: Components, Measurements, and Strategies
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
 
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
 
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine LearningVideo Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
 
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...Optimizing  QoE and Latency of  Live Video Streaming Using  Edge Computing  a...
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
 
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming ApplicationsSARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
 

Último

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Último (20)

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning

  • 1. FAUST: Fast Per-Scene Encoding Using Entropy-Based Scene Detection and Machine Learning Anatoliy Zabrovskiy, Prateek Agrawal, Christian Timmerer, Radu Prodan The 30th IEEE International Conference of the Open Innovations Association FRUCT October 27-29, 2021 Oulu, Finland.
  • 2. FAUST Approach. Goal Goal: - Develop Fast Approach for Per-Scene Encoding using Scene Detection and Machine Learning (FAUST) FAUST approach is based on four phases: 1) Fast Entropy based scene detection; 2) ANN based YPSNR quality prediction; 3) Convex Hull calculation and interpolation; 4) Per-scene encoding optimization. 2 1. Scene detection 2. Quality prediction 3. Convex Hull, Interpolation 4. Per-scene optimization
  • 3. 3 Fast Entropy based scene detection (phase 1) The fast entropy based scene detection method includes the following steps: 1. The input video is encoded to low bitrate and low resolution (100 Kbps, 144p) using FFmpeg and x264 ultrafast encoding preset. 2. The Temporal Information (TI) and Spatial Information (SI) metrics are calculated for the encoded video sequence. 3. Then the FAUST approach detects the scenes using the TI metrics calculated on the previous step. Input Video - encoding preset: ultrafast - encoding bitrate: 100kbps - resolution: 144p FFmpeg Encoded Video Calculating TI and SI metrics SITI program - avg. TI entropy (per second) - avg. TI (per second) - scene detection FAUST program Scenes Difference threshold
  • 4. 4 ANN based YPSNR quality prediction (phase 2) Scene 1 Scene 2 Scene 3 Scene n Middle segment Middle segment Middle segment Middle segment ● video height; ● video width; ● encoding bitrate; ● encoding preset; ● SI (for 144p video); ● TI (for 144p video); ● input_segment_size (in bytes); ● 144p_segment_size (in bytes); ● segment duration; ● fps. Predicting YPSNR using ANN Segment no. Bitrate Resolution YPSNR 1 1000 432p 35 1 2300 720p 37 1 4300 1080p 38 1 7000 1440p 40 1 12000 2160p 43 ... ... ... ... For each middle segment for all scenes, the ANN predicts YPSNR values for all possible combinations of resolutions and bitrates from the static bitrate ladder. 2 sec. 2 sec. 2 sec. 2 sec. Static/classic bitrate ladder
  • 5. 5 ANN based YPSNR quality prediction. Results Based on the results: The developed ANN model on testing data is able to predict the YPSNR metric with low mean absolute (MAE) and mean square errors (MSE) of 0.15 and 0.08, respectively. The results with various possible combinations of ANN input parameters are presented in the table.
  • 6. 6 Convex Hull calculation and interpolation (phase 3) Fig. 1. Convex Hull. Tears of Steel video, scene 4. Fig. 2. Interpolated Convex Hull. Tears of Steel video, scene 4.
  • 7. 7 Per-scene encoding optimization (phase 4) Steps: ● For each scene the FAUST approach selects all points on the interpolated Convex Hull which belong to YPSNR range [30 dB, 45 dB]. ● It uses the 1.5 YPSNR spacing to calculate the number of bitrate/resolution pairs for each scene. ● For the selected YPSNR points, the FAUST approach finds the appropriate bitrates and resolutions using interpolated Convex Hulls.
  • 8. Results and analysis. Fast scene detection 8 The total scene detection time shows that the FAUST approach detects video scenes almost three times faster than FFmpeg and more than three times faster than PySceneDetect tool.
  • 9. Performance analysis with classic bitrate ladder 9 The proposed FAUST approach improves both the bitrate reduction and the overall video quality.
  • 10. State-of-the-art comparison 10 ● The video scene detection time using our FAUST approach (9.9 s) is more than three times faster than the MiPSO framework algorithm, i.e. 34 s. ● With our FAUST approach, there is no need to run multiple tests (or trial) encoding to build a convex hull.
  • 11. 11 Conclusions The key advantages of the FAUST approach: ● A fast scene detection using entropy based method. ● No need to run test encodings to build Convex Hulls. The developed ANN model on testing data is able to predict the YPSNR metric with low mean absolute (MAE) and mean square errors (MSE) of 0.15 and 0.08, respectively. ● Per-scene bitrate ladders.