SlideShare a Scribd company logo
1 of 11
Government Flight Analysis
TEAM 3:
VYSHAK SRISHYLAPPA
VIVEK KUMAR
SANKALP JADON
Business cases/problem statement:
 Crowded airspace becoming unpredictable.
 Rescheduling of critical government air space operations because of delays
 Problems in liaisoning between US military and the civilian Air Traffic Control because of sudden delays.
 Bad customer satisfaction for US residents.
 Sudden surge/decrease in the airfare.
 Solution :
 Delay Prediction
 Average Price Prediction
 Flight cancellation Prediction
 Flight Recommendation
Data
 We have gathered the data from Statistical Computing
Statistical Graphics section of American Statistical Association
Website.
 Data had around 5 million rows and 25 columns.
 We processed our prediction on .5 million rows.
 Recommendation: we ran the matchbox recommendation
algorithm against 35,000 reviews who had reviewed the
airline carriers.
Flight Cancellation classification
Model Accuracy Precision
Two Class Logistic Regression 0.978 0.565
Two Class Neural Network 0.980 0.756
Two Class Boosted DecisionTree 0.982 0.758
Two Class Decision Forest 0.980 0.591
Two Class Decision Jungle 0.981 0.872
• Classification done on the Cancelled Column of the dataset. 0 stands for not cancelled and 1 for
cancelled.
• Two Class Boosted Decision Tree gives better accuracy.
• Weather data was scraped from wunderground website.
• On Feature Selection, we selected flightnum, hour, temperature, visibility and sea level pressure as
the variables that help in better prediction.
Arrival Delay Prediction
 Based on feature selection, used- hour, flight number, day of the month, visibility, day of week and
departure delay to train various regression models.
 Used Linear regression, boosted decision tree, Neural Network, and Decision Forest.
 Concluded that the prediction required even more features like like mechanical issues, airport
congestion, etc. which were not present in the dataset.
 Found that Boosted decision tree was the best algorithm amongst all.
Flight Delay Models Matrix
Linear
Regression
Neural
Networks
Decision
Forest
Regression
Boosted
Decision
Tree
MAE 15.50 13.50 11.68 12.38
RMSE 27.70 18.83 17.23 17.32
Relative
Absolute Error
0.57 0.47 0.42 0.45
Relative
Squared Error
0.38 0.17 0.14 0.15
Coefficient 0.61 0.84 0.85 0.84
Visualization
Average Price Prediction
 Predict the average price of
flights, depending on
destination address.
 Predict average ticket price
according to Flight Carrier.
 We found Boosted Decision
Tree to be the best model
among all others.
Air Carrier Recommendation
 We are using the Microsoft Azure
recommendation System to get the
related Airlines carriers.
 The dataset is trained on UserName,
Airlines carrier and their ratings.
Thank You !!

More Related Content

What's hot

Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Simplilearn
 
Data mining project presentation
Data mining project presentationData mining project presentation
Data mining project presentation
Kaiwen Qi
 

What's hot (20)

Flight delay detection data mining project
Flight delay detection data mining projectFlight delay detection data mining project
Flight delay detection data mining project
 
Keras CNN Pre-trained Deep Learning models for Flower Recognition
Keras CNN Pre-trained Deep Learning models for Flower RecognitionKeras CNN Pre-trained Deep Learning models for Flower Recognition
Keras CNN Pre-trained Deep Learning models for Flower Recognition
 
automatic classification in information retrieval
automatic classification in information retrievalautomatic classification in information retrieval
automatic classification in information retrieval
 
The Titanic - machine learning from disaster
The Titanic - machine learning from disasterThe Titanic - machine learning from disaster
The Titanic - machine learning from disaster
 
Structuring Big Data
Structuring Big DataStructuring Big Data
Structuring Big Data
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
 
Heart disease prediction
Heart disease predictionHeart disease prediction
Heart disease prediction
 
Credit card fraud detection through machine learning
Credit card fraud detection through machine learningCredit card fraud detection through machine learning
Credit card fraud detection through machine learning
 
Birch Algorithm With Solved Example
Birch Algorithm With Solved ExampleBirch Algorithm With Solved Example
Birch Algorithm With Solved Example
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
Artificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competitionArtificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competition
 
Data mining project presentation
Data mining project presentationData mining project presentation
Data mining project presentation
 
Titanic Survival Prediction Using Machine Learning
Titanic Survival Prediction Using Machine LearningTitanic Survival Prediction Using Machine Learning
Titanic Survival Prediction Using Machine Learning
 
Airline reservation system db design
Airline reservation system db designAirline reservation system db design
Airline reservation system db design
 
Big Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysBig Data to avoid weather related flight delays
Big Data to avoid weather related flight delays
 
Chronic Kidney Disease Prediction
Chronic Kidney Disease PredictionChronic Kidney Disease Prediction
Chronic Kidney Disease Prediction
 
Unsupervised learning
Unsupervised learningUnsupervised learning
Unsupervised learning
 
Titanic survivor prediction ppt (5)
Titanic survivor prediction ppt (5)Titanic survivor prediction ppt (5)
Titanic survivor prediction ppt (5)
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation
 
Final ppt
Final pptFinal ppt
Final ppt
 

Viewers also liked

Flight Arrival Delay Prediction
Flight Arrival Delay PredictionFlight Arrival Delay Prediction
Flight Arrival Delay Prediction
Shabnam Abghari
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
Kun Le
 
mavdumplog_machine_learning_2016
mavdumplog_machine_learning_2016mavdumplog_machine_learning_2016
mavdumplog_machine_learning_2016
Nancy Abramson
 

Viewers also liked (8)

Big Data For Flight Delay Report
Big Data For Flight Delay ReportBig Data For Flight Delay Report
Big Data For Flight Delay Report
 
BIG DATA TO AVOID WEATHER RELATED FLIGHT DELAYS PPT
BIG DATA TO AVOID WEATHER RELATED FLIGHT DELAYS PPTBIG DATA TO AVOID WEATHER RELATED FLIGHT DELAYS PPT
BIG DATA TO AVOID WEATHER RELATED FLIGHT DELAYS PPT
 
Flight Arrival Delay Prediction
Flight Arrival Delay PredictionFlight Arrival Delay Prediction
Flight Arrival Delay Prediction
 
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
 
Supporting Flight Test And Flight Matching
Supporting Flight Test And Flight MatchingSupporting Flight Test And Flight Matching
Supporting Flight Test And Flight Matching
 
Phase1review
Phase1reviewPhase1review
Phase1review
 
mavdumplog_machine_learning_2016
mavdumplog_machine_learning_2016mavdumplog_machine_learning_2016
mavdumplog_machine_learning_2016
 

Similar to Flight Delay Prediction

Predicting flight cancellation likelihood
Predicting flight cancellation likelihoodPredicting flight cancellation likelihood
Predicting flight cancellation likelihood
Aashish Jain
 
KNN and regression Tree
KNN and regression TreeKNN and regression Tree
KNN and regression Tree
Asmar Farooq
 
DOC245-20240219-WA0000_240219_090212.pdf
DOC245-20240219-WA0000_240219_090212.pdfDOC245-20240219-WA0000_240219_090212.pdf
DOC245-20240219-WA0000_240219_090212.pdf
ShaizaanKhan
 
Air Travel Analytics in SAS
Air Travel Analytics in SASAir Travel Analytics in SAS
Air Travel Analytics in SAS
Rohan Nanda
 
Matrix Mapper in Aviation Industry
Matrix Mapper in Aviation IndustryMatrix Mapper in Aviation Industry
Matrix Mapper in Aviation Industry
aarkentechnologies
 
Passenger forecasting at KLM
Passenger forecasting at KLMPassenger forecasting at KLM
Passenger forecasting at KLM
BigData Republic
 

Similar to Flight Delay Prediction (20)

IRJET- Study of Prediction Algorithms on Aviation Accident Dataset using Rapi...
IRJET- Study of Prediction Algorithms on Aviation Accident Dataset using Rapi...IRJET- Study of Prediction Algorithms on Aviation Accident Dataset using Rapi...
IRJET- Study of Prediction Algorithms on Aviation Accident Dataset using Rapi...
 
Predicting flight cancellation likelihood
Predicting flight cancellation likelihoodPredicting flight cancellation likelihood
Predicting flight cancellation likelihood
 
PRESENTATION ON CHALLENGE lab_084627 (1).pptx
PRESENTATION ON CHALLENGE lab_084627 (1).pptxPRESENTATION ON CHALLENGE lab_084627 (1).pptx
PRESENTATION ON CHALLENGE lab_084627 (1).pptx
 
The Internet of Flying Things - Overview
The Internet of Flying Things - OverviewThe Internet of Flying Things - Overview
The Internet of Flying Things - Overview
 
A statistical approach to predict flight delay
A statistical approach to predict flight delayA statistical approach to predict flight delay
A statistical approach to predict flight delay
 
KNN and regression Tree
KNN and regression TreeKNN and regression Tree
KNN and regression Tree
 
DOC245-20240219-WA0000_240219_090212.pdf
DOC245-20240219-WA0000_240219_090212.pdfDOC245-20240219-WA0000_240219_090212.pdf
DOC245-20240219-WA0000_240219_090212.pdf
 
Hard landing predection
Hard landing predectionHard landing predection
Hard landing predection
 
Random Forest Ensemble learning algorithm for Engineering Analytics Project
Random Forest Ensemble learning algorithm for Engineering Analytics ProjectRandom Forest Ensemble learning algorithm for Engineering Analytics Project
Random Forest Ensemble learning algorithm for Engineering Analytics Project
 
Air Travel Analytics in SAS
Air Travel Analytics in SASAir Travel Analytics in SAS
Air Travel Analytics in SAS
 
Scheduling And Revenue Management Process
Scheduling And Revenue Management ProcessScheduling And Revenue Management Process
Scheduling And Revenue Management Process
 
Secure Benchmarking
Secure BenchmarkingSecure Benchmarking
Secure Benchmarking
 
Rainmaker Crew Pay
Rainmaker Crew PayRainmaker Crew Pay
Rainmaker Crew Pay
 
Credit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research PaperCredit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research Paper
 
Aviation articles - Aircraft Evaluation and selection
Aviation articles - Aircraft Evaluation and selectionAviation articles - Aircraft Evaluation and selection
Aviation articles - Aircraft Evaluation and selection
 
Matrix Mapper in Aviation Industry
Matrix Mapper in Aviation IndustryMatrix Mapper in Aviation Industry
Matrix Mapper in Aviation Industry
 
Passenger forecasting at KLM
Passenger forecasting at KLMPassenger forecasting at KLM
Passenger forecasting at KLM
 
Decision making tool (ahp)
Decision making tool (ahp)Decision making tool (ahp)
Decision making tool (ahp)
 
Performance testingfromthecloud_usingBlazemeter
Performance testingfromthecloud_usingBlazemeterPerformance testingfromthecloud_usingBlazemeter
Performance testingfromthecloud_usingBlazemeter
 
IRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine Learning
 

Recently uploaded

Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven CuriosityUnlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Hung Le
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
Kayode Fayemi
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
amilabibi1
 
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
David Celestin
 
Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...
Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...
Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...
ZurliaSoop
 

Recently uploaded (17)

Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven CuriosityUnlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
 
Zone Chairperson Role and Responsibilities New updated.pptx
Zone Chairperson Role and Responsibilities New updated.pptxZone Chairperson Role and Responsibilities New updated.pptx
Zone Chairperson Role and Responsibilities New updated.pptx
 
Digital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalDigital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of Drupal
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Bailey
 
in kuwait௹+918133066128....) @abortion pills for sale in Kuwait City
in kuwait௹+918133066128....) @abortion pills for sale in Kuwait Cityin kuwait௹+918133066128....) @abortion pills for sale in Kuwait City
in kuwait௹+918133066128....) @abortion pills for sale in Kuwait City
 
ICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdfICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdf
 
lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.
 
Introduction to Artificial intelligence.
Introduction to Artificial intelligence.Introduction to Artificial intelligence.
Introduction to Artificial intelligence.
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
 
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdfSOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
 
Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...
Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...
Jual obat aborsi Jakarta 085657271886 Cytote pil telat bulan penggugur kandun...
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 

Flight Delay Prediction

  • 1. Government Flight Analysis TEAM 3: VYSHAK SRISHYLAPPA VIVEK KUMAR SANKALP JADON
  • 2.
  • 3. Business cases/problem statement:  Crowded airspace becoming unpredictable.  Rescheduling of critical government air space operations because of delays  Problems in liaisoning between US military and the civilian Air Traffic Control because of sudden delays.  Bad customer satisfaction for US residents.  Sudden surge/decrease in the airfare.  Solution :  Delay Prediction  Average Price Prediction  Flight cancellation Prediction  Flight Recommendation
  • 4. Data  We have gathered the data from Statistical Computing Statistical Graphics section of American Statistical Association Website.  Data had around 5 million rows and 25 columns.  We processed our prediction on .5 million rows.  Recommendation: we ran the matchbox recommendation algorithm against 35,000 reviews who had reviewed the airline carriers.
  • 5. Flight Cancellation classification Model Accuracy Precision Two Class Logistic Regression 0.978 0.565 Two Class Neural Network 0.980 0.756 Two Class Boosted DecisionTree 0.982 0.758 Two Class Decision Forest 0.980 0.591 Two Class Decision Jungle 0.981 0.872 • Classification done on the Cancelled Column of the dataset. 0 stands for not cancelled and 1 for cancelled. • Two Class Boosted Decision Tree gives better accuracy. • Weather data was scraped from wunderground website. • On Feature Selection, we selected flightnum, hour, temperature, visibility and sea level pressure as the variables that help in better prediction.
  • 6. Arrival Delay Prediction  Based on feature selection, used- hour, flight number, day of the month, visibility, day of week and departure delay to train various regression models.  Used Linear regression, boosted decision tree, Neural Network, and Decision Forest.  Concluded that the prediction required even more features like like mechanical issues, airport congestion, etc. which were not present in the dataset.  Found that Boosted decision tree was the best algorithm amongst all.
  • 7. Flight Delay Models Matrix Linear Regression Neural Networks Decision Forest Regression Boosted Decision Tree MAE 15.50 13.50 11.68 12.38 RMSE 27.70 18.83 17.23 17.32 Relative Absolute Error 0.57 0.47 0.42 0.45 Relative Squared Error 0.38 0.17 0.14 0.15 Coefficient 0.61 0.84 0.85 0.84
  • 9. Average Price Prediction  Predict the average price of flights, depending on destination address.  Predict average ticket price according to Flight Carrier.  We found Boosted Decision Tree to be the best model among all others.
  • 10. Air Carrier Recommendation  We are using the Microsoft Azure recommendation System to get the related Airlines carriers.  The dataset is trained on UserName, Airlines carrier and their ratings.