Linear Regression With Orange
Author: Anthony Mok
Date: 18 Nov 2023
Email: xxiaohao@yahoo.com
Predicting HDB Resale Prices
What Do Users Say About Orange
Orange is an open-source data visualisation and machine learning toolkit that has been
widely praised:
"Orange is an excellent data mining tool that is easy to use and has a wide range of features."
"I love Orange's visual programming interface. It makes it easy to build complex data mining workflows."
"Orange is a powerful tool that can be used to solve a wide variety of problems."
Project’s Context, Objective & Strategies
Objective
To predict resale prices in order to
advise his potential clients
Strategies
Explore & Clean data for analysis
Perform Linear Regression, in
Orange, to predict resale prices
from the collected data
Tune the model to improve its
performance
Visualise the findings, share
conclusions, and give insight-driven
recommendations
Context
A Housing Agent collected resale prices
of HDB apartments in Singapore
Exploratory Data Analysis
Findings
• Feature Columns = 8
• Categorical columns = 3
• Numerical columns = 5
• Of which, Target = resales_price
• Instances = 2,000
• Outliers = None
Linear Regression Model in Orange
Loading File & Exploring Data
Loading File
The hdb_resales.csv file was imported
into the workflow. The ‘Role’ of
‘resales_price’ was set to
‘target’, with the rest set to ‘feature’
Exploring Data
No missing data found
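The loading step above can be sketched outside Orange in plain Python. This is only an illustration, not the author's workflow: the sample rows and every column name except `resales_price` are made up, and the helper names are hypothetical.

```python
import csv
import io

# Hypothetical sample standing in for hdb_resales.csv; only the
# 'resales_price' column name comes from the slides.
SAMPLE_CSV = """town,flat_type,floor_area_sqm,resales_price
Bishan,4-room,95,620000
Yishun,3-room,68,330000
Tampines,5-room,120,585000
"""


def load_rows(handle):
    """Read CSV rows into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))


def assign_roles(columns, target="resales_price"):
    """Mark one column as the target and every other column as a feature."""
    return {name: ("target" if name == target else "feature") for name in columns}


def count_missing(rows):
    """Count empty or absent cells per column."""
    counts = {name: 0 for name in rows[0]}
    for row in rows:
        for name, value in row.items():
            if value is None or value.strip() == "":
                counts[name] += 1
    return counts


rows = load_rows(io.StringIO(SAMPLE_CSV))
roles = assign_roles(rows[0].keys())
missing = count_missing(rows)
```

With a real file, `io.StringIO(SAMPLE_CSV)` would be replaced by an opened file handle; a column total of zero in `missing` corresponds to the "No missing data found" result.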
Looking for Relationships & Patterns*
Floor Space & Resale Prices
Prices Increase as Floor Area Increases
Region & Resale Prices
More 4- and 5-room apartments in the
Central Region command resale prices
above 500K than in other regions
* More comprehensive findings and conclusions were provided in the project report, which are
not released at the request of the Housing Agent
Splitting Data & Doing Linear Regression
Splitting Data
Dataset was split into 70%
for training and 30% for
testing the Linear
Regression Model
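A 70/30 split like the one described can be sketched in plain Python as below. The function name, the shuffling approach, and the fixed seed are assumptions for illustration; the slides do not say how Orange was configured to sample the data.

```python
import random


def train_test_split(rows, train_fraction=0.7, seed=42):
    """Shuffle a copy of the rows and cut it into training and testing parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# 2,000 placeholder instances, matching the instance count in the slides.
instances = list(range(2000))
train, test = train_test_split(instances)
```

With 2,000 instances this yields 1,400 training rows and 600 testing rows.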
Conduct Linear Regression
The training data was used to fit
the Linear Regression Model; the testing data was used to evaluate it
Coefficients*
Mix of positive and negative
coefficients, indicating both positive and negative
associations between resale prices and the towns these
apartments belong to
* More comprehensive findings and conclusions were provided in
the project report, which are not released at the request of the
Housing Agent
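To make the fitting step concrete, here is a minimal least-squares sketch with a single numeric feature. It is a simplification under stated assumptions: the project's model used several features, including towns, while this example uses made-up floor areas and prices and hypothetical function names.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """Predict the target for one feature value."""
    return intercept + slope * x


# Made-up training data: floor area in square metres and resale price.
train_area = [60, 70, 90, 110, 120]
train_price = [300000, 350000, 450000, 550000, 600000]

slope, intercept = fit_simple_linear_regression(train_area, train_price)
estimate = predict(slope, intercept, 100)
```

A positive slope means the predicted price rises with the feature and a negative one means it falls, which is how the sign of each coefficient in the fitted model is read.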
Evaluating Performance of Model
Evaluation (Training Data)
Performance of Model on the
70% training data:
RMSE = 53,981
R2 = 82.3%
Evaluation (Testing Data)
Performance of Model on the
30% testing data:
RMSE = 61,992
R2 = 78.3%
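For readers unfamiliar with the two scores, the sketch below shows how RMSE and R2 are computed from actual and predicted values. The numbers are made up for illustration and are not the project's data.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error: typical error size, in the target's units."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(actual, predicted):
    """Share of the variance in the actual values explained by the predictions."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


# Made-up resale prices and model predictions.
actual = [300000, 400000, 500000, 600000]
predicted = [320000, 390000, 510000, 580000]

error = rmse(actual, predicted)
score = r_squared(actual, predicted)
```

Here `error` is about 15,811 and `score` is 0.98, meaning the predictions are off by roughly 15.8K on average and explain 98% of the variation in these sample prices.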
Findings, Conclusions & Recommendations
With an R-squared (R2) value of 82.3% on the training data and 78.3% on
the testing data, the model explains a large share of the
variation in resale prices, which suggests it is performing well.
At 53,981 for the training data and 61,992 for the testing data, the Root
Mean Squared Error (RMSE), which is expressed in the same units as the
resale price, is also relatively low. This suggests that the
model's predictions are fairly close to the actual resale prices.
Overall, these scores indicate that the model is fit for use in prediction
without the need for regularisation: the testing R2 is only 4 percentage
points below the training R2, so the model does not appear to be overfitted
to the data.
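The gap between training and testing scores can be worked out directly from the figures reported above. The variable names are illustrative, and no cut-off for "overfitted" is implied by the slides.

```python
# Scores as reported in the slides.
train_scores = {"rmse": 53981, "r2": 0.823}
test_scores = {"rmse": 61992, "r2": 0.783}

# Drop in explained variance from training to testing, in percentage points.
r2_gap_points = (train_scores["r2"] - test_scores["r2"]) * 100

# Rise in typical prediction error from training to testing.
rmse_increase = test_scores["rmse"] - train_scores["rmse"]
rmse_increase_pct = 100 * rmse_increase / train_scores["rmse"]
```

The R2 gap is 4 percentage points and the RMSE rises by 8,011, or just under 15% of the training RMSE.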
Making Predictions With Model*
Predicting HDB Apartment
Resale Prices
Snapshot of predicted HDB
Apartment Resale Prices
Predicting Resales Prices
The number of rooms in an apartment is a relatively
good predictor of its resale price. The Housing Agent
can use this model to predict the resale price of
3-room, 4-room, and 5-room apartments with a high
degree of accuracy
* More comprehensive findings and conclusions were provided in the project report, which are not released at the request of the Housing Agent