How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear Road to Profitability
Timothy Leonard, EVP
Dr. Piyush Kumar, Data Scientist
JUNE 2017
Industry/Business Problem
Marketing intelligence is turning data into insights and opportunities
Market intelligence data sources: Transportation data, User data, Service data, External data
Data elements: Events, Lanes, Rates, Drivers, Payment terms, Shipments, Planning time, Reports, Response time, On-time pickup, On-time delivery, Payment terms, Network, Telematics, CSA, Social media
Opportunities: Marketing campaign, Strategic planning, Data-driven marketing, Predictive & prescriptive analytics, Effective resource planning, Increase customer retention, Lead scoring, Visibility to industry benchmarks, Increase upselling & cross-selling
It is not simple: it involves analyzing petabytes of data (~2 trillion records)
Market Penetration (worst-case): TL & LTL & PF 100%; Home delivery 5%
| Industry | Daily Transaction: TL & LTL & PF | Daily Transaction: Home delivery | OLTP (90 days): TL & LTL & PF | OLTP (90 days): Home delivery | OLAP (2 yrs @ 200 days/yr): TL & LTL & PF | OLAP (2 yrs @ 200 days/yr): Home delivery |
| Drivers | 3,500,000 | 100,000,000 | 3,500,000 | 100,000,000 | 3,500,000 | 100,000,000 |
| Assets | 2,700,000 | 235,000,000 | 2,700,000 | 235,000,000 | 2,700,000 | 235,000,000 |
| Loads | 2,700,000 | 5,000,000 | 243,000,000 | 450,000,000 | 1,080,000,000 | 2,000,000,000 |
| Orders (Loads x 3) | 8,100,000 | 10,000,000 | 729,000,000 | 900,000,000 | 3,240,000,000 | 4,000,000,000 |
| Commodity (Orders x 3) | 24,300,000 | 30,000,000 | 2,187,000,000 | 2,700,000,000 | 9,720,000,000 | 12,000,000,000 |
| User Fields (Orders x 5) | 40,500,000 | 50,000,000 | 3,645,000,000 | 4,500,000,000 | 16,200,000,000 | 20,000,000,000 |
| Order Audit (Orders x 20) | 162,000,000 | 200,000,000 | 14,580,000,000 | 18,000,000,000 | 64,800,000,000 | 80,000,000,000 |
| Records (Orders x 3) | 24,300,000 | 30,000,000 | 2,187,000,000 | 2,700,000,000 | 9,720,000,000 | 12,000,000,000 |
| Events | 162,000,000 | 200,000,000 | 14,580,000,000 | 18,000,000,000 | 64,800,000,000 | 80,000,000,000 |
| GPS Updates (Loads x 600) | 1,620,000,000 | 6,000,000,000 | 145,800,000,000 | 540,000,000,000 | 648,000,000,000 | 2,400,000,000,000 |
| Invoice | 2,700,000 | | 243,000,000 | | 1,080,000,000 | |
| IoT | | | | | | |
| Rates | | | | | | |
| Driver Pay, Settlements | | | | | | |
| Integration | | | | | | |
| Others | | | | | | |
| Total | 2,052,800,000 | 6,860,000,000 | 184,200,200,000 | 587,585,000,000 | 818,646,200,000 | 2,610,335,000,000 |
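The volume estimate for the TL & LTL & PF segment can be checked with a few lines of arithmetic. The sketch below is only an illustration of how the totals appear to be built: the multipliers for Orders, Commodity, User Fields, Order Audit, Records, and GPS Updates come from the row labels, while treating Events as Orders x 20 and Invoice as one per load are assumptions that happen to match the slide's figures. It also assumes Drivers and Assets are counted once while everything else scales with the number of days. All function and variable names are hypothetical.

```python
# Figures for the TL & LTL & PF segment, taken from the slide.
DRIVERS = 3_500_000
ASSETS = 2_700_000
DAILY_LOADS = 2_700_000


def daily_transactions(loads: int) -> dict:
    """Per-day record counts derived from the number of loads."""
    orders = loads * 3
    return {
        "loads": loads,
        "orders": orders,
        "commodity": orders * 3,
        "user_fields": orders * 5,
        "order_audit": orders * 20,
        "records": orders * 3,
        "events": orders * 20,      # assumption: the slide gives no multiplier
        "gps_updates": loads * 600,
        "invoice": loads,           # assumption: one invoice per load
    }


def total_records(loads: int, days: int) -> int:
    """Drivers and assets counted once; transaction rows scaled by days."""
    return DRIVERS + ASSETS + days * sum(daily_transactions(loads).values())


daily_total = total_records(DAILY_LOADS, days=1)
oltp_total = total_records(DAILY_LOADS, days=90)        # OLTP window
olap_total = total_records(DAILY_LOADS, days=2 * 200)   # 2 yrs @ 200 days/yr

print(f"{daily_total:,} | {oltp_total:,} | {olap_total:,}")
```

Under these assumptions the three values equal the TL & LTL & PF entries in the Total row: 2,052,800,000, 184,200,200,000, and 818,646,200,000.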
Fragmentation of Transportation
Historical Factors of a Fleet’s Success
Domain Expertise, Freight Volume, Regional Dominance, Advantageous Pricing
• Effects
– Very little market-wide data available
– Every fleet has its own “secret sauce” that it protects at all costs
– Brokers have access to the most data points
The Small and Mid-Size Carrier Disadvantage
• A daunting deficit of data available to the small carrier
• The primary way to overcome it is to price low
– Even “low pricing” is defined by a best guess
Avg. Company Age (years): 100 Truck Carrier 3; 2000 Truck Carrier 30
Avg. Daily Loads Available: 100 Truck Carrier 5,000; 2000 Truck Carrier 50,000
Avg. Historical Lane Data Points Available: 100 Truck Carrier 1,000,000; 2000 Truck Carrier 300,000,000
The Opportunity and Use Case
The Commonality: TMW
Small Fleets, Medium Fleets, Large Fleets: Rate, Miles, Cost, Fuel, HOS, Breaks, Weight, etc.
• No matter the fleet size, thousands of them have one thing in common when it comes to lane data:
• A TMW Database
Leveling the Playing Field
Through MRI, a fleet of any size has access to:
• Average Rate
• Minimum Rate
• Maximum Rate
• Accessorial Revenue Per Load
• Fuel Surcharge Per Mile
• Geographical Level
• Standard Deviation
• Number of Carriers
• Number of Loads
Which gives them the ability to:
• Negotiate better rate structures with shippers
• Identify those lanes in their network that can easily bear a rate increase
• Analyze new lanes and move into lucrative new markets
• Quote new business to win with competitive intelligence on current rates
• Reallocate internal resources to work more profitable lanes
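The per-lane benchmark figures named on this slide (average, minimum, and maximum rate, standard deviation, number of carriers, number of loads) are straightforward aggregates over load records. The following is a minimal sketch, not MRI's actual implementation: the sample loads, lane names, carrier IDs, and the `lane_benchmarks` function are all hypothetical, and the use of the population standard deviation is an assumption.

```python
from collections import defaultdict
from statistics import mean, pstdev

# Hypothetical load records: (lane, carrier_id, rate_per_mile)
LOADS = [
    ("Dallas-Atlanta", "carrier_a", 2.00),
    ("Dallas-Atlanta", "carrier_b", 2.50),
    ("Dallas-Atlanta", "carrier_a", 3.00),
    ("Chicago-Denver", "carrier_c", 1.80),
]


def lane_benchmarks(loads):
    """Aggregate rate statistics per lane across all contributing carriers."""
    by_lane = defaultdict(list)
    for lane, carrier, rate in loads:
        by_lane[lane].append((carrier, rate))

    result = {}
    for lane, rows in by_lane.items():
        rates = [rate for _, rate in rows]
        result[lane] = {
            "average_rate": mean(rates),
            "minimum_rate": min(rates),
            "maximum_rate": max(rates),
            # Assumption: population standard deviation over the lane's loads.
            "standard_deviation": pstdev(rates),
            "number_of_carriers": len({carrier for carrier, _ in rows}),
            "number_of_loads": len(rows),
        }
    return result


benchmarks = lane_benchmarks(LOADS)
for lane, stats in benchmarks.items():
    print(lane, stats)
```

Reporting the number of carriers and loads alongside the rate statistics lets a reader judge how much data stands behind each lane's benchmark.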
Technical Solution
SaaS Solution In An On-Premise World
• Multi-Tenant Hortonworks Hadoop Business Intelligence Solution Built in the Azure Cloud
• TMW customers enabled with Big Data and Data Science without the cost of infrastructure and personnel
• TMW Business Intelligence and Data Science teams providing solutions for customers and industry
Components
Data Access Governance Operations Security
Hive 1.2.1/Tez 0.7 Atlas 0.7.0 Ambari 2.4.2 Knox 0.9.0
Spark 2.0 Ranger 0.6.0 Oozie 4.2.0 Ranger 0.6.0
HBase 1.1.2 Talend* Kerberos
Phoenix 4.7
Zeppelin 0.6
SSIS*
NiFi 1.1.0
*Not part of Hortonworks Data Platform
Building A Transportation Data Community
• The TMW Data Community enables analytics and benchmarks for the transportation industry
• TMW is able to leverage a data science and statistical approach to market intelligence for data profiling and cleansing
Data Science For Transportation
MRI Visualizations
• Facilitates telling and sharing stories about data.
Fuel Data Analysis
• White paper for analyzing fuel prices and providing evidence towards the value that ExpertFuel provides.
K-Means Clustering And Beyond
Model Building
• Rate per Mile,
• Load Availability,
• Load Velocity.
Model Saving/Loading & Predictions
• Enable Prediction As A Service by creating an on-demand model library.
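To make the build, save, load, and predict cycle concrete, here is a small self-contained sketch of k-means (Lloyd's algorithm) with a JSON file standing in for an entry in a model library. It is not the presenters' code: the deck's stack includes Spark, whereas this example uses only the Python standard library, and the sample points, the two features chosen, the file name, and every function name are hypothetical. Features are left unscaled for brevity.

```python
import json
import math
import os
import tempfile

# Hypothetical lane features: (rate per mile, loads available per day)
POINTS = [(1.8, 40.0), (1.9, 42.0), (2.0, 38.0), (3.1, 5.0), (3.0, 6.0), (3.2, 4.0)]


def nearest(point, centroids):
    """Index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))


def fit_kmeans(points, k, iterations=20):
    """Lloyd's algorithm, seeded with k evenly spaced points from the input."""
    step = len(points) // k
    centroids = [points[i * step] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Move each centroid to the mean of its cluster; keep it if the cluster is empty.
        centroids = [
            tuple(sum(dim) / len(c) for dim in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids


def save_model(centroids, path):
    """Persist the fitted centroids so they can be reused later."""
    with open(path, "w") as f:
        json.dump({"centroids": centroids}, f)


def load_model(path):
    """Read centroids back from disk."""
    with open(path) as f:
        return [tuple(c) for c in json.load(f)["centroids"]]


model_path = os.path.join(tempfile.mkdtemp(), "lane_kmeans.json")
save_model(fit_kmeans(POINTS, k=2), model_path)

loaded = load_model(model_path)
prediction = nearest((2.9, 7.0), loaded)  # cluster index for a new lane
print(loaded, prediction)
```

Separating fitting from prediction in this way is what allows a stored model to answer requests on demand without retraining.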
Transportation Data Lake For the entire Industry
What Went Right
SaaS BI Successes
• Building the Data Lake and working in Hadoop in Azure allowed the data lake to feed innovation in Predictive Analytics, Master Data Management, and Benchmarking
• Use the experts, like the Hortonworks PS team (teach, teach, teach)
• Individual Customer and Industry Intelligence was secured and centralized for collaborative Business Intelligence
• Speed to Market: TMW had its first customer onboarded in under 9 months, with more quickly added.
Lessons Learned
• Build a bigger data lake: the TMW Data Community should have been an early target to launch
• Be agile in your R&D processes to enable changes in process; big data is changing fast with additions like Apache Spark, Zeppelin, and NiFi

 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 

Recently uploaded (20)

Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 

How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear Road to Profitability

  • 1. How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear Road to Profitability Timothy Leonard, EVP Dr. Piyush Kumar, Data Scientist JUNE 2017
  • 3. Marketing intelligence is turning data into insights and opportunities. Market intelligence draws on transportation data, user data, service data, and external data. Data elements: Events, Lanes, Rates, Drivers, Payment Terms, Shipments, Planning Time, Reports, Response Time, On-Time Pickup, On-Time Del., Payment Terms, Network, Telematics, CSA, Social Media. Opportunities: marketing campaigns, strategic planning, data-driven marketing, predictive & prescriptive analytics, effective resource planning, increased customer retention, lead scoring, visibility to industry benchmarks, increased upselling & cross-selling.
  • 4. It is not simple: it involves analyzing petabytes of data (~2 trillion records). Market Penetration (worst-case): 100%, 5%.
      Columns, left to right: Daily Transaction (TL & LTL & PF | Home delivery), OLTP 90 days (TL & LTL & PF | Home delivery), OLAP 2 yrs @ 200 days/yr (TL & LTL & PF | Home delivery).
      Drivers | 3,500,000 | 100,000,000 | 3,500,000 | 100,000,000 | 3,500,000 | 100,000,000
      Assets | 2,700,000 | 235,000,000 | 2,700,000 | 235,000,000 | 2,700,000 | 235,000,000
      Loads | 2,700,000 | 5,000,000 | 243,000,000 | 450,000,000 | 1,080,000,000 | 2,000,000,000
      Orders (Loads x 3) | 8,100,000 | 10,000,000 | 729,000,000 | 900,000,000 | 3,240,000,000 | 4,000,000,000
      Commodity (Orders x 3) | 24,300,000 | 30,000,000 | 2,187,000,000 | 2,700,000,000 | 9,720,000,000 | 12,000,000,000
      User Fields (Orders x 5) | 40,500,000 | 50,000,000 | 3,645,000,000 | 4,500,000,000 | 16,200,000,000 | 20,000,000,000
      Order Audit (Orders x 20) | 162,000,000 | 200,000,000 | 14,580,000,000 | 18,000,000,000 | 64,800,000,000 | 80,000,000,000
      Records (Orders x 3) | 24,300,000 | 30,000,000 | 2,187,000,000 | 2,700,000,000 | 9,720,000,000 | 12,000,000,000
      Events | 162,000,000 | 200,000,000 | 14,580,000,000 | 18,000,000,000 | 64,800,000,000 | 80,000,000,000
      GPS Updates (Loads x 600) | 1,620,000,000 | 6,000,000,000 | 145,800,000,000 | 540,000,000,000 | 648,000,000,000 | 2,400,000,000,000
      Invoice | 2,700,000 | - | 243,000,000 | - | 1,080,000,000 | -
      Total | 2,052,800,000 | 6,860,000,000 | 184,200,200,000 | 587,585,000,000 | 818,646,200,000 | 2,610,335,000,000
      Rows listed without values: IoT; Rates; Driver Pay, Settlements; Integration; Others.
  • 5. Fragmentation of Transportation Domain Expertise. Historical factors of a fleet’s success: freight volume, regional dominance, advantageous pricing. Effects: very little market-wide data is available; every fleet has its own “secret sauce” that it protects at all costs; brokers have access to the most data points.
  • 6. The Small and Mid-Size Carrier Disadvantage. A daunting deficit of data is available to the small carrier. The primary way to overcome it is to price low, and even “low pricing” is defined by a best guess. Charts (100-truck carrier vs. 2,000-truck carrier): avg. company age, 3 vs. 30 years; avg. daily loads available, 5,000 vs. 50,000; avg. historical lane data points available, 1,000,000 vs. 300,000,000.
  • 8. The Commonality. Small, medium, and large fleets all track the same lane data: rate, miles, cost, fuel, HOS, breaks, weight, etc. No matter the fleet size, thousands of them have one thing in common when it comes to lane data: a TMW database.
  • 9. Leveling the Playing Field. Through MRI, any fleet of any size has access to: average rate; minimum rate; maximum rate; accessorial revenue per load; fuel surcharge per mile; geographical level; standard deviation; number of carriers; number of loads. Which gives them the ability to: negotiate better rate structures with shippers; identify those lanes in your network that can easily bear a rate increase; analyze new lanes and move into lucrative new markets; quote new business to win with competitive intelligence on current rates; reallocate internal resources to work more profitable lanes.
  • 11. SaaS Solution in an On-Premise World. A multi-tenant Hortonworks Hadoop business intelligence solution built in the Azure cloud. TMW customers are enabled with Big Data and Data Science without the cost of infrastructure and personnel. TMW Business Intelligence and Data Science teams provide solutions for customers and the industry.
  • 12. Components. Categories: Data Access, Governance, Operations, Security. Components as listed on the slide: Hive 1.2.1/Tez 0.7; Atlas 0.7.0; Ambari 2.4.2; Knox 0.9.0; Spark 2.0; Ranger 0.6.0; Oozie 4.2.0; Ranger 0.6.0; HBase 1.1.2; Talend*; Kerberos; Phoenix 4.7; Zeppelin 0.6; SSIS*; NiFi 1.1.0. (*Not part of Hortonworks Data Platform.)
  • 13. Building a Transportation Data Community. The TMW Data Community enables analytics and benchmarks for the transportation industry. TMW is able to apply data science and statistical approaches to Market Intelligence for data profiling and cleansing.
  • 14. Data Science For Transportation
  • 15. MRI Visualizations: facilitates telling and sharing stories about data.
  • 16. Fuel Data Analysis: white paper for analyzing fuel prices and providing evidence towards the value that ExpertFuel provides.
  • 18. Model Building: Rate per Mile, Load Availability, Load Velocity.
  • 19. Model Saving/Loading & Predictions: enable Prediction as a Service by creating an on-demand model library.
  • 20. Transportation Data Lake For the entire Industry
  • 22. SaaS BI Successes. Building the data lake and working in Hadoop in Azure allowed the data lake to feed innovation in Predictive Analytics, Master Data Management, and Benchmarking. Use the experts, like Horton’s PS team (teach, teach, teach). Individual customer and industry intelligence was secured and centralized for collaborative Business Intelligence. Speed to market: TMW had its first customer on-boarded in under 9 months, with more quickly added.
  • 24. Lessons Learned. Build a bigger data lake: the TMW Data Community should have been an early target to launch. Be Agile in your R&D processes to enable changes in process; big data is changing fast with additions like Apache Spark, Zeppelin and NiFi.
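
The record totals on slide 4 can be checked with a few lines of arithmetic. The sketch below copies the daily counts from the slide and assumes two things the slide does not state outright: that Drivers and Assets are fixed populations that do not grow with the retention window, and that the OLTP and OLAP windows are 90 and 400 days (2 years at 200 days/year). Under those assumptions the sums reproduce the slide's Total row. The names `DAILY`, `STATIC`, and `window_total` are illustrative, not part of any TMW system.

```python
# Daily record counts per entity, copied from the slide 4 table.
DAILY = {
    "TL & LTL & PF": {
        "Drivers": 3_500_000, "Assets": 2_700_000, "Loads": 2_700_000,
        "Orders": 8_100_000, "Commodity": 24_300_000,
        "User Fields": 40_500_000, "Order Audit": 162_000_000,
        "Records": 24_300_000, "Events": 162_000_000,
        "GPS Updates": 1_620_000_000, "Invoice": 2_700_000,
    },
    "Home delivery": {
        "Drivers": 100_000_000, "Assets": 235_000_000, "Loads": 5_000_000,
        "Orders": 10_000_000, "Commodity": 30_000_000,
        "User Fields": 50_000_000, "Order Audit": 200_000_000,
        "Records": 30_000_000, "Events": 200_000_000,
        "GPS Updates": 6_000_000_000,
    },
}

# Assumption: these entities are fixed populations, so they are counted
# once regardless of how many days of data are retained.
STATIC = {"Drivers", "Assets"}

OLTP_DAYS = 90        # "OLTP (90 days)"
OLAP_DAYS = 2 * 200   # "OLAP (2 yrs @ 200 days/yr)"


def window_total(daily_counts, days):
    """Total records held when `days` days of transactions are retained."""
    return sum(
        count if entity in STATIC else count * days
        for entity, count in daily_counts.items()
    )


if __name__ == "__main__":
    for industry, counts in DAILY.items():
        for label, days in (("daily", 1), ("OLTP", OLTP_DAYS), ("OLAP", OLAP_DAYS)):
            print(f"{industry:>14} {label:>5}: {window_total(counts, days):,}")
```

The largest figure, the Home delivery OLAP total, is where the "~2 trillion records" headline comes from, and GPS updates account for most of it.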
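
The lane benchmarks listed on slide 9 (average, minimum, and maximum rate, standard deviation, number of carriers, number of loads) are simple per-lane aggregates. The following is a minimal sketch of that aggregation, not TMW's implementation: the load records, lane names, carrier IDs, and the `lane_benchmarks` function are all hypothetical, and it uses the population standard deviation as one possible choice.

```python
from collections import defaultdict
from statistics import mean, pstdev

# Hypothetical load records: (lane, carrier_id, rate_per_mile_usd).
LOADS = [
    ("Dallas-Atlanta", "carrier_a", 2.00),
    ("Dallas-Atlanta", "carrier_b", 2.50),
    ("Dallas-Atlanta", "carrier_a", 3.00),
    ("Chicago-Denver", "carrier_c", 1.80),
]


def lane_benchmarks(loads):
    """Group loads by lane and compute per-lane rate benchmarks."""
    by_lane = defaultdict(list)
    for lane, carrier, rate in loads:
        by_lane[lane].append((carrier, rate))

    benchmarks = {}
    for lane, rows in by_lane.items():
        rates = [rate for _, rate in rows]
        benchmarks[lane] = {
            "avg_rate": mean(rates),
            "min_rate": min(rates),
            "max_rate": max(rates),
            "std_dev": pstdev(rates),  # population standard deviation
            "num_carriers": len({carrier for carrier, _ in rows}),
            "num_loads": len(rows),
        }
    return benchmarks


if __name__ == "__main__":
    for lane, stats in lane_benchmarks(LOADS).items():
        print(lane, stats)
```

Reporting the carrier and load counts alongside the rates lets a reader judge how much data stands behind each lane's benchmark.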