Harnessing Big Data and Analytics
Sept 24th, 2013
Julio Da Silva
Global IT Director of Enterprise Data Warehouse
Page 2
Overview of Newmont
Newmont Mining Corporation is primarily a gold producer, with significant assets
or operations in the United States, Australia, Peru, Indonesia, Ghana, New
Zealand and Mexico. Founded in 1921 and publicly traded since 1925, Newmont
is one of the world’s largest gold producers and is the only gold company included
in the S&P 500 Index and Fortune 500. Headquartered near Denver, Colorado, the
company has around 40,000 employees and contractors worldwide.
Page 3
It started with bits and bytes
Gigabyte (1 000 000 000 Bytes)
1 Gigabyte: A pickup truck filled with paper
Terabyte (1 000 000 000 000 Bytes)
10 Terabytes: The printed collection of the US Library of Congress
Petabyte (1 000 000 000 000 000 Bytes)
20 Petabytes: Production of hard-disk drives in 1995
Exabyte (1 000 000 000 000 000 000 Bytes)
5 Exabytes: All words ever spoken by human beings.
Zettabyte (1 000 000 000 000 000 000 000 Bytes)
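The units on this slide are successive powers of 1000. As a minimal sketch (the helper name `humanize_bytes` and the unit list are illustrative and not part of the original deck), a raw byte count can be expressed in the largest decimal unit that fits:

```python
# Decimal (power-of-1000) units, matching the figures on the slide.
UNITS = ["Bytes", "Kilobytes", "Megabytes", "Gigabytes",
         "Terabytes", "Petabytes", "Exabytes", "Zettabytes"]


def humanize_bytes(n: int) -> str:
    """Express a non-negative byte count in the largest decimal unit that fits."""
    if n < 0:
        raise ValueError("byte count must be non-negative")
    idx = 0
    # Step up one unit at a time while the count still fills the next unit.
    while idx < len(UNITS) - 1 and n >= 1000 ** (idx + 1):
        idx += 1
    value = n / 1000 ** idx
    return f"{value:g} {UNITS[idx]}"


if __name__ == "__main__":
    for count in (10**9, 10 * 10**12, 20 * 10**15, 5 * 10**18, 10**21):
        print(humanize_bytes(count))
```

The loop stops at Zettabytes because that is the largest unit the slide names; larger counts are simply reported as a bigger number of Zettabytes.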
Page 4
What is Big Data
Volume & Velocity
“From the dawn of civilization until 2003, humankind generated 5 Exabytes of data.
Now we produce 5 Exabytes every 2 days…. And the pace is accelerating.” Eric
Schmidt, executive chairman, Google.
Page 5
Where is this data coming from? - Variety
High-speed networks (wireless and wired) connecting:
• Onboard computers on mobile equipment such as trucks and shovels
• Sensors gathering data from our high-value production machinery and equipment: trucks, shovels, conveyors, processing
• RFID tags (people, shipments, inventory, …)
• Mobile phones and tablets enabling greater collection of information (words, photos, voice, video, GPS, …)
• Social networks
• Web 2.0 and collaborative solutions
• Digitized lab results
Page 6
The data explosion meets the ever-falling per-unit cost of computing capability
Page 7
What does Nirvana in Big Data look like?
You may have heard of IBM’s Watson…
Why Jeopardy?
The game of Jeopardy! makes great demands on its players – from the range of topical
knowledge covered to the nuances in language employed in the clues. The question IBM
had for itself was “is it possible to build a computer system that could process big data
and come up with sensible answers in seconds—so well that it could compete with
human opponents?”
A. What is the computer system that played against human opponents on “Jeopardy”… and won.
Page 8
Getting started –
It starts with a vision & not technology
Be a data-driven organization, making decisions that drive industry-leading performance.
Guiding Principles:
1. Strategic and organizational alignment (Top-Down)
2. Focus on Business Value
3. Trust the data quality
4. “Google”-like query speed
5. Be easy to use
6. Be reliable
7. All at a low cost
Page 9
Driving towards a culture of data driven decisions
requires a foundation based on strong skills
Data Governance
Data Architecture
Data Integration/ETL
Reporting and Visualization
Business Analysts
Page 10
Data Scientist
• Data science seeks to use all available and relevant data to effectively tell a story that can be easily understood by non-practitioners
• Incorporates varying elements and builds on techniques and theories from many fields, including mathematics, statistics, data engineering, pattern recognition and learning, advanced computing, visualization, uncertainty modeling, data warehousing, and high performance computing, with the goal of extracting meaning from data and creating data products
Page 12
Teradata Unified Data Architecture
(Diagram. Confidential and proprietary. Copyright © 2012 Teradata Corporation.)
Data sources: Audio & Video, WITSML, Text, DTS/DAS, Logs, GIS, MRO, ERP
Big Data Management: Capture | Store | Refine
Big Data Analytics: Discovery Platform, Integrated Data Warehouse
Tools: Languages, Math & Stats, Data Mining, Business Intelligence, Applications
Users: Reservoir Engineers, Production Engineers, Asset Managers, Operators, Customers / Partners, Geologist, HSE, Maintenance
Page 13
Big Data Discovery Platform Requirements
Data Sources: Structured Data, Multi-Structured Data, Non-relational Data, OLTP DBMSs
Users: Business Analyst, Data Scientist
Discovery Tools: SQL, MapReduce, Statistical Functions
Discovery Platform
• Structured and multi-structured data
• Doesn’t require extensive data modeling
• Doesn’t balance the books
• Data completeness can be good enough
• No stringent SLAs
Possible Analytics
• Path to Machine Failure
• Historical Well Behavior
• Optimization Analytics
• Machine data/process optimization
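To make "Path to Machine Failure" concrete, here is a minimal Python sketch. It is not Aster's nPath operator or any Teradata API; the event names, the sample log, and the helper `paths_to_failure` are hypothetical. It counts which short sequences of events most often precede a failure on the same machine.

```python
from collections import Counter
from typing import Iterable

# Hypothetical event log: (machine_id, timestamp, event_name).
SAMPLE_EVENTS = [
    ("truck-1", 1, "HIGH_TEMP"),
    ("truck-1", 2, "LOW_OIL_PRESSURE"),
    ("truck-1", 3, "FAILURE"),
    ("truck-2", 1, "HIGH_TEMP"),
    ("truck-2", 2, "LOW_OIL_PRESSURE"),
    ("truck-2", 3, "FAILURE"),
    ("shovel-1", 1, "VIBRATION"),
    ("shovel-1", 2, "FAILURE"),
]


def paths_to_failure(events: Iterable[tuple[str, int, str]],
                     window: int = 3,
                     failure: str = "FAILURE") -> Counter:
    """Count the event sequences (up to `window` long) that precede each failure."""
    by_machine: dict[str, list[tuple[int, str]]] = {}
    for machine, timestamp, name in events:
        by_machine.setdefault(machine, []).append((timestamp, name))

    counts: Counter = Counter()
    for history in by_machine.values():
        history.sort()  # order each machine's events by timestamp
        names = [name for _, name in history]
        for i, name in enumerate(names):
            if name == failure:
                # The path is the run of events just before this failure.
                counts[tuple(names[max(0, i - window):i])] += 1
    return counts


if __name__ == "__main__":
    for path, n in paths_to_failure(SAMPLE_EVENTS).most_common():
        print(" -> ".join(path), "-> FAILURE:", n)
```

Because this kind of discovery work tolerates "good enough" completeness, the sketch simply skips machines with no recorded failure rather than validating the log.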
Page 14
Discovery Analytics Visualization Options
Visual SQL-MapReduce® Functions
• A visualization tool built upon Aster’s SQL-MapReduce framework.
• Browser-based visualizations are produced using result sets from popular Aster operators such as nPath and cFilter.
• Visual SQL-MR functions are perfect for visualizing path & pattern and graph for iterative analysis.
Traditional BI Tools
• Connect to the Aster platform using ODBC.
• Aster’s relational output is easily fed into and visualized by these partner tools (and other standard BI Tools).
• Great for visualizing query results and for interactive reporting.
Page 15
Uncover New Insights In Your Business
• Deliver valuable insight to lines of business resulting from deep analysis of all of your data, all of the time
[Figure panels: Fraudulent Paths; Golden Path to Application Submit; Paths To Attrition]
Page 16
Opportunities in the mining world
Production, Processing, Logistics, Distribution, Core/Drilling,
Exploration, Social responsibility…
Asset Management Use case
• Capture all sensor data and “black box” data, structured and unstructured
• Capture all relevant ERP and transaction data
  - ERP data (work orders, notifications, preventative maintenance schedules, equipment costs over lifetime, future planned costs, fleet/operations performance data, …)
• Integrate production data plans
  - Real-time reporting events
• Condition-based monitoring systems and alerts leveraging a Service Oriented Architecture
With integrated data one has the ability to navigate and analyze
“CONTEXT” in relation to the real-time event and make decisions.
“CONTEXT” = comparison data (view by equipment, site, region,
global); trends (history + planned); dependencies (production plans)
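As a rough illustration of the "comparison data" part of CONTEXT, the Python sketch below compares one real-time sensor reading with historical averages at the equipment, site and global levels. The data layout, the sample values and the function name `context_for` are assumptions made for this example and do not come from the presentation.

```python
from statistics import mean

# Hypothetical historical readings: (site, equipment_id, engine_temp_c).
HISTORY = [
    ("site-a", "truck-1", 88.0),
    ("site-a", "truck-1", 90.0),
    ("site-a", "truck-2", 92.0),
    ("site-b", "truck-3", 85.0),
]


def context_for(event, history):
    """Return how far a real-time reading sits above each historical baseline.

    A level is mapped to None when there is no history to compare against.
    """
    site, equipment, value = event
    groups = {
        "equipment": [v for _, e, v in history if e == equipment],
        "site": [v for s, _, v in history if s == site],
        "global": [v for _, _, v in history],
    }
    return {
        level: (value - mean(values)) if values else None
        for level, values in groups.items()
    }


if __name__ == "__main__":
    # A new reading arrives from truck-1 at site-a.
    print(context_for(("site-a", "truck-1", 101.0), HISTORY))
```

A condition-based monitoring alert could then be raised when any of these deviations exceeds a threshold agreed with the maintenance team; the threshold itself is a business decision and is left out of the sketch.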
Page 17
Logistics Use case
• Big data
  - Capture all relevant ERP data (inventory, purchasing, …)
  - Capture RFID data
  - Capture vendor data (stock on hand, estimated duration to delivery, …)
• Real-time
  - A stock-out can create alerts leveraging a Service Oriented Architecture
With integrated data one has the ability to navigate and analyze
“CONTEXT” in relation to the real-time event and make decisions.
“CONTEXT” = comparison data (view by site, region, global); trends
(history + planned); dependencies (maintenance orders, purchase
orders, shipments, reservations, …)
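The stock-out alert can be sketched in a few lines of Python. The record fields and the function `stock_out_alerts` are hypothetical; in practice the quantities would come from the integrated ERP, RFID and vendor data described above, and the alert would be published through the service layer rather than returned as a string.

```python
from dataclasses import dataclass


@dataclass
class StockItem:
    part: str
    site: str
    on_hand: int           # current inventory (for example, from ERP or RFID counts)
    reserved: int          # quantity reserved by open maintenance orders
    vendor_lead_days: int  # vendor's estimated duration to delivery


def stock_out_alerts(items):
    """Return one alert message per item whose reservations exceed stock on hand."""
    alerts = []
    for item in items:
        shortfall = item.reserved - item.on_hand
        if shortfall > 0:
            alerts.append(
                f"{item.site}: {item.part} short by {shortfall}; "
                f"vendor lead time {item.vendor_lead_days} days"
            )
    return alerts


if __name__ == "__main__":
    inventory = [
        StockItem("tire", "site-a", on_hand=2, reserved=4, vendor_lead_days=14),
        StockItem("filter", "site-a", on_hand=10, reserved=3, vendor_lead_days=5),
    ]
    for message in stock_out_alerts(inventory):
        print(message)
```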
Page 18
Social responsibility Use case
• Topsy is a company based in San Francisco that provides
analyses of Twitter postings (tweets).
• There is also Social Relationship Management Software-as-a-Service
(SaaS) technology that allows marketers to publish to and
engage fans on social networks and customize a brand's look,
feel and message in an easy-to-use, scalable and efficient
way.
• The screen captures that follow show some examples of what is
possible in this space.
Pages 19-20
[Screen captures for the social responsibility use case; images not reproduced.]
Page 21
Summary
• Big Data projects will fail if they are driven by technology. Companies that
have implemented Big Data successfully start with a strong business
case; Big Data technology just happens to be what answers the
business questions. This has to be a business-driven initiative.
• Find your Data Scientists: business resources who can understand
data mining techniques and interpret predictive analyses.
• Enhance technical skills: IT resources who can develop with Big Data
technologies (Hadoop, Aster, MapReduce, …).
• Search for SaaS solutions where possible. It is extremely expensive to pioneer
new solutions and keep up with the fast pace of change in this still-emerging
area.
• Align with your local universities to sponsor and support Master's and PhD programs
that will benefit both parties.