SlideShare uma empresa Scribd logo
1 de 11
BIG DATA & DATA SCIENCE
START-UP FOCUS POINTS
+ BUSINESS AND TECHNOLOGY
REFERENCE ARCHITECTURE
@TomZorde
I HAVE AN IDEA FOR A DATA SCIENCE START-UP
• Use these slides to focus conversation
• What stage are you at?
• What is the problem you’re trying to solve?
• What type of business model would work?
• Tools? – A rapidly evolving space.
• Reference Architecture helps identify what level of the stack
we’re talking about.
AREAS OF EARLY FOCUS
SEED STAGE - Research & Development
1. Research & Define Concept, business model, internal & sourced capabilities
2. Define customer value proposition and identify target market
ANGEL – Business Planning & Product Development
1. Identify services and products required and evaluate gaps for go-to-market readiness
2. Source funding partner to build minimum viable product and get commitment for round 2 funding
3. Assemble team and build MVP prototype exceeding expectations
ROUND 1/ SERIES A FUNDING – Commercially operational
ROUND 2 / SERIES B FUNDING – Fully Operational
ROUND 3 / SERIES C FUNDING – Expansion
IPO/ ACQUISITION
BUSINESS PLANNING & DEVELOPMENT - LOGICAL STEPS
1. Full business needs and information requirements
analysis. Business Drivers
• Revenue generation? Cost reduction? Customer
retention? Compliance?
• Process Improvement? Fraud detection?
Analytics? Dashboard?
• Solving a tough problem? Retiring/replacing
assets, technologies and systems?
2. Technology Evaluation and Selection
• Define requirements and objective first
• Evaluation a variety of technology stacks –
develop a framework first
3. Board Support for Start-up Resources
4. Prototyping, Discovery, and Planning
• Rent Infrastructure in Cloud – VMWare, AWS, MS
Azure and others
• Use Spare Hardware and Network Bandwidth
• Assessment, Proposal. Project/Program Plan for
next steps
• Start small and keep delivering
5. Architecture Design, Estimation, Business Case
6. Obtain funding and executive sponsorships,
owners, etc.
7. SDLC, don’t forget Hardware, Security, Testing,
Data governance etc.
FORESEEABLE CHALLENGES
Business urgency, time to market pressures
• Big Data /Data Science start up needs careful planning
• Big Data needs infrastructure, software stacks, people, start up plan
Lack of Big Data Resources, Lack of Sponsorships (except in some companies)
• Big Data is complex and multiple skill sets (mostly new to many companies) – Infrastructure, Administration,
Security, Programming, Testing, etc.
• Skepticism about Big Data
Integration with Existing Technologies and Systems
• Can not develop isolated big data solutions
• Integration with existing systems will be a top challenge (requires both sides to do additional work)
Open Sources: Stability, Maturity, and Security
INFORMATION AS A PRODUCT/SERVICE
TYPES OF RELEVANT BUSINESS MODELS
Differentiation
New Services
Customers Experience
Contextual Relevance
Brokering
Raw Data
Benchmarking
Analysis and Insight
(Meta Data)
Delivery
Market Place
Facilitator
Advertising
REFERENCE ARCHITECTURE
Decisions & Insight
Analytics & Discovery
Data Access and Distribution
Data Collection& Organisation
Infrastructure Platform
Monitoring,Alerts,Tools,
Security,Governance
• The technology stack is rapidly evolving with all traditional as well as new vendors providing offerings
• Open source tools remain at the foundation layers.
• Different use cases will require different technology tools.
REFERENCE ARCHITECTURE
Decisions & Insight
• IBM Watson
• Industry Specific
Analytics & Discovery
• SAP Business Objects
• IBM Cognos
• SAS Analytics
• Dell Statistica
• Oracle Hyperion
• Microsoft BI
• KNIME
• Pentaho
• Informatica
REFERENCE ARCHITECTURE
Data Access and Distribution
• Document: MongoDB, CouchDB
• Graph: Neo4j, Titan
• Key Value Pair: Riak, Redis
• Columnar: Cassandra, Hbase
• Search: Lucene, Solr, ElasticSearch
Monitoring, Alerts, Tools, Security, Governance:
• Hadoop:Apache, CloudEra, Hortonworks,
MapR, IBM
• SQL Mapping: Hive
• Big Data Transformation: Pig
• Hadoop Load: Sqoop
• Realtime-ETL: Storm
• Cluster Computing: Apache Spark
• Languages: Python, Java, R, Scala
REFERENCE ARCHITECTURE
Data Collection& Organisation (Batch & Real-Time)
• Hadoop
• Hadoop Map Reduce
• Mahout
Infrastructure Platform
• AWS
• Azure
• Mortar
• Google BigQuery
• Qubole
• Dell
• HP
• IBM
BIG DATA & DATA SCIENCE
START-UP FOCUS POINTS
@TomZorde
Thank you

Mais conteúdo relacionado

Mais procurados

FEASIBILITY ANALYSIS
FEASIBILITY ANALYSISFEASIBILITY ANALYSIS
FEASIBILITY ANALYSISSowmya M
 
Accountability – Managing the Risks of Innovation Procurement
Accountability – Managing the Risks of Innovation ProcurementAccountability – Managing the Risks of Innovation Procurement
Accountability – Managing the Risks of Innovation Procurementlisaabe
 
Carl Souchereau, SNC Lavalin T&D: Both Sides of the Fence
Carl Souchereau, SNC Lavalin T&D: Both Sides of the FenceCarl Souchereau, SNC Lavalin T&D: Both Sides of the Fence
Carl Souchereau, SNC Lavalin T&D: Both Sides of the FenceUMT
 

Mais procurados (7)

FEASIBILITY ANALYSIS
FEASIBILITY ANALYSISFEASIBILITY ANALYSIS
FEASIBILITY ANALYSIS
 
Accountability – Managing the Risks of Innovation Procurement
Accountability – Managing the Risks of Innovation ProcurementAccountability – Managing the Risks of Innovation Procurement
Accountability – Managing the Risks of Innovation Procurement
 
Research analyst
Research analystResearch analyst
Research analyst
 
SPI IQ for Retailers
SPI IQ for RetailersSPI IQ for Retailers
SPI IQ for Retailers
 
Carl Souchereau, SNC Lavalin T&D: Both Sides of the Fence
Carl Souchereau, SNC Lavalin T&D: Both Sides of the FenceCarl Souchereau, SNC Lavalin T&D: Both Sides of the Fence
Carl Souchereau, SNC Lavalin T&D: Both Sides of the Fence
 
Agile BI success factors
Agile BI success factorsAgile BI success factors
Agile BI success factors
 
Intranet planning and design
Intranet planning and design Intranet planning and design
Intranet planning and design
 

Destaque

Startup - Big Data - Data Science
Startup - Big Data - Data ScienceStartup - Big Data - Data Science
Startup - Big Data - Data ScienceTeguh Nugraha
 
Development Process for Micro Projects
Development Process for Micro ProjectsDevelopment Process for Micro Projects
Development Process for Micro ProjectsMartin Verrev
 
Data science unit introduction
Data science unit introductionData science unit introduction
Data science unit introductionGregg Barrett
 
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...Dr. Raju M. Mathew
 
Towards Neuro–Information Science
Towards Neuro–Information ScienceTowards Neuro–Information Science
Towards Neuro–Information Sciencejacekg
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesJeff Kelly
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataTERN Australia
 
Semiotics and Information Science
Semiotics and Information ScienceSemiotics and Information Science
Semiotics and Information ScienceFlorence Paisey
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Perficient, Inc.
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystemSlideCentral
 
Real time data services
Real time data servicesReal time data services
Real time data servicesRelevate
 
Real Time Big Data
Real Time Big DataReal Time Big Data
Real Time Big DataInfoFarm
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystemmagda3695
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data EcosystemIvo Vachkov
 
Earley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation RoadmapEarley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation RoadmapEarley Information Science
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Caserta
 
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaningJ.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaningJosé Nafría
 
Conceptions of information science
Conceptions of information scienceConceptions of information science
Conceptions of information scienceJorge Prado
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceJian Qin
 

Destaque (20)

Startup - Big Data - Data Science
Startup - Big Data - Data ScienceStartup - Big Data - Data Science
Startup - Big Data - Data Science
 
Development Process for Micro Projects
Development Process for Micro ProjectsDevelopment Process for Micro Projects
Development Process for Micro Projects
 
Data science unit introduction
Data science unit introductionData science unit introduction
Data science unit introduction
 
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
 
Towards Neuro–Information Science
Towards Neuro–Information ScienceTowards Neuro–Information Science
Towards Neuro–Information Science
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use cases
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem Data
 
Semiotics and Information Science
Semiotics and Information ScienceSemiotics and Information Science
Semiotics and Information Science
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
 
Real time data services
Real time data servicesReal time data services
Real time data services
 
Real Time Big Data
Real Time Big DataReal Time Big Data
Real Time Big Data
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
 
Earley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation RoadmapEarley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation Roadmap
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaningJ.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
 
Conceptions of information science
Conceptions of information scienceConceptions of information science
Conceptions of information science
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information Science
 

Semelhante a Data Science Start-up Focus Points

Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyPerficient, Inc.
 
IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...
IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...
IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...Dr. Haxel Consult
 
Best practice for_agile_ds_projects
Best practice for_agile_ds_projectsBest practice for_agile_ds_projects
Best practice for_agile_ds_projectsKhalid Kahloot
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Domino Data Lab
 
Leverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationLeverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationGlorium Tech
 
Nem360 2017 setting technology trends into the strategic context v200
Nem360 2017 setting technology trends into the strategic context v200Nem360 2017 setting technology trends into the strategic context v200
Nem360 2017 setting technology trends into the strategic context v200Markku Rehberger
 
Technology Consulting by Prasanna
Technology Consulting by PrasannaTechnology Consulting by Prasanna
Technology Consulting by PrasannaSupportGCI
 
Warehouse components
Warehouse componentsWarehouse components
Warehouse componentsganblues
 
CareerInsightsSteveLacki.General.2016
CareerInsightsSteveLacki.General.2016CareerInsightsSteveLacki.General.2016
CareerInsightsSteveLacki.General.2016SteveLacki
 
Building a Complete View Across the Customer Experience on Oracle BICS
Building a Complete View Across the Customer Experience on Oracle BICSBuilding a Complete View Across the Customer Experience on Oracle BICS
Building a Complete View Across the Customer Experience on Oracle BICSShiv Bharti
 
Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...
Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...
Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...Neo Group Inc
 
Innovative Data Leveraging for Procurement Analytics
Innovative Data Leveraging for Procurement AnalyticsInnovative Data Leveraging for Procurement Analytics
Innovative Data Leveraging for Procurement AnalyticsTejari
 
Eureka Data Science Analytic Process
Eureka Data Science Analytic ProcessEureka Data Science Analytic Process
Eureka Data Science Analytic ProcessAllen Nugent
 
Strategy Basecamp's IT Diagnostic - Six Steps to Improving Your Technology
Strategy Basecamp's IT Diagnostic - Six Steps to Improving Your TechnologyStrategy Basecamp's IT Diagnostic - Six Steps to Improving Your Technology
Strategy Basecamp's IT Diagnostic - Six Steps to Improving Your TechnologyPaul Osterberg
 
Career Conversation Technology Consulting
Career Conversation Technology ConsultingCareer Conversation Technology Consulting
Career Conversation Technology ConsultingSupportGCI
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?SAS Canada
 
Value of data in digital transformation
Value of data in digital transformationValue of data in digital transformation
Value of data in digital transformationLoihde Advisory
 

Semelhante a Data Science Start-up Focus Points (20)

Big data@work
Big data@workBig data@work
Big data@work
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data Strategy
 
IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...
IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...
IC-SDV 2018: Srin Achanta (SciTech Patent Art) Global Competitive Technology ...
 
Best practice for_agile_ds_projects
Best practice for_agile_ds_projectsBest practice for_agile_ds_projects
Best practice for_agile_ds_projects
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
Leverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for InnovationLeverage Data Strategy as a Catalyst for Innovation
Leverage Data Strategy as a Catalyst for Innovation
 
Engineering Global Content Planning - Pam Didner
Engineering Global Content Planning - Pam DidnerEngineering Global Content Planning - Pam Didner
Engineering Global Content Planning - Pam Didner
 
Nem360 2017 setting technology trends into the strategic context v200
Nem360 2017 setting technology trends into the strategic context v200Nem360 2017 setting technology trends into the strategic context v200
Nem360 2017 setting technology trends into the strategic context v200
 
Technology Consulting by Prasanna
Technology Consulting by PrasannaTechnology Consulting by Prasanna
Technology Consulting by Prasanna
 
Warehouse components
Warehouse componentsWarehouse components
Warehouse components
 
CareerInsightsSteveLacki.General.2016
CareerInsightsSteveLacki.General.2016CareerInsightsSteveLacki.General.2016
CareerInsightsSteveLacki.General.2016
 
Building a Complete View Across the Customer Experience on Oracle BICS
Building a Complete View Across the Customer Experience on Oracle BICSBuilding a Complete View Across the Customer Experience on Oracle BICS
Building a Complete View Across the Customer Experience on Oracle BICS
 
Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...
Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...
Optimizing Your Outsourcing Portfolio – Deciding What to Source: Core vs. Con...
 
Innovative Data Leveraging for Procurement Analytics
Innovative Data Leveraging for Procurement AnalyticsInnovative Data Leveraging for Procurement Analytics
Innovative Data Leveraging for Procurement Analytics
 
Eureka Data Science Analytic Process
Eureka Data Science Analytic ProcessEureka Data Science Analytic Process
Eureka Data Science Analytic Process
 
Transformação Digital de TI com EA
Transformação Digital de TI com EATransformação Digital de TI com EA
Transformação Digital de TI com EA
 
Strategy Basecamp's IT Diagnostic - Six Steps to Improving Your Technology
Strategy Basecamp's IT Diagnostic - Six Steps to Improving Your TechnologyStrategy Basecamp's IT Diagnostic - Six Steps to Improving Your Technology
Strategy Basecamp's IT Diagnostic - Six Steps to Improving Your Technology
 
Career Conversation Technology Consulting
Career Conversation Technology ConsultingCareer Conversation Technology Consulting
Career Conversation Technology Consulting
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?
 
Value of data in digital transformation
Value of data in digital transformationValue of data in digital transformation
Value of data in digital transformation
 

Mais de Tom Zorde

Internet of Everything Perth Community Update April 2018
Internet of Everything Perth Community Update April 2018Internet of Everything Perth Community Update April 2018
Internet of Everything Perth Community Update April 2018Tom Zorde
 
IoE Perth - Global update - Dec 5th 2016
IoE Perth - Global update - Dec 5th 2016IoE Perth - Global update - Dec 5th 2016
IoE Perth - Global update - Dec 5th 2016Tom Zorde
 
Building a Roadmap for Digital Transformation
Building a Roadmap for Digital TransformationBuilding a Roadmap for Digital Transformation
Building a Roadmap for Digital TransformationTom Zorde
 
Internet of Things Technology Points for discussion
Internet of Things Technology Points for discussionInternet of Things Technology Points for discussion
Internet of Things Technology Points for discussionTom Zorde
 
Interent of Things (IoT) & Data Science Contextual Reference Models
Interent of Things (IoT) & Data Science Contextual Reference ModelsInterent of Things (IoT) & Data Science Contextual Reference Models
Interent of Things (IoT) & Data Science Contextual Reference ModelsTom Zorde
 
Why Business Architecture for Internet of Things
Why Business Architecture for Internet of ThingsWhy Business Architecture for Internet of Things
Why Business Architecture for Internet of ThingsTom Zorde
 

Mais de Tom Zorde (6)

Internet of Everything Perth Community Update April 2018
Internet of Everything Perth Community Update April 2018Internet of Everything Perth Community Update April 2018
Internet of Everything Perth Community Update April 2018
 
IoE Perth - Global update - Dec 5th 2016
IoE Perth - Global update - Dec 5th 2016IoE Perth - Global update - Dec 5th 2016
IoE Perth - Global update - Dec 5th 2016
 
Building a Roadmap for Digital Transformation
Building a Roadmap for Digital TransformationBuilding a Roadmap for Digital Transformation
Building a Roadmap for Digital Transformation
 
Internet of Things Technology Points for discussion
Internet of Things Technology Points for discussionInternet of Things Technology Points for discussion
Internet of Things Technology Points for discussion
 
Interent of Things (IoT) & Data Science Contextual Reference Models
Interent of Things (IoT) & Data Science Contextual Reference ModelsInterent of Things (IoT) & Data Science Contextual Reference Models
Interent of Things (IoT) & Data Science Contextual Reference Models
 
Why Business Architecture for Internet of Things
Why Business Architecture for Internet of ThingsWhy Business Architecture for Internet of Things
Why Business Architecture for Internet of Things
 

Último

Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailAriel592675
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCRashishs7044
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfJos Voskuil
 
Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.Anamaria Contreras
 
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...ShrutiBose4
 
Organizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessOrganizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessSeta Wicaksana
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchirictsugar
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCRashishs7044
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024Matteo Carbone
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...ictsugar
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfrichard876048
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Servicecallgirls2057
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdfKhaled Al Awadi
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03DallasHaselhorst
 

Último (20)

Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detail
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdf
 
Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.Traction part 2 - EOS Model JAX Bridges.
Traction part 2 - EOS Model JAX Bridges.
 
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
 
Organizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessOrganizational Structure Running A Successful Business
Organizational Structure Running A Successful Business
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchir
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdf
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03
 

Data Science Start-up Focus Points

  • 1. BIG DATA & DATA SCIENCE START-UP FOCUS POINTS + BUSINESS AND TECHNOLOGY REFERENCE ARCHITECTURE @TomZorde
  • 2. I HAVE AN IDEA FOR A DATA SCIENCE START-UP • Use these slides to focus conversation • What stage are you at? • What is the problem you’re trying to solve? • What type of business model would work? • Tools? – A rapidly evolving space. • Reference Architecture helps identify what level of the stack we’re talking about.
  • 3. AREAS OF EARLY FOCUS SEED STAGE - Research & Development 1. Research & Define Concept, business model, internal & sourced capabilities 2. Define customer value proposition and identify target market ANGEL – Business Planning & Product Development 1. Identify services and products required and evaluate gaps for go-to-market readiness 2. Source funding partner to build minimum viable product and get commitment for round 2 funding 3. Assemble team and build MVP prototype exceeding expectations ROUND 1/ SERIES A FUNDING – Commercially operational ROUND 2 / SERIES B FUNDING – Fully Operational ROUND 3 / SERIES C FUNDING – Expansion IPO/ ACQUISITION
  • 4. BUSINESS PLANNING & DEVELOPMENT - LOGICAL STEPS 1. Full business needs and information requirements analysis. Business Drivers • Revenue generation? Cost reduction? Customer retention? Compliance? • Process Improvement? Fraud detection? Analytics? Dashboard? • Solving a tough problem? Retiring/replacing assets, technologies and systems? 2. Technology Evaluation and Selection • Define requirements and objective first • Evaluation a variety of technology stacks – develop a framework first 3. Board Support for Start-up Resources 4. Prototyping, Discovery, and Planning • Rent Infrastructure in Cloud – VMWare, AWS, MS Azure and others • Use Spare Hardware and Network Bandwidth • Assessment, Proposal. Project/Program Plan for next steps • Start small and keep delivering 5. Architecture Design, Estimation, Business Case 6. Obtain funding and executive sponsorships, owners, etc. 7. SDLC, don’t forget Hardware, Security, Testing, Data governance etc.
  • 5. FORESEEABLE CHALLENGES Business urgency, time to market pressures • Big Data /Data Science start up needs careful planning • Big Data needs infrastructure, software stacks, people, start up plan Lack of Big Data Resources, Lack of Sponsorships (except in some companies) • Big Data is complex and multiple skill sets (mostly new to many companies) – Infrastructure, Administration, Security, Programming, Testing, etc. • Skepticism about Big Data Integration with Existing Technologies and Systems • Can not develop isolated big data solutions • Integration with existing systems will be a top challenge (requires both sides to do additional work) Open Sources: Stability, Maturity, and Security
  • 6. INFORMATION AS A PRODUCT/SERVICE TYPES OF RELEVANT BUSINESS MODELS Differentiation New Services Customers Experience Contextual Relevance Brokering Raw Data Benchmarking Analysis and Insight (Meta Data) Delivery Market Place Facilitator Advertising
  • 7. REFERENCE ARCHITECTURE Decisions & Insight Analytics & Discovery Data Access and Distribution Data Collection& Organisation Infrastructure Platform Monitoring,Alerts,Tools, Security,Governance • The technology stack is rapidly evolving with all traditional as well as new vendors providing offerings • Open source tools remain at the foundation layers. • Different use cases will require different technology tools.
  • 8. REFERENCE ARCHITECTURE Decisions & Insight • IBM Watson • Industry Specific Analytics & Discovery • SAP Business Objects • IBM Cognos • SAS Analytics • Dell Statistica • Oracle Hyperion • Microsoft BI • KNIME • Pentaho • Informatica
  • 9. REFERENCE ARCHITECTURE Data Access and Distribution • Document: MongoDB, CouchDB • Graph: Neo4j, Titan • Key Value Pair: Riak, Redis • Columnar: Cassandra, Hbase • Search: Lucene, Solr, ElasticSearch Monitoring, Alerts, Tools, Security, Governance: • Hadoop:Apache, CloudEra, Hortonworks, MapR, IBM • SQL Mapping: Hive • Big Data Transformation: Pig • Hadoop Load: Sqoop • Realtime-ETL: Storm • Cluster Computing: Apache Spark • Languages: Python, Java, R, Scala
  • 10. REFERENCE ARCHITECTURE Data Collection& Organisation (Batch & Real-Time) • Hadoop • Hadoop Map Reduce • Mahout Infrastructure Platform • AWS • Azure • Mortar • Google BigQuery • Qubole • Dell • HP • IBM
  • 11. BIG DATA & DATA SCIENCE START-UP FOCUS POINTS @TomZorde Thank you