SlideShare uma empresa Scribd logo
1 de 27
1
But how do I GET the
data?
Transparency Camp 2014
Shooju is a Web-Based Data Platform
2
• Consolidate your internal and external data sources
• Make all data searchable from one place
• Provide continuous updating
• Seamlessly integrate with tools and applications
• Share data across your entire organization
• Save time and energy while reducing errors and
problems with version control
Shooju saves time, improves data quality and enhances
data sharing across your entire organization
The Analytical Process
3
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
The Analytical Process
4
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
The Analytical Process
5
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
The Analytical Process
6
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
your product
The Analytical Process
7
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
your product
The Fun Part 
The Analytical Process
8
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
your product
The Not Fun Part 
Big data vs. small data
9
A boring 2 x 2
10
The harsh 80/20 reality
11
Most organizations spend more time collecting,
cleaning, downloading, managing and
wrangling data than they do conducting analysis
Three ways to get data
• API
– Good
– Bad
• Scraping
• Manual
12
Defined as ETL (Extract,
Transform, Load) process
Method comparison
13
TechnicalExpertiserequired
Time (and annoyance)
Manual
Scraping
API
14
Average cost curve of data collection
Manual Collection
AverageCost
Number of times data is collected
15
Average cost curve of data collection
Manual Collection
AverageCost
Number of times data is collected
Scraping
16
Average cost curve of data collection
Manual Collection
AverageCost
Number of times data is collected
Scraping
API
How do you get your data?
What do you like?
What don’t you like?
17
Once the data is scraped, where can it go?
• CSV
• XLS
• DBF
• SQL
• NoSQL
• Many others
18
Where does your data go when you collect it?
19
1 Appendix
Shooju Value Added
Cost Savings
By saving analyst time and energy, Shooju allows analysts to do more with less,
reducing data management costs and putting more focus on high-value analysis.
Added Quality
Automating data processes internally will ensure that your data is accurate, up-to-date
and consistent across your entire organization.
Enhanced Decision Making
Having more accurate data available faster with more analyst time left for analysis
leads to enhanced decision making.
21
Cost
Savings
Added
Quality
Enhanced
Decision
Making
Shooju
Value
Added
22
Shooju
Sources
Excel
Add-In
& Other Tools
Custom
BI Apps
Web
Search
Auto-
Import
Drivers
# of analysts retrieving
time saved in retrieval
# of sources
frequency of retrieval
# of analysts refreshing
time saved in tool refresh
# of sources
frequency of refresh
time to integrate data
analysts contributing data
# of tools created
analyst upload time
# of analysts searching
time saved in search
# of sources
frequency of search
5 analysts
65 min / source
22 sources
18 times / year
11 analysts
74 min / source
22 sources
14 times / year
9 min / source
22 sources
32 times / year
$97k
(14%)
$73k
(10%)
$248k
(35%)
$702kTotal:
Cost Savings
13 analysts
14 wk of dev. saved
8 analysts contributing
2 apps created
$284k
(41%)
40 min 10 times / year
Sample Cost Savings
Cost Savings Added Quality Enhanced Decision MakingShooju Value Added
* Based on real 40-person
organization. Assumed
annual wages vary
between $30k and $140k.
$410k
savings
equivalent to
10% of HR
spend*
Shooju speeds up
custom BI application
development by making
all data natively
accessible and
continuously updated in
any BI tool or custom
app.
USD (%)
Added Quality: The Three “Cs”
23
Cost Savings Added QualityShooju Value Added
Consistency
Shooju ensures that all analysts are using the same data
across all their tools and applications. By allowing
analysts to upload their own data to the platform, internal
data as well as external data now flows seamlessly -
without messy spreadsheet links.
Currency
By automatically pulling in the latest source data through the
Shooju importer layer, Shooju ensures that all of your
spreadsheets and models are populated with the latest data.
Our native plugins for Excel, Access and all your other tools
allow data to flow through directly without any need for the
analyst to download or copy and paste.
Correctness
The more data is touched by human hands, the more prone it is to errors. By streamlining
workflows and automating work processes, Shooju eliminates most of these errors, saving
time and ensuring that the data you rely on is more accurate.
Enhanced Decision Making
We support any data source
24
Ask us about non-mainstream data
sources that traditional data providers
don’t carry.
Shooju Data Process
25
Shooju vs. Custom Data Warehouse
Custom Data
Warehouse Shooju
Design Custom “Plug-and-play”
Cost 7+ digits 5-6 digits
Rollout timeline Months / Years Hours
Scalability Minimal Infinite
Flexibility Low High
Maintenance High Low
Stakeholders IT controlled Analyst run / IT maintained
Tool and app support Clunky, requiring IT Native tool support
26
Data warehouse projects are costly, time consuming and
result in inflexible systems with low adoption rates
Shooju vs. Off-the-shelf Data Management*
Off-the-shelf
Data Management* Shooju
Service focus Data provision/management Process improvement
Prepackaged data feeds Many None
Custom data feeds None (not natively supported) Included(all feeds are custom)
Internal data integration Weeks (high consulting fees) Days (included in service)
Process flexibility Low High
Analyst learning curve Weeks Hours
Ease of migrating off Very difficult/impossible Easy
Annual fee 6-7 digits 5-6 digits
27
Data management* solutions focus on generic data
provision rather than process improvement and limit
analysts to a closed and inflexible data ecosystem.
* Top-ranked providers in the EnergyRisk Data Management category include: Morningstar, ZE Power Group, SunGard, Allegro, Pioneer
Solutions, SAS, and InteractiveData. See http://www.slideshare.net/Allegrodev/energy-risk-magazines-etrm-software-rankings-2013

Mais conteúdo relacionado

Mais procurados

Big Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with HadoopBig Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with Hadoop
DataWorks Summit
 
2015-11-13 Data for Administrative Professionals
2015-11-13  Data for Administrative Professionals2015-11-13  Data for Administrative Professionals
2015-11-13 Data for Administrative Professionals
Tara E. Browne, DTM
 

Mais procurados (19)

Empowering Customers with Personalized Insights
Empowering Customers with Personalized InsightsEmpowering Customers with Personalized Insights
Empowering Customers with Personalized Insights
 
The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
The Benefits of Predictive and Proactive Support for an Enterprise Data HubThe Benefits of Predictive and Proactive Support for an Enterprise Data Hub
The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
 
Data Migrations powered by GalenETL
Data Migrations powered by GalenETLData Migrations powered by GalenETL
Data Migrations powered by GalenETL
 
Quelles nouveautés avec la version 6.5 de Splunk Enterprise
Quelles nouveautés avec la version 6.5 de Splunk EnterpriseQuelles nouveautés avec la version 6.5 de Splunk Enterprise
Quelles nouveautés avec la version 6.5 de Splunk Enterprise
 
Big Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with HadoopBig Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with Hadoop
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
 
ODSC data science to DataOps
ODSC data science to DataOpsODSC data science to DataOps
ODSC data science to DataOps
 
Tableau - Make your SEO data work for you!
Tableau - Make your SEO data work for you!Tableau - Make your SEO data work for you!
Tableau - Make your SEO data work for you!
 
Should You Invest In DataOps Services?
Should You Invest In DataOps Services?Should You Invest In DataOps Services?
Should You Invest In DataOps Services?
 
Florida's Natural Growers BI case study
Florida's Natural Growers BI case studyFlorida's Natural Growers BI case study
Florida's Natural Growers BI case study
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
 
Architectural Health Check for Postgres
Architectural Health Check for PostgresArchitectural Health Check for Postgres
Architectural Health Check for Postgres
 
2015-11-13 Data for Administrative Professionals
2015-11-13  Data for Administrative Professionals2015-11-13  Data for Administrative Professionals
2015-11-13 Data for Administrative Professionals
 
925 plenary rexer_using our laptop
925 plenary rexer_using our laptop925 plenary rexer_using our laptop
925 plenary rexer_using our laptop
 
Taking Splunk to the Next Level - Management Breakout Session
Taking Splunk to the Next Level - Management Breakout SessionTaking Splunk to the Next Level - Management Breakout Session
Taking Splunk to the Next Level - Management Breakout Session
 
Why Data Science Projects Fail?
Why Data Science Projects Fail?Why Data Science Projects Fail?
Why Data Science Projects Fail?
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
Everything you wanted to know about data ops
Everything you wanted to know about data opsEverything you wanted to know about data ops
Everything you wanted to know about data ops
 

Destaque

A microservice approach for legacy modernisation
A microservice approach for legacy modernisationA microservice approach for legacy modernisation
A microservice approach for legacy modernisation
luisw19
 

Destaque (20)

OBIEE 11.1.1.7: Upgrade y Nuevas Características
OBIEE 11.1.1.7: Upgrade y Nuevas CaracterísticasOBIEE 11.1.1.7: Upgrade y Nuevas Características
OBIEE 11.1.1.7: Upgrade y Nuevas Características
 
Data modeling
Data modelingData modeling
Data modeling
 
Incredible ODI tips to work with Hyperion tools that you ever wanted to know
Incredible ODI tips to work with Hyperion tools that you ever wanted to knowIncredible ODI tips to work with Hyperion tools that you ever wanted to know
Incredible ODI tips to work with Hyperion tools that you ever wanted to know
 
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business AnalyticsOracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
 
Tableau Best Practices for OBIEE
Tableau Best Practices for OBIEETableau Best Practices for OBIEE
Tableau Best Practices for OBIEE
 
How to solve complex business requirements with Oracle Data Integrator?
How to solve complex business requirements with Oracle Data Integrator?How to solve complex business requirements with Oracle Data Integrator?
How to solve complex business requirements with Oracle Data Integrator?
 
Logical DB Design (OOP)
Logical DB Design (OOP)Logical DB Design (OOP)
Logical DB Design (OOP)
 
A microservice approach for legacy modernisation
A microservice approach for legacy modernisationA microservice approach for legacy modernisation
A microservice approach for legacy modernisation
 
Empowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Empowering Business Users: OBIEE 12c Visual Analyzer and Data MashupEmpowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Empowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
 
OUG Ireland Meet-up - Updates from Oracle Open World 2016
OUG Ireland Meet-up - Updates from Oracle Open World 2016OUG Ireland Meet-up - Updates from Oracle Open World 2016
OUG Ireland Meet-up - Updates from Oracle Open World 2016
 
Ranges, ranges everywhere (Oracle SQL)
Ranges, ranges everywhere (Oracle SQL)Ranges, ranges everywhere (Oracle SQL)
Ranges, ranges everywhere (Oracle SQL)
 
OUG Ireland Meet-up 12th January
OUG Ireland Meet-up 12th JanuaryOUG Ireland Meet-up 12th January
OUG Ireland Meet-up 12th January
 
Domain model
Domain modelDomain model
Domain model
 
How to read a data model
How to read a data modelHow to read a data model
How to read a data model
 
Row Pattern Matching in Oracle Database 12c
Row Pattern Matching in Oracle Database 12cRow Pattern Matching in Oracle Database 12c
Row Pattern Matching in Oracle Database 12c
 
Data modelling 101
Data modelling 101Data modelling 101
Data modelling 101
 
Data models
Data modelsData models
Data models
 
Dbms models
Dbms modelsDbms models
Dbms models
 
Data Modeling PPT
Data Modeling PPTData Modeling PPT
Data Modeling PPT
 
SQL: The one language to rule all your data
SQL: The one language to rule all your dataSQL: The one language to rule all your data
SQL: The one language to rule all your data
 

Semelhante a But how do I GET the data? Transparency Camp 2014

data_blending
data_blendingdata_blending
data_blending
subit1615
 
TB8568_8568_Presentation
TB8568_8568_PresentationTB8568_8568_Presentation
TB8568_8568_Presentation
Ronnie Falgout
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
Mark Constable
 

Semelhante a But how do I GET the data? Transparency Camp 2014 (20)

data_blending
data_blendingdata_blending
data_blending
 
DataOps , cbuswaw April '23
DataOps , cbuswaw April '23DataOps , cbuswaw April '23
DataOps , cbuswaw April '23
 
How Can You Implement DataOps In Your Existing Workflow?
How Can You Implement DataOps In Your Existing Workflow?How Can You Implement DataOps In Your Existing Workflow?
How Can You Implement DataOps In Your Existing Workflow?
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
 
Starting Your Modern DataOps Journey
Starting Your Modern DataOps JourneyStarting Your Modern DataOps Journey
Starting Your Modern DataOps Journey
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation Slides
 
Latest trends in Business Analytics
Latest trends in Business AnalyticsLatest trends in Business Analytics
Latest trends in Business Analytics
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 
Modernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your DataModernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your Data
 
Toad Business Intelligence Suite
Toad Business Intelligence Suite Toad Business Intelligence Suite
Toad Business Intelligence Suite
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
MTX Portland Office 365 Strategic Capabilities Sep2017
MTX Portland Office 365 Strategic Capabilities Sep2017MTX Portland Office 365 Strategic Capabilities Sep2017
MTX Portland Office 365 Strategic Capabilities Sep2017
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
 
TB8568_8568_Presentation
TB8568_8568_PresentationTB8568_8568_Presentation
TB8568_8568_Presentation
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
 
Big Data Analytics Architecture Powerpoint Presentation Slides
Big Data Analytics Architecture Powerpoint Presentation SlidesBig Data Analytics Architecture Powerpoint Presentation Slides
Big Data Analytics Architecture Powerpoint Presentation Slides
 
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
 
Intro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent SoftwareIntro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent Software
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Último (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

But how do I GET the data? Transparency Camp 2014

  • 1. 1 But how do I GET the data? Transparency Camp 2014
  • 2. Shooju is a Web-Based Data Platform 2 • Consolidate your internal and external data sources • Make all data searchable from one place • Provide continuous updating • Seamlessly integrate with tools and applications • Share data across your entire organization • Save time and energy while reducing errors and problems with version control Shooju saves time, improves data quality and enhances data sharing across your entire organization
  • 3. The Analytical Process 3 Data Data Data Data Data Data Data Data Data Data Data Data
  • 4. The Analytical Process 4 Data Data Data Data Data Data Data Data Data Data Data Data some place
  • 5. The Analytical Process 5 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice
  • 6. The Analytical Process 6 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice your product
  • 7. The Analytical Process 7 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice your product The Fun Part 
  • 8. The Analytical Process 8 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice your product The Not Fun Part 
  • 9. Big data vs. small data 9
  • 10. A boring 2 x 2 10
  • 11. The harsh 80/20 reality 11 Most organizations spend more time collecting, cleaning, downloading, managing and wrangling data than they do conducting analysis
  • 12. Three ways to get data • API – Good – Bad • Scraping • Manual 12 Defined as ETL (Extract, Transform, Load) process
  • 14. 14 Average cost curve of data collection Manual Collection AverageCost Number of times data is collected
  • 15. 15 Average cost curve of data collection Manual Collection AverageCost Number of times data is collected Scraping
  • 16. 16 Average cost curve of data collection Manual Collection AverageCost Number of times data is collected Scraping API
  • 17. How do you get your data? What do you like? What don’t you like? 17
  • 18. Once the data is scraped, where can it go? • CSV • XLS • DBF • SQL • NoSQL • Many others 18
  • 19. Where does your data go when you collect it? 19
  • 21. Shooju Value Added Cost Savings By saving analyst time and energy, Shooju allows analysts to do more with less, reducing data management costs and putting more focus on high-value analysis. Added Quality Automating data processes internally will ensure that your data is accurate, up-to-date and consistent across your entire organization. Enhanced Decision Making Having more accurate data available faster with more analyst time left for analysis leads to enhanced decision making. 21 Cost Savings Added Quality Enhanced Decision Making Shooju Value Added
  • 22. 22 Shooju Sources Excel Add-In & Other Tools Custom BI Apps Web Search Auto- Import Drivers # of analysts retrieving time saved in retrieval # of sources frequency of retrieval # of analysts refreshing time saved in tool refresh # of sources frequency of refresh time to integrate data analysts contributing data # of tools created analyst upload time # of analysts searching time saved in search # of sources frequency of search 5 analysts 65 min / source 22 sources 18 times / year 11 analysts 74 min / source 22 sources 14 times / year 9 min / source 22 sources 32 times / year $97k (14%) $73k (10%) $248k (35%) $702kTotal: Cost Savings 13 analysts 14 wk of dev. saved 8 analysts contributing 2 apps created $284k (41%) 40 min 10 times / year Sample Cost Savings Cost Savings Added Quality Enhanced Decision MakingShooju Value Added * Based on real 40-person organization. Assumed annual wages vary between $30k and $140k. $410k savings equivalent to 10% of HR spend* Shooju speeds up custom BI application development by making all data natively accessible and continuously updated in any BI tool or custom app. USD (%)
  • 23. Added Quality: The Three “Cs” 23 Cost Savings Added QualityShooju Value Added Consistency Shooju ensures that all analysts are using the same data across all their tools and applications. By allowing analysts to upload their own data to the platform, internal data as well as external data now flows seamlessly - without messy spreadsheet links. Currency By automatically pulling in the latest source data through the Shooju importer layer, Shooju ensures that all of your spreadsheets and models are populated with the latest data. Our native plugins for Excel, Access and all your other tools allow data to flow through directly without any need for the analyst to download or copy and paste. Correctness The more data is touched by human hands, the more prone it is to errors. By streamlining workflows and automating work processes, Shooju eliminates most of these errors, saving time and ensuring that the data you rely on is more accurate. Enhanced Decision Making
  • 24. We support any data source 24 Ask us about non-mainstream data sources that traditional data providers don’t carry.
  • 26. Shooju vs. Custom Data Warehouse Custom Data Warehouse Shooju Design Custom “Plug-and-play” Cost 7+ digits 5-6 digits Rollout timeline Months / Years Hours Scalability Minimal Infinite Flexibility Low High Maintenance High Low Stakeholders IT controlled Analyst run / IT maintained Tool and app support Clunky, requiring IT Native tool support 26 Data warehouse projects are costly, time consuming and result in inflexible systems with low adoption rates
  • 27. Shooju vs. Off-the-shelf Data Management* Off-the-shelf Data Management* Shooju Service focus Data provision/management Process improvement Prepackaged data feeds Many None Custom data feeds None (not natively supported) Included(all feeds are custom) Internal data integration Weeks (high consulting fees) Days (included in service) Process flexibility Low High Analyst learning curve Weeks Hours Ease of migrating off Very difficult/impossible Easy Annual fee 6-7 digits 5-6 digits 27 Data management* solutions focus on generic data provision rather than process improvement and limit analysts to a closed and inflexible data ecosystem. * Top-ranked providers in the EnergyRisk Data Management category include: Morningstar, ZE Power Group, SunGard, Allegro, Pioneer Solutions, SAS, and InteractiveData. See http://www.slideshare.net/Allegrodev/energy-risk-magazines-etrm-software-rankings-2013

Notas do Editor

  1. ----- Meeting Notes (5/30/12 21:35) ----- hey there
  2. ----- Meeting Notes (5/30/12 21:35) ----- hey there