Agility for Big Data
My journey implementing an Agile method to Big Data applications
Who I am
What is the hardest part about bringing
agility to your big data applications?
“The more data you give the business,
the more questions they will ask”
Jose Carlos Eiras
Served as CIO at Kraft Foods, Philip Morris, General Motors and DHL
Reporting over Workable Software
• Problems experienced
• Customers don’t know what they want until they see it
• Very long feedback cycle because of waiting for quality data
• Developing workable software is much more expensive than generating a report manually
• Workable software without data to use is even more expensive
• Switching cost between tasks is high, but the switching cost between projects is even higher
• Releasing a feature to all users will bring in more questions, either because of data
quality or for other valid reasons
• Very low product success rate, lots of resources wasted and low team spirit
Reporting over Workable Software
• Solutions
• Focus on a very specific customer group and generate reports for them
• Collect data that targets a very specific customer group, like: parents in Box Hill area who
work in IT
• Manually generated reports
• Data quality easier to control over a small amount of data
• Deliver reports to end users in the most cost-effective way: e.g. face to face, email, or open
source BI tools
• Get feedback and test hypotheses
• Focus on a subset of data while discovering the value of existing data
• Apply new methodology to a subset of data in a much more effective way
• Data quality easier to control on a subset of data
• Focus on one customer and get feedback from the client
• Test hypotheses
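The idea of narrowing to one very specific customer group can be sketched in a few lines of Python. This is only an illustration: the `Customer` fields and the `select_target_group` helper are hypothetical names, and the sample records are made up to mirror the "parents in Box Hill area who work in IT" example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Customer:
    # Hypothetical fields, chosen only to match the slide's example group.
    customer_id: int
    suburb: str
    industry: str
    is_parent: bool


def select_target_group(customers, suburb, industry):
    """Return only the parents who live in `suburb` and work in `industry`."""
    return [
        c for c in customers
        if c.is_parent and c.suburb == suburb and c.industry == industry
    ]


# Made-up sample data for illustration only.
customers = [
    Customer(1, "Box Hill", "IT", True),
    Customer(2, "Box Hill", "Retail", True),
    Customer(3, "Richmond", "IT", True),
    Customer(4, "Box Hill", "IT", False),
]

target = select_target_group(customers, "Box Hill", "IT")
```

A subset this small can be checked by hand, which is what makes a manually generated report and quick data quality control practical.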
Reporting over Workable Software
• Solutions
• Data Freedom - Empowering people (example - data scientists exploring data values)
• Provide an SQL-like interface for users to easily access the data
• Provide a semantic schema so that users can easily find the right data
• Document your data if necessary to help other people understand, decipher and use data
• Provide easy-to-use report designers for accessing data, such as Pentaho or JasperReports
• Provide easy-to-use scheduling tools like Oozie, or general BI tools
• As a mindset, developers should support other people in freely exploring data in whatever ways they like
• In the scenario that data must be accessed through developers, those developers should think
about what stops other users from accessing data
• Add safeguards to prevent cluster overloading
• The overall result will be to increase the speed of feedback - dramatically
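One possible shape for such a safeguard is a thin wrapper that only lets read-only queries through and caps the number of rows returned. The sketch below uses Python's built-in `sqlite3` purely as a stand-in for whatever SQL-like engine the cluster exposes; `run_guarded` and `MAX_ROWS` are hypothetical names, and a row cap alone does not bound how much data a real cluster scans.

```python
import sqlite3

MAX_ROWS = 1000  # hypothetical default cap on rows returned to a user


def run_guarded(conn, sql, max_rows=MAX_ROWS):
    """Run a single SELECT statement and return at most `max_rows` rows."""
    statement = sql.strip().rstrip(";")
    if not statement.lower().startswith("select"):
        raise ValueError("only SELECT statements are allowed")
    if ";" in statement:
        # Crude check: this also rejects semicolons inside string literals.
        raise ValueError("multiple statements are not allowed")
    # Wrap the user query so the cap applies whatever the query contains.
    capped = f"SELECT * FROM ({statement}) LIMIT {int(max_rows)}"
    return conn.execute(capped).fetchall()


# In-memory demo table with made-up rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER, kind TEXT)")
conn.executemany(
    "INSERT INTO events VALUES (?, ?)", [(i, "click") for i in range(50)]
)

rows = run_guarded(conn, "SELECT id, kind FROM events", max_rows=10)
```

On a real cluster the same idea would more likely be enforced with the engine's own quotas, queues, or timeouts; the wrapper only shows where such a check could sit.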
Reporting over Workable Software
• More to try
• Automated data quality control
• Explore different ways for the customer service team to address data quality issues
• Sampling data for product discovery programs
• Explore ways to test a hypothesis in an even quicker manner – example: customer
centric data collection and reporting
• Explore a wider scale of data freedom through web service
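As a rough sketch of what automated data quality control could start with, the function below counts missing required values and duplicate records in a batch. The field names, the `quality_report` helper, and the sample rows are all assumptions made for illustration; real checks would depend on the data set.

```python
def quality_report(rows, required_fields):
    """Count missing required values and duplicate rows in a batch.

    A row counts as a duplicate when its required fields match an
    earlier row exactly.
    """
    missing = {field: 0 for field in required_fields}
    seen = set()
    duplicates = 0
    for row in rows:
        for field in required_fields:
            if row.get(field) in (None, ""):
                missing[field] += 1
        key = tuple(row.get(field) for field in required_fields)
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)
    return {"total": len(rows), "missing": missing, "duplicates": duplicates}


# Made-up sample batch for illustration only.
rows = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": ""},
    {"id": 1, "email": "a@example.com"},
    {"id": 3, "email": None},
]

report = quality_report(rows, ["id", "email"])
```

Running a report like this on every incoming batch gives the customer service team concrete numbers to act on instead of waiting for users to notice bad data.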
Continuous Delivery
• Continuous delivery, where to start?
• Problems: legacy systems, low unit test coverage, low functional/integration test
coverage, no acceptance testing, not enough testing data, and so on…
• Start with an easy problem so that it is achievable and will help to build team trust
• Must have – testing data and integration testing suites
Continuous Delivery
• Build pipeline: dev box → build → daily build server → alpha → beta → production
• Testing Data - you will never cover all scenarios, so what do you do? Use hybrid data
fixtures that combine manually produced data, generated data, and data from production
• Versioning Data
• Keep data as clean as code, refactor your data often
• Backward and forward compatibility
• Vertically slice stories, architecture and teams
• NoSQL database engines
• Start continuous delivery for some components NOW and learn from it
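Versioned data with backward and forward compatibility can be illustrated with a small record reader. This is a sketch under stated assumptions, not the approach used in the talk: the `version` field, the v1-to-v2 change (splitting `name` into two fields), and every function name here are hypothetical.

```python
CURRENT_VERSION = 2
KNOWN_FIELDS = {"version", "id", "first_name", "last_name"}


def upgrade_v1_to_v2(record):
    # Hypothetical schema change: v1 stored one "name", v2 splits it in two.
    first, _, last = record["name"].partition(" ")
    upgraded = {k: v for k, v in record.items() if k != "name"}
    upgraded.update({"first_name": first, "last_name": last, "version": 2})
    return upgraded


# One upgrade step per old version, applied in order.
UPGRADES = {1: upgrade_v1_to_v2}


def read_record(record):
    """Return a record this code can use, whatever version wrote it."""
    record = dict(record)
    version = record.get("version", 1)
    # Backward compatibility: step old records up to the current version.
    while version < CURRENT_VERSION:
        record = UPGRADES[version](record)
        version = record["version"]
    # Forward compatibility: ignore fields added by newer writers.
    return {k: v for k, v in record.items() if k in KNOWN_FIELDS}


old = read_record({"version": 1, "id": 7, "name": "Ada Lovelace"})
newer = read_record(
    {"version": 3, "id": 8, "first_name": "Alan",
     "last_name": "Turing", "nickname": "Prof"}
)
```

Keeping the upgrade steps in code next to the data version makes "refactor your data often" testable in the same pipeline as the application.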
Deployment != Release
• Separate deployment from release
• Tips
• Data batch toggles
• Feature toggles
• Customer/country/region releases
• Manually generated report area
• Don’t forget about “exclusive” toggles
• Leave release up to the production manager. They release and they organize
press releases.
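A minimal sketch of how deployed code can stay unreleased until a toggle is switched on for a given region. The `Toggles` class and the feature names are hypothetical, and "exclusive" is read here as "two toggles that must never be on in the same region", which is only one possible interpretation of the slide.

```python
class Toggles:
    """Per-region feature toggles with a check for mutually exclusive pairs."""

    def __init__(self, regions_by_feature, exclusive_pairs=()):
        self._regions = {
            name: set(regions) for name, regions in regions_by_feature.items()
        }
        # Refuse a configuration where two exclusive toggles share a region.
        for a, b in exclusive_pairs:
            overlap = self._regions.get(a, set()) & self._regions.get(b, set())
            if overlap:
                raise ValueError(
                    f"{a} and {b} are both on in {sorted(overlap)}"
                )

    def is_enabled(self, feature, region):
        # Unknown features are treated as off, so deployed code stays dark.
        return region in self._regions.get(feature, set())


# Hypothetical configuration: the new report is released in AU only.
toggles = Toggles(
    {"new_report": {"AU"}, "legacy_report": {"US", "NZ"}},
    exclusive_pairs=[("new_report", "legacy_report")],
)
```

With a structure like this, releasing to another country is a configuration change that the person responsible for the release can make without a new deployment.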
Q&A
What is the hardest part about bringing
agility to your big data applications?
My Personal Information
• LinkedIn Profile: http://au.linkedin.com/pub/charlie-cheng/24/92/978/
• Twitter: @charlie_cheng
Are you looking for some training and find it is hard to select the right one?
We are running a customer discovery program on it at StudyIsFun.
Please contact me at charlie750123@gmail.com if you are interested.
