SlideShare uma empresa Scribd logo
1 de 45
WHAT IS BIG DATA TESTING
STRATEGIES BEHIND BIG DATA TESTING
www.edureka.co
BIG DATA FORMS
BIG DATA TESTING ENVIRONMENT
TESTING BIG DATA
BIG DATA TESTING TOOLS
WHY DO WE NEED BIG DATA TESTING?
CHALLENGES FACED IN TESTING BIG DATA
BIG DATA TESTING V/S TRADITIONAL TESTING
WHY DO WE NEED BIG DATA TESTING
www.edureka.co
WHY DO WE NEED BIG DATA TESTING
www.edureka.co
www.edureka.co
WHY DO WE NEED BIG DATA TESTING
www.edureka.co
WHY DO WE NEED BIG DATA TESTING
www.edureka.co
WHAT IS BIG DATA TESTING
www.edureka.co
WHAT IS BIG DATA TESTING
www.edureka.co
Big Data Testing can be defined as the procedure that involves
examining and validating the functionality of the Big Data
Applications. Big Data is a collection of a huge amount of data
that traditional storage systems cannot handle.
www.edureka.co
STRATEGIES BEHIND BIG DATA TESTING
www.edureka.co
STRATEGIES BEHIND BIG DATA TESTING
www.edureka.co
Testing an Application that handles terabytes of data would
take the skill from a whole new level and out of the box
thinking.
STRATEGIES BEHIND BIG DATA TESTING
Real-Time Processing
Interactive Processing
Batch Processing
www.edureka.co
BIG DATA FORMS
www.edureka.co
Structured
Semi-Structured
Unstructured
BIG DATA FORMS
www.edureka.co
BIG DATA FORMS
www.edureka.co
BIG DATA TESTING ENVIRONMENT
www.edureka.co
BIG DATA TESTING ENVIRONMENT
www.edureka.co
BIG DATA TESTING ENVIRONMENT
The basic requirements that makeup Data Testing are as follows.
1. Space for Storing, Processing and Validating Terra bytes of data should be available.
2. The Cluster and its respective nodes should be responsive
3. Data Processing resources like the powerful CPU should be available
www.edureka.co
BIG DATA TESTING
www.edureka.co
BIG DATA TESTING
www.edureka.co
Data is first loaded from source to Big Data System using extracting tools. The Storage might be
HDFS, MongoDB or any similar storage. Then, the loaded data is cross-checked for errors and
missing values.
Data Ingestion
BIG DATA TESTING
www.edureka.co
In this stage, the key-value pairs for the data get generated. Later,
the MapReduce logic is applied to all the nodes and checked if the
algorithm works fine or not. A data validation process takes place
here to make sure the output is generated as expected.
Data Processing
www.edureka.co
BIG DATA TESTING
www.edureka.co
At this stage, the output generated is ready to be migrated to the
data warehouse. Here, The transformation logic is checked, the data
integrity is verified and the key-value pairs at the location are
validated for accuracy.
Data Validation
www.edureka.co
UNIT TESTING
The complete Big Data Application is divided into segments and each
segment is rigorously tested with multiple possibilities for an expected
outcome. If the segment fails, then it is sent back to the developments
and improvements.
www.edureka.co
FUNCTIONAL TESTING
Functional Testing can be otherwise called as the different
phases in testing the big data application. The Big Data
Application is designed to deal with huge blocks of data.
www.edureka.co
Data Storage Phase
Data Processing Phase
Data Ingestion Phase
Data Integrity Phase
Data Validation Phase
Report Generation Phase
www.edureka.co
NON-FUNCTIONAL TESTING
The Non-Functional Testing phase takes care of the three major
dimensions and characteristics of Big Data. The Volume, Velocity,
and finally the Variety of the Big Data.
www.edureka.co
1
3
5
2
4
Data Quality
Monitoring
Infrastructure
Data Security
Data
Performance
Fail-over Test
Mechanism
www.edureka.co
PERFORMANCE TESTING
Performance testing highly concentrates on the performance
delivered by all the components of the big data system.
www.edureka.co
Data Collection Phase
Data Ingestion Phase
Data Processing
Component Testing
www.edureka.co
PERFORMANCE TESTING APPROACH
www.edureka.co
PARAMETERS OF PERFORMANCE TESTING
Concurrency
Caching
Commit
logs
Timeout
Data
Storage
Performance
Testing
Parameters
www.edureka.co
ARCHITECTURE TESTING
Architecture testing concentrates on establishing a stable
Hadoop Architecture. The architecture of Big Data Processing
Application plays a key role in achieving smooth operations.
www.edureka.co
ARCHITECTURE TESTING
1
2
3
4
Performance Degradation
Node Failure
High Data Latency
High Maintainance
www.edureka.co
BIG DATA TESTING TOOLS
www.edureka.co
BIG DATA TESTING TOOLS
Data Processing
Data Storage
Data Ingestion
Data Migration
www.edureka.co
BIG DATA TESTING TOOLS
Data Ingestion Data Processing Data Storage Data Migration
www.edureka.co
KEY CHALLENGES FACED IN TESTING BIG DATA
www.edureka.co
KEY CHALLENGES FACED IN TESTING BIGDATA
• Big Data Testing is highly complicated, the process requires a highly
skilled official.
• Automated Big Data Testing procedures are and not suited for
unexpected errors.
• Virtual Machine latency creates latency in tests and managing
multimedia is a hazzle
• The volume of the data is one major challenge for testing
• Test environment and automation should be developed for different
platforms
• Each component is from a different technology, hence, requires isolated
testing
• No single tool can perform end to end testing
• High Degree of scripting is required for designing test cases
• Customized Solutions are required to increase performance and test
critical areas
www.edureka.co
BIG DATA TESTING v/s TRADITIONALTESTING
www.edureka.co
Big Data Testing Supports all types of data testing
Traditional testing supports only structured data testing
BIG DATA TESTING v/s TRADITIONAL DATA TESTING
www.edureka.co
Big Data Testing requires Research and Development
Traditional Data Testing doesn’t need Research and Development
BIG DATA TESTING v/s TRADITIONAL DATA TESTING
www.edureka.co
Data Size is Unlimited in Big Data Testing
Data Size is limited in Big Data Testing
BIG DATA TESTING v/s TRADITIONAL DATA TESTING
www.edureka.co
Big Data Testing requires special Environment
Traditional Data Testing doesn’t require special Environment
BIG DATA TESTING v/s TRADITIONAL DATA TESTING
www.edureka.co
Only the highly skilled and qualified candidates are preferred
Basic operations knowledge is enough to run tests
BIG DATA TESTING v/s TRADITIONAL DATA TESTING
www.edureka.co
www.edureka.co

Mais conteúdo relacionado

Mais de Edureka!

How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaEdureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaEdureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | EdurekaEdureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEdureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEdureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaEdureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaEdureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaEdureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaEdureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaEdureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | EdurekaEdureka!
 
ITIL® Tutorial for Beginners | ITIL® Foundation Training | Edureka
ITIL® Tutorial for Beginners | ITIL® Foundation Training | EdurekaITIL® Tutorial for Beginners | ITIL® Foundation Training | Edureka
ITIL® Tutorial for Beginners | ITIL® Foundation Training | EdurekaEdureka!
 
Difference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | Edureka
Difference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | EdurekaDifference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | Edureka
Difference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | EdurekaEdureka!
 
Jenkins vs Bamboo | Differences Between Jenkins and Bamboo | Edureka
Jenkins vs Bamboo | Differences Between Jenkins and Bamboo | EdurekaJenkins vs Bamboo | Differences Between Jenkins and Bamboo | Edureka
Jenkins vs Bamboo | Differences Between Jenkins and Bamboo | EdurekaEdureka!
 
What Is Digital Marketing? | Digital Marketing Tutorial | Edureka
What Is Digital Marketing? | Digital Marketing Tutorial | EdurekaWhat Is Digital Marketing? | Digital Marketing Tutorial | Edureka
What Is Digital Marketing? | Digital Marketing Tutorial | EdurekaEdureka!
 
What is JUnit? | Edureka
What is JUnit? | EdurekaWhat is JUnit? | Edureka
What is JUnit? | EdurekaEdureka!
 
Machine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | EdurekaMachine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | EdurekaEdureka!
 
Web Development Projects | Web Dev Project Ideas For Beginners | Edureka
Web Development Projects | Web Dev Project Ideas For Beginners | EdurekaWeb Development Projects | Web Dev Project Ideas For Beginners | Edureka
Web Development Projects | Web Dev Project Ideas For Beginners | EdurekaEdureka!
 
What's new in python 3.8? | Python 3.8 New Features | Edureka
What's new in python 3.8? | Python 3.8 New Features | EdurekaWhat's new in python 3.8? | Python 3.8 New Features | Edureka
What's new in python 3.8? | Python 3.8 New Features | EdurekaEdureka!
 

Mais de Edureka! (20)

How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 
ITIL® Tutorial for Beginners | ITIL® Foundation Training | Edureka
ITIL® Tutorial for Beginners | ITIL® Foundation Training | EdurekaITIL® Tutorial for Beginners | ITIL® Foundation Training | Edureka
ITIL® Tutorial for Beginners | ITIL® Foundation Training | Edureka
 
Difference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | Edureka
Difference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | EdurekaDifference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | Edureka
Difference between ITIL v3 and ITIL 4 | ITIL® Foundation Training | Edureka
 
Jenkins vs Bamboo | Differences Between Jenkins and Bamboo | Edureka
Jenkins vs Bamboo | Differences Between Jenkins and Bamboo | EdurekaJenkins vs Bamboo | Differences Between Jenkins and Bamboo | Edureka
Jenkins vs Bamboo | Differences Between Jenkins and Bamboo | Edureka
 
What Is Digital Marketing? | Digital Marketing Tutorial | Edureka
What Is Digital Marketing? | Digital Marketing Tutorial | EdurekaWhat Is Digital Marketing? | Digital Marketing Tutorial | Edureka
What Is Digital Marketing? | Digital Marketing Tutorial | Edureka
 
What is JUnit? | Edureka
What is JUnit? | EdurekaWhat is JUnit? | Edureka
What is JUnit? | Edureka
 
Machine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | EdurekaMachine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | Edureka
 
Web Development Projects | Web Dev Project Ideas For Beginners | Edureka
Web Development Projects | Web Dev Project Ideas For Beginners | EdurekaWeb Development Projects | Web Dev Project Ideas For Beginners | Edureka
Web Development Projects | Web Dev Project Ideas For Beginners | Edureka
 
What's new in python 3.8? | Python 3.8 New Features | Edureka
What's new in python 3.8? | Python 3.8 New Features | EdurekaWhat's new in python 3.8? | Python 3.8 New Features | Edureka
What's new in python 3.8? | Python 3.8 New Features | Edureka
 

Último

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 

Big Data Testing | Edureka

  • 1.
  • 2. WHAT IS BIG DATA TESTING STRATEGIES BEHIND BIG DATA TESTING www.edureka.co BIG DATA FORMS BIG DATA TESTING ENVIRONMENT TESTING BIG DATA BIG DATA TESTING TOOLS WHY DO WE NEED BIG DATA TESTING? CHALLENGES FACED IN TESTING BIG DATA BIG DATA TESTING V/S TRADITIONAL TESTING
  • 3. WHY DO WE NEED BIG DATA TESTING www.edureka.co
  • 4. WHY DO WE NEED BIG DATA TESTING www.edureka.co
  • 6. WHY DO WE NEED BIG DATA TESTING www.edureka.co
  • 7. WHY DO WE NEED BIG DATA TESTING www.edureka.co
  • 8. WHAT IS BIG DATA TESTING www.edureka.co
  • 9. WHAT IS BIG DATA TESTING www.edureka.co Big Data Testing can be defined as the procedure that involves examining and validating the functionality of the Big Data Applications. Big Data is a collection of a huge amount of data that traditional storage systems cannot handle. www.edureka.co
  • 10. STRATEGIES BEHIND BIG DATA TESTING www.edureka.co
  • 11. STRATEGIES BEHIND BIG DATA TESTING www.edureka.co Testing an Application that handles terabytes of data would take the skill from a whole new level and out of the box thinking.
  • 12. STRATEGIES BEHIND BIG DATA TESTING Real-Time Processing Interactive Processing Batch Processing www.edureka.co
  • 16. BIG DATA TESTING ENVIRONMENT www.edureka.co
  • 17. BIG DATA TESTING ENVIRONMENT www.edureka.co
  • 18. BIG DATA TESTING ENVIRONMENT The basic requirements that makeup Data Testing are as follows. 1. Space for Storing, Processing and Validating Terra bytes of data should be available. 2. The Cluster and its respective nodes should be responsive 3. Data Processing resources like the powerful CPU should be available www.edureka.co
  • 20. BIG DATA TESTING www.edureka.co Data is first loaded from source to Big Data System using extracting tools. The Storage might be HDFS, MongoDB or any similar storage. Then, the loaded data is cross-checked for errors and missing values. Data Ingestion
  • 21. BIG DATA TESTING www.edureka.co In this stage, the key-value pairs for the data get generated. Later, the MapReduce logic is applied to all the nodes and checked if the algorithm works fine or not. A data validation process takes place here to make sure the output is generated as expected. Data Processing www.edureka.co
  • 22. BIG DATA TESTING www.edureka.co At this stage, the output generated is ready to be migrated to the data warehouse. Here, The transformation logic is checked, the data integrity is verified and the key-value pairs at the location are validated for accuracy. Data Validation www.edureka.co
  • 23. UNIT TESTING The complete Big Data Application is divided into segments and each segment is rigorously tested with multiple possibilities for an expected outcome. If the segment fails, then it is sent back to the developments and improvements. www.edureka.co
  • 24. FUNCTIONAL TESTING Functional Testing can be otherwise called as the different phases in testing the big data application. The Big Data Application is designed to deal with huge blocks of data. www.edureka.co
  • 25. Data Storage Phase Data Processing Phase Data Ingestion Phase Data Integrity Phase Data Validation Phase Report Generation Phase www.edureka.co
  • 26. NON-FUNCTIONAL TESTING The Non-Functional Testing phase takes care of the three major dimensions and characteristics of Big Data. The Volume, Velocity, and finally the Variety of the Big Data. www.edureka.co
  • 28. PERFORMANCE TESTING Performance testing highly concentrates on the performance delivered by all the components of the big data system. www.edureka.co
  • 29. Data Collection Phase Data Ingestion Phase Data Processing Component Testing www.edureka.co
  • 31. PARAMETERS OF PERFORMANCE TESTING Concurrency Caching Commit logs Timeout Data Storage Performance Testing Parameters www.edureka.co
  • 32. ARCHITECTURE TESTING Architecture testing concentrates on establishing a stable Hadoop Architecture. The architecture of Big Data Processing Application plays a key role in achieving smooth operations. www.edureka.co
  • 33. ARCHITECTURE TESTING 1 2 3 4 Performance Degradation Node Failure High Data Latency High Maintainance www.edureka.co
  • 34. BIG DATA TESTING TOOLS www.edureka.co
  • 35. BIG DATA TESTING TOOLS Data Processing Data Storage Data Ingestion Data Migration www.edureka.co
  • 36. BIG DATA TESTING TOOLS Data Ingestion Data Processing Data Storage Data Migration www.edureka.co
  • 37. KEY CHALLENGES FACED IN TESTING BIG DATA www.edureka.co
  • 38. KEY CHALLENGES FACED IN TESTING BIGDATA • Big Data Testing is highly complicated, the process requires a highly skilled official. • Automated Big Data Testing procedures are and not suited for unexpected errors. • Virtual Machine latency creates latency in tests and managing multimedia is a hazzle • The volume of the data is one major challenge for testing • Test environment and automation should be developed for different platforms • Each component is from a different technology, hence, requires isolated testing • No single tool can perform end to end testing • High Degree of scripting is required for designing test cases • Customized Solutions are required to increase performance and test critical areas www.edureka.co
  • 39. BIG DATA TESTING v/s TRADITIONALTESTING www.edureka.co
  • 40. Big Data Testing Supports all types of data testing Traditional testing supports only structured data testing BIG DATA TESTING v/s TRADITIONAL DATA TESTING www.edureka.co
  • 41. Big Data Testing requires Research and Development Traditional Data Testing doesn’t need Research and Development BIG DATA TESTING v/s TRADITIONAL DATA TESTING www.edureka.co
  • 42. Data Size is Unlimited in Big Data Testing Data Size is limited in Big Data Testing BIG DATA TESTING v/s TRADITIONAL DATA TESTING www.edureka.co
  • 43. Big Data Testing requires special Environment Traditional Data Testing doesn’t require special Environment BIG DATA TESTING v/s TRADITIONAL DATA TESTING www.edureka.co
  • 44. Only the highly skilled and qualified candidates are preferred Basic operations knowledge is enough to run tests BIG DATA TESTING v/s TRADITIONAL DATA TESTING www.edureka.co