What is “Data Engineering”?
Data Engineering Lab.
Kim Yong Dam
DataPub 12/3
Data Engineering Lab. in Sogang Univ.
<Contents>
1. Introduction
2. What is Data Engineering?
3. Role of Data Engineer
4. What I’m doing?
5. Future Work
1. Introduction
Introduction
https://blog.hackerrank.com/the-biggest-misconception-about-data-scientists/
Features / value
Optimization
Price / Analysis
How?
2. What is Data Engineering?
Data Engineering
https://blog.hackerrank.com/the-biggest-misconception-about-data-scientists/
Tons of things to do..
Build systems with respect to each data domain
“On Computer Architecture”
3. Role of Data Engineer
Role of Data Engineer
https://jobs.apple.com/us/search?job=86260820&openJobId=86260820#&ss=Data%20Engineer&t=0&so=&pN=0&openJobId=99607161
https://cloud.google.com/certification/data-engineer
1. Designing data processing systems
2. Building and maintaining data structures and databases
3. Analyzing data and enabling machine learning
4. Modeling business processes for analysis and optimization
5. Ensuring reliability
6. Visualizing data and advocating policy
7. Designing for security and compliance
“Should focus on something”
4. What I’m doing?
My Voyage
1. Designing data processing systems
2. Building and maintaining data structures and databases
3. Analyzing data and enabling machine learning
4. Modeling business processes for analysis and optimization
5. Ensuring reliability
6. Visualizing data and advocating policy
7. Designing for security and compliance
For what?
My Voyage
http://www.ibmbigdatahub.com/infographic/four-vs-big-data
http://www.jobs.ac.uk/enhanced/industry/lifesciences-london/
Make an implemented connection
As a TEAM!
5. Future Work
Future Work
1. Tree Optimization for Spatial data in Non-Volatile Memory
2. Keyword Clustering for SNS data analysis
3. Clustering technique as unsupervised learning
4. Spatial Web Querying using Spatial Database
Future Work
1. PB+ tree, R-tree for PCM
2. Ontology-based Keyword Clustering; Review on Semantic Document Clustering
3. An efficient K-Means Algorithm integrated with Jaccard Distance Measure for Document Clustering; A New Mallows Distance Based Metric for Comparing Clusterings; Measuring Similarity between Sets of Overlapping Clusters
4. Efficient Processing of Spatial Group Keyword Queries; Keyword Search in Spatial Databases: Toward Searching by Document
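Item 3 above names a K-Means variant built on the Jaccard distance measure. As a rough illustration only (this is not the cited paper's algorithm), here is a minimal Python sketch of the distance itself, assuming each document has already been reduced to a set of keywords. The document names and keyword sets are made up for the example.

```python
def jaccard_distance(a, b):
    """Jaccard distance between two collections: 1 - |A ∩ B| / |A ∪ B|."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Convention chosen for this sketch: two empty sets count as identical.
        return 0.0
    return 1.0 - len(a & b) / len(union)


# Hypothetical keyword sets for three short documents.
docs = {
    "d1": {"spatial", "index", "tree"},
    "d2": {"spatial", "query", "tree"},
    "d3": {"keyword", "clustering"},
}

print(jaccard_distance(docs["d1"], docs["d2"]))  # share 2 of 4 distinct keywords
print(jaccard_distance(docs["d1"], docs["d3"]))  # share no keywords
```

A distance of 0 means the keyword sets are identical and 1 means they are disjoint, so a clustering step could group documents whose pairwise distance is small.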
Q & A
Thank you
