What is “Data Engineering”?
Data Engineering Lab.
Kim Yong Dam
DataPub 12/3
Data Engineering Lab. in Sogang Univ.
<Contents>
1. Introduction
2. What is Data Engineering?
3. Role of Data Engineer
4. What am I doing?
5. Future Work
1. Introduction
Introduction
https://blog.hackerrank.com/the-biggest-misconception-about-data-scientists/
Features / value
Optimization
Price / Analysis
How?
2. What is Data Engineering?
Data Engineering
https://blog.hackerrank.com/the-biggest-misconception-about-data-scientists/
Tons of things to do
Build systems with respect to each data domain
“On Computer Architecture”
3. Role of Data Engineer
Role of Data Engineer
https://jobs.apple.com/us/search?job=86260820&openJobId=86260820#&ss=Data%20Engineer&t=0&so=&pN=0&openJobId=99607161
https://cloud.google.com/certification/data-engineer
1. Designing data processing systems
2. Building and maintaining data structures and databases
3. Analyzing data and enabling machine learning
4. Modeling business processes for analysis and optimization
5. Ensuring reliability
6. Visualizing data and advocating policy
7. Designing for security and compliance
“Should focus on something”
4. What am I doing?
My Voyage
1. Designing data processing systems
2. Building and maintaining data structures and databases
3. Analyzing data and enabling machine learning
4. Modeling business processes for analysis and optimization
5. Ensuring reliability
6. Visualizing data and advocating policy
7. Designing for security and compliance
For what?
http://www.ibmbigdatahub.com/infographic/four-vs-big-data
http://www.jobs.ac.uk/enhanced/industry/lifesciences-london/
Make an implemented connection
As a TEAM!
5. Future Work
Future Work
1. Tree Optimization for Spatial data in Non-Volatile Memory
2. Keyword Clustering for SNS data analysis
3. Clustering technique as unsupervised learning
4. Spatial Web Querying using Spatial Database
Future Work
1. PB+ tree, R-tree for PCM
2. Ontology-based Keyword Clustering; Review on Semantic Document Clustering
3. An efficient K-Means Algorithm integrated with Jaccard Distance Measure for Document Clustering; A New Mallows Distance Based Metric for Comparing Clusterings; Measuring Similarity between Sets of Overlapping Clusters
4. Efficient Processing of Spatial Group Keyword Queries; Keyword Search in Spatial Databases: Toward Searching by Document
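Item 3 above names the Jaccard distance as the measure paired with K-Means for document clustering. As a minimal sketch of that distance only (not the cited paper's algorithm), assuming each document is represented as a set of keywords; the function name and the sample keyword sets are hypothetical:

```python
def jaccard_distance(a: set, b: set) -> float:
    """Return 1 - |a ∩ b| / |a ∪ b| for two keyword sets.

    Two empty sets are treated as identical (distance 0.0); this is a
    convention chosen for the sketch, not something the slides specify.
    """
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


# Hypothetical keyword sets standing in for three documents.
doc_a = {"spatial", "index", "tree"}
doc_b = {"spatial", "query", "tree"}
doc_c = {"cluster", "keyword"}

print(jaccard_distance(doc_a, doc_b))  # share 2 of 4 distinct keywords
print(jaccard_distance(doc_a, doc_c))  # share no keywords
```

A distance of 0 means the two keyword sets are identical and 1 means they share nothing, which is why it suits comparing documents by keyword overlap.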
Q & A
Thank you

