SlideShare a Scribd company logo
1 of 39
Download to read offline
Open Source Software,
Distributed Systems,
Database as a Cloud Service
第106回オープンソースサロン・総会記念講演
Jul 29, 2016
Satoshi Tagomori (@tagomoris)
Satoshi "Moris" Tagomori
(@tagomoris)
Fluentd, MessagePack-Ruby, Norikra, ...
Treasure Data, Inc.
Topics
• What is Treasure Data?
• Who is tagomoris?
• Treasure Data: Database as a Service
• DB as a Service and Distributed Systems
• Distributed Systems and Open Source Software
• Open Source Software and Developers
http://rubybiz.jp/prize.html
API
Data
M
arts
O
DBC
/ JDBC
Sensor
ERP
CRM
RDBMS
Mobile
Web
Server
3 Complex ETL
4 End User System
2 Time consuming integration
1 Disparate data silos
Without

Treasure Data
Advanced
Analytics
Reporting
BI
API
Data
M
arts
O
DBC
/ JDBC
Sensor
ERP
CRM
RDBMS
Mobile
Web
Server
IoT
Connectors
Data
Connectors
JavaScript
SDK
Serverside
collector
Bulk
Loader
M
obile
SDK
With Treasure Data
3
Easy to Integrate
2 Zero Management
1
Easy to Collect
50+ Data Outputs
Multi-Tenant Cloud Service
300+ Data Sources
Advanced
Analytics
Reporting
BI
50+Integrations
Schema-flexible, Access via SQL,
Unlimited Users, Queries
HQ
Branch
Matsue
Treasure Data, Inc.
• Since Nov 2011
• Headquarters: Mountain View, CA, US
• Japan Branch: Marunouchi, Chiyoda, Tokyo
• Korea Branch: Gangnam, Seoul
• Some remote workers - US, UK, Costa Rica
Developers in TD
• Daily development in each offices
• Communication over Internet
• Slack, JIRA, Confluence & Zoom
• Frontend Team: mainly in US
• Console, Web services, etc
• Backend Team: mainly in JP
• Database, Distributed processing systems, etc
Satoshi "Moris" Tagomori
(@tagomoris)
Born in Matsue, Shimane
Living in Tokyo from 1999
Started to work
as an OSS developer
1. Asahi Net
Internal system developer
2. NTT DATA Intellilink
System consultant
3. livedoor - NHNJ - LINE
Infrastructure engineer
Data analytics platform
engineer
4. Treasure Data
Backend engineer
OSS developer
@tagomoris as
an Open Source Software Developer
• Author
• Norikra, Woothee, xbuild, Shib, Yabitz, Focuslight
• Many fluent-plugin-*
• And many libraries, tools, etc
• Committer, Maintainer
• Fluentd, MessagePack-Ruby, etc
• Contributor
• Docker (logging driver), etc
@tagomoris as
an Open Source Software Developer
• Talks
• Many programming conferences (local, global)
• Many small meetups
• Articles
• WEB+DB Magazine, Software Design
• Many blog posts
• Invented Event: ISUCON
OSS Developers in TD
• MessagePack, Fluentd, Embulk & Digdag founder
• Ruby committer
• Ruby & JRuby committer
• Fluentd & D-language committer
• Hadoop/Spark contributor, pyenv author, ...
Why Are OSS Developers
So Major in TD?
Treasure Data:
Database as a Cloud
Service
API
Data
M
arts
O
DBC
/ JDBC
Sensor
ERP
CRM
RDBMS
Mobile
Web
Server
IoT
Connectors
Data
Connectors
JavaScript
SDK
Serverside
collector
Bulk
Loader
M
obile
SDK
3
Easy to Integrate
2 Zero Management
1
Easy to Collect
50+ Data Outputs
Multi-Tenant Cloud Service
300+ Data Sources
Advanced
Analytics
Reporting
BI
50+Integrations
Database as a Cloud Service
• Collect data
• from remote site - customer side
• Store/Process data
• beyond cloud
• Integrate data
• to remote site - customer side
Two OSS Pattern in TD
• OSS to collect/integrate data from/to remote site
• OSS to store/process data
API
Data
M
arts
O
DBC
/ JDBC
Sensor
ERP
CRM
RDBMS
Mobile
Web
Server
IoT
Connectors
Data
Connectors
JavaScript
SDK
Serverside
collector
Bulk
Loader
M
obile
SDK
3
Easy to Integrate
2 Zero Management
1
Easy to Collect
50+ Data Outputs
Multi-Tenant Cloud Service
300+ Data Sources
Advanced
Analytics
Reporting
BI
50+Integrations
Make Input/Output Easy
• Agent installed in our customers systems
• OSS + Plugin to connect various systems
• No barrier to use TD
1.Make a great OSS product to do it
2.Make it major
3.Potential customer already uses it :)
• very easy to switch to use Treasure Data!
Multi-Tenant Cloud Service
API
Data
M
arts
O
DBC
/ JDBC
Sensor
ERP
CRM
RDBMS
Mobile
Web
Server
IoT
Connectors
Data
Connectors
JavaScript
SDK
Serverside
collector
Bulk
Loader
M
obile
SDK
3
Easy to Integrate
2 Zero Management
1
Easy to Collect
50+ Data Outputs
300+ Data Sources
Advanced
Analytics
Reporting
BI
50+Integrations
Database as a Service
and
Distributed Systems
Many Customers in a System
• Share computer resource
• Provide much more computer resource
• Reduce total cost :-)
Big Data in a System
• Manage big data from many customers
• Manage computing power for many customers
• Create a distributed system!
• for fast query processor
• for resource scheduler
• for high availability
Distributed Systems
and
Open Source Software
Distributed Systems
Distributed System Software
• Major software are all OSS
• Hadoop, Presto, Kafka, Storm, ...
• Concept and Implementation
• MapReduce concept was from Google
• Yahoo! engineers implemented it as Hadoop
• Many others made Hadoop better
• Data is always growing

-> Software MUST be growing too
Deploying Distributed System
• Many things make it hard to fix issues
• Big data, many computers, complex queries, ...
• We MUST fix our issues as soon as possible
• for our customers
• for our operation costs
DO IT YOURSELF! → OSS
Updating Distributed System
• It's very hard to update distributed systems
• many servers, no data lost, no downtime, ...
• Use OSS as-is without dirty fix
• to keep it easy to upgrade "software"
• Contribute your patch to community
• to use patched mainstream software as-is
Open Source Software
and
Developers
DIY Policy Makes "Tech" Company
• Do it yourself "At Your Own Risk": OSS
• Taking risk: more OSS
• OSS: more controllable than proprietary software
• We can read/contribute source code :)
• Technology problem: Can we take a risk? Or not?
Tech Company and Developers
• Taking risk for business success:

more focus on technology
• Quality of OSS depends on each developers
• Who is the committer of that product?
• Who can review quality of that product?
• Tech company needs great developers seriously!
OSS and Developers
• "OSS Committer", not "OSS Committing Company"
• the initiative by developer, not company
• Commit log shows everything about common things
• Who did contribute to that software?
• Who did develop that feature?
• Who did fix that problem?
• People can know who is a good software engineer
• it makes good developers happy!
Developers love OSS Company
• OSS Company: a kind of Tech Companies
• easy to find it: see committers/contributors
• Developers love:
• challenging "technical" tasks/issues to be solved
• great coworkers, like committers of great software
• nice salary brought by taking risk :P
Enjoy Engineering!
MOST IMPORTANT THING:
Thanks!

More Related Content

What's hot

Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014
Sadayuki Furuhashi
 

What's hot (20)

Norikra Recent Updates
Norikra Recent UpdatesNorikra Recent Updates
Norikra Recent Updates
 
Presto At Treasure Data
Presto At Treasure DataPresto At Treasure Data
Presto At Treasure Data
 
Fluentd - Flexible, Stable, Scalable
Fluentd - Flexible, Stable, ScalableFluentd - Flexible, Stable, Scalable
Fluentd - Flexible, Stable, Scalable
 
Treasure Data and AWS - Developers.io 2015
Treasure Data and AWS - Developers.io 2015Treasure Data and AWS - Developers.io 2015
Treasure Data and AWS - Developers.io 2015
 
Data Analytics Service Company and Its Ruby Usage
Data Analytics Service Company and Its Ruby UsageData Analytics Service Company and Its Ruby Usage
Data Analytics Service Company and Its Ruby Usage
 
Tale of ISUCON and Its Bench Tools
Tale of ISUCON and Its Bench ToolsTale of ISUCON and Its Bench Tools
Tale of ISUCON and Its Bench Tools
 
Plazma - Treasure Data’s distributed analytical database -
Plazma - Treasure Data’s distributed analytical database -Plazma - Treasure Data’s distributed analytical database -
Plazma - Treasure Data’s distributed analytical database -
 
Presto anatomy
Presto anatomyPresto anatomy
Presto anatomy
 
Bullet: A Real Time Data Query Engine
Bullet: A Real Time Data Query EngineBullet: A Real Time Data Query Engine
Bullet: A Real Time Data Query Engine
 
Technologies for Data Analytics Platform
Technologies for Data Analytics PlatformTechnologies for Data Analytics Platform
Technologies for Data Analytics Platform
 
Ruby and Distributed Storage Systems
Ruby and Distributed Storage SystemsRuby and Distributed Storage Systems
Ruby and Distributed Storage Systems
 
Planet-scale Data Ingestion Pipeline: Bigdam
Planet-scale Data Ingestion Pipeline: BigdamPlanet-scale Data Ingestion Pipeline: Bigdam
Planet-scale Data Ingestion Pipeline: Bigdam
 
Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014
 
Lambda Architecture Using SQL
Lambda Architecture Using SQLLambda Architecture Using SQL
Lambda Architecture Using SQL
 
Open Source Logging and Monitoring Tools
Open Source Logging and Monitoring ToolsOpen Source Logging and Monitoring Tools
Open Source Logging and Monitoring Tools
 
fluentd -- the missing log collector
fluentd -- the missing log collectorfluentd -- the missing log collector
fluentd -- the missing log collector
 
"How about no grep and zabbix?". ELK based alerts and metrics.
"How about no grep and zabbix?". ELK based alerts and metrics."How about no grep and zabbix?". ELK based alerts and metrics.
"How about no grep and zabbix?". ELK based alerts and metrics.
 
User Defined Partitioning on PlazmaDB
User Defined Partitioning on PlazmaDBUser Defined Partitioning on PlazmaDB
User Defined Partitioning on PlazmaDB
 
Fluentd and Kafka
Fluentd and KafkaFluentd and Kafka
Fluentd and Kafka
 
Using Morphlines for On-the-Fly ETL
Using Morphlines for On-the-Fly ETLUsing Morphlines for On-the-Fly ETL
Using Morphlines for On-the-Fly ETL
 

Viewers also liked (7)

Fighting API Compatibility On Fluentd Using "Black Magic"
Fighting API Compatibility On Fluentd Using "Black Magic"Fighting API Compatibility On Fluentd Using "Black Magic"
Fighting API Compatibility On Fluentd Using "Black Magic"
 
Modern Black Mages Fighting in the Real World
Modern Black Mages Fighting in the Real WorldModern Black Mages Fighting in the Real World
Modern Black Mages Fighting in the Real World
 
Fluentd Overview, Now and Then
Fluentd Overview, Now and ThenFluentd Overview, Now and Then
Fluentd Overview, Now and Then
 
20160730 fluentd meetup in matsue slide
20160730 fluentd meetup in matsue slide20160730 fluentd meetup in matsue slide
20160730 fluentd meetup in matsue slide
 
The Patterns of Distributed Logging and Containers
The Patterns of Distributed Logging and ContainersThe Patterns of Distributed Logging and Containers
The Patterns of Distributed Logging and Containers
 
AWSにおけるバッチ処理の ベストプラクティス - Developers.IO Meetup 05
AWSにおけるバッチ処理の ベストプラクティス - Developers.IO Meetup 05AWSにおけるバッチ処理の ベストプラクティス - Developers.IO Meetup 05
AWSにおけるバッチ処理の ベストプラクティス - Developers.IO Meetup 05
 
Fluentd v0.14 Plugin API Details
Fluentd v0.14 Plugin API DetailsFluentd v0.14 Plugin API Details
Fluentd v0.14 Plugin API Details
 

Similar to Open Source Software, Distributed Systems, Database as a Cloud Service

Accelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data StrategyAccelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data Strategy
MongoDB
 
Polyakov how i will break your enterprise. esb security and more
Polyakov   how i will break your enterprise. esb security and morePolyakov   how i will break your enterprise. esb security and more
Polyakov how i will break your enterprise. esb security and more
DefconRussia
 
L’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneL’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazione
MongoDB
 

Similar to Open Source Software, Distributed Systems, Database as a Cloud Service (20)

Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
#Techorama belgium 2018 vincent biret deep dive with the #MicrosoftGraph
#Techorama belgium 2018 vincent biret deep dive with the #MicrosoftGraph#Techorama belgium 2018 vincent biret deep dive with the #MicrosoftGraph
#Techorama belgium 2018 vincent biret deep dive with the #MicrosoftGraph
 
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
Can Your Mobile Infrastructure Survive 1 Million Concurrent Users?
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data Lake
 
Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020
 
The Rise of Digital Audio (AdsWizz, DevTalks Bucharest, 2015)
The Rise of Digital Audio (AdsWizz, DevTalks Bucharest, 2015)The Rise of Digital Audio (AdsWizz, DevTalks Bucharest, 2015)
The Rise of Digital Audio (AdsWizz, DevTalks Bucharest, 2015)
 
AWS Dublin Briefing - Logentries Customer Presentation
AWS Dublin Briefing - Logentries Customer PresentationAWS Dublin Briefing - Logentries Customer Presentation
AWS Dublin Briefing - Logentries Customer Presentation
 
2014.10.22 Building Azure Solutions with Office 365
2014.10.22 Building Azure Solutions with Office 3652014.10.22 Building Azure Solutions with Office 365
2014.10.22 Building Azure Solutions with Office 365
 
How leading financial services organisations are winning with tech
How leading financial services organisations are winning with techHow leading financial services organisations are winning with tech
How leading financial services organisations are winning with tech
 
Accelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data StrategyAccelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data Strategy
 
Polyakov how i will break your enterprise. esb security and more
Polyakov   how i will break your enterprise. esb security and morePolyakov   how i will break your enterprise. esb security and more
Polyakov how i will break your enterprise. esb security and more
 
MongoDB - General Purpose Database
MongoDB - General Purpose DatabaseMongoDB - General Purpose Database
MongoDB - General Purpose Database
 
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
 
10/ EnterpriseDB @ OPEN'16
10/ EnterpriseDB @ OPEN'16 10/ EnterpriseDB @ OPEN'16
10/ EnterpriseDB @ OPEN'16
 
Discover MongoDB Atlas and MongoDB Stitch - DEM02-S - Mexico City AWS Summit
Discover MongoDB Atlas and MongoDB Stitch - DEM02-S - Mexico City AWS SummitDiscover MongoDB Atlas and MongoDB Stitch - DEM02-S - Mexico City AWS Summit
Discover MongoDB Atlas and MongoDB Stitch - DEM02-S - Mexico City AWS Summit
 
L’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneL’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazione
 
PostgreSQL as a Strategic Tool
PostgreSQL as a Strategic ToolPostgreSQL as a Strategic Tool
PostgreSQL as a Strategic Tool
 
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
 

More from SATOSHI TAGOMORI

More from SATOSHI TAGOMORI (13)

Ractor's speed is not light-speed
Ractor's speed is not light-speedRactor's speed is not light-speed
Ractor's speed is not light-speed
 
Good Things and Hard Things of SaaS Development/Operations
Good Things and Hard Things of SaaS Development/OperationsGood Things and Hard Things of SaaS Development/Operations
Good Things and Hard Things of SaaS Development/Operations
 
Maccro Strikes Back
Maccro Strikes BackMaccro Strikes Back
Maccro Strikes Back
 
Invitation to the dark side of Ruby
Invitation to the dark side of RubyInvitation to the dark side of Ruby
Invitation to the dark side of Ruby
 
Hijacking Ruby Syntax in Ruby (RubyConf 2018)
Hijacking Ruby Syntax in Ruby (RubyConf 2018)Hijacking Ruby Syntax in Ruby (RubyConf 2018)
Hijacking Ruby Syntax in Ruby (RubyConf 2018)
 
Make Your Ruby Script Confusing
Make Your Ruby Script ConfusingMake Your Ruby Script Confusing
Make Your Ruby Script Confusing
 
Hijacking Ruby Syntax in Ruby
Hijacking Ruby Syntax in RubyHijacking Ruby Syntax in Ruby
Hijacking Ruby Syntax in Ruby
 
Lock, Concurrency and Throughput of Exclusive Operations
Lock, Concurrency and Throughput of Exclusive OperationsLock, Concurrency and Throughput of Exclusive Operations
Lock, Concurrency and Throughput of Exclusive Operations
 
Data Processing and Ruby in the World
Data Processing and Ruby in the WorldData Processing and Ruby in the World
Data Processing and Ruby in the World
 
Fluentd 101
Fluentd 101Fluentd 101
Fluentd 101
 
Hive dirty/beautiful hacks in TD
Hive dirty/beautiful hacks in TDHive dirty/beautiful hacks in TD
Hive dirty/beautiful hacks in TD
 
Data-Driven Development Era and Its Technologies
Data-Driven Development Era and Its TechnologiesData-Driven Development Era and Its Technologies
Data-Driven Development Era and Its Technologies
 
Engineer as a Leading Role
Engineer as a Leading RoleEngineer as a Leading Role
Engineer as a Leading Role
 

Recently uploaded

TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 

Recently uploaded (20)

W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 

Open Source Software, Distributed Systems, Database as a Cloud Service

  • 1. Open Source Software, Distributed Systems, Database as a Cloud Service 第106回オープンソースサロン・総会記念講演 Jul 29, 2016 Satoshi Tagomori (@tagomoris)
  • 2. Satoshi "Moris" Tagomori (@tagomoris) Fluentd, MessagePack-Ruby, Norikra, ... Treasure Data, Inc.
  • 3. Topics • What is Treasure Data? • Who is tagomoris? • Treasure Data: Database as a Service • DB as a Service and Distributed Systems • Distributed Systems and Open Source Software • Open Source Software and Developers
  • 4.
  • 6.
  • 7. API Data M arts O DBC / JDBC Sensor ERP CRM RDBMS Mobile Web Server 3 Complex ETL 4 End User System 2 Time consuming integration 1 Disparate data silos Without
 Treasure Data Advanced Analytics Reporting BI
  • 8. API Data M arts O DBC / JDBC Sensor ERP CRM RDBMS Mobile Web Server IoT Connectors Data Connectors JavaScript SDK Serverside collector Bulk Loader M obile SDK With Treasure Data 3 Easy to Integrate 2 Zero Management 1 Easy to Collect 50+ Data Outputs Multi-Tenant Cloud Service 300+ Data Sources Advanced Analytics Reporting BI 50+Integrations Schema-flexible, Access via SQL, Unlimited Users, Queries
  • 10.
  • 11. Treasure Data, Inc. • Since Nov 2011 • Headquarters: Mountain View, CA, US • Japan Branch: Marunouchi, Chiyoda, Tokyo • Korea Branch: Gangnam, Seoul • Some remote workers - US, UK, Costa Rica
  • 12. Developers in TD • Daily development in each offices • Communication over Internet • Slack, JIRA, Confluence & Zoom • Frontend Team: mainly in US • Console, Web services, etc • Backend Team: mainly in JP • Database, Distributed processing systems, etc
  • 13. Satoshi "Moris" Tagomori (@tagomoris) Born in Matsue, Shimane Living in Tokyo from 1999
  • 14. Started to work as an OSS developer 1. Asahi Net Internal system developer 2. NTT DATA Intellilink System consultant 3. livedoor - NHNJ - LINE Infrastructure engineer Data analytics platform engineer 4. Treasure Data Backend engineer OSS developer
  • 15. @tagomoris as an Open Source Software Developer • Author • Norikra, Woothee, xbuild, Shib, Yabitz, Focuslight • Many fluent-plugin-* • And many libraries, tools, etc • Committer, Maintainer • Fluentd, MessagePack-Ruby, etc • Contributor • Docker (logging driver), etc
  • 16. @tagomoris as an Open Source Software Developer • Talks • Many programming conferences (local, global) • Many small meetups • Articles • WEB+DB Magazine, Software Design • Many blog posts • Invented Event: ISUCON
  • 17. OSS Developers in TD • MessagePack, Fluentd, Embulk & Digdag founder • Ruby committer • Ruby & JRuby committer • Fluentd & D-language committer • Hadoop/Spark contributor, pyenv author, ...
  • 18. Why Are OSS Developers So Major in TD?
  • 19. Treasure Data: Database as a Cloud Service
  • 20. API Data M arts O DBC / JDBC Sensor ERP CRM RDBMS Mobile Web Server IoT Connectors Data Connectors JavaScript SDK Serverside collector Bulk Loader M obile SDK 3 Easy to Integrate 2 Zero Management 1 Easy to Collect 50+ Data Outputs Multi-Tenant Cloud Service 300+ Data Sources Advanced Analytics Reporting BI 50+Integrations
  • 21. Database as a Cloud Service • Collect data • from remote site - customer side • Store/Process data • beyond cloud • Integrate data • to remote site - customer side
  • 22. Two OSS Pattern in TD • OSS to collect/integrate data from/to remote site • OSS to store/process data
  • 23. API Data M arts O DBC / JDBC Sensor ERP CRM RDBMS Mobile Web Server IoT Connectors Data Connectors JavaScript SDK Serverside collector Bulk Loader M obile SDK 3 Easy to Integrate 2 Zero Management 1 Easy to Collect 50+ Data Outputs Multi-Tenant Cloud Service 300+ Data Sources Advanced Analytics Reporting BI 50+Integrations
  • 24. Make Input/Output Easy • Agent installed in our customers systems • OSS + Plugin to connect various systems • No barrier to use TD 1.Make a great OSS product to do it 2.Make it major 3.Potential customer already uses it :) • very easy to switch to use Treasure Data!
  • 25. Multi-Tenant Cloud Service API Data M arts O DBC / JDBC Sensor ERP CRM RDBMS Mobile Web Server IoT Connectors Data Connectors JavaScript SDK Serverside collector Bulk Loader M obile SDK 3 Easy to Integrate 2 Zero Management 1 Easy to Collect 50+ Data Outputs 300+ Data Sources Advanced Analytics Reporting BI 50+Integrations
  • 26. Database as a Service and Distributed Systems
  • 27. Many Customers in a System • Share computer resource • Provide much more computer resource • Reduce total cost :-)
  • 28. Big Data in a System • Manage big data from many customers • Manage computing power for many customers • Create a distributed system! • for fast query processor • for resource scheduler • for high availability
  • 31. Distributed System Software • Major software are all OSS • Hadoop, Presto, Kafka, Storm, ... • Concept and Implementation • MapReduce concept was from Google • Yahoo! engineers implemented it as Hadoop • Many others made Hadoop better • Data is always growing
 -> Software MUST be growing too
  • 32. Deploying Distributed System • Many things make it hard to fix issues • Big data, many computers, complex queries, ... • We MUST fix our issues as soon as possible • for our customers • for our operation costs DO IT YOURSELF! → OSS
  • 33. Updating Distributed System • It's very hard to update distributed systems • many servers, no data lost, no downtime, ... • Use OSS as-is without dirty fix • to keep it easy to upgrade "software" • Contribute your patch to community • to use patched mainstream software as-is
  • 35. DIY Policy Makes "Tech" Company • Do it yourself "At Your Own Risk": OSS • Taking risk: more OSS • OSS: more controllable than proprietary software • We can read/contribute source code :) • Technology problem: Can we take a risk? Or not?
  • 36. Tech Company and Developers • Taking risk for business success:
 more focus on technology • Quality of OSS depends on each developers • Who is the committer of that product? • Who can review quality of that product? • Tech company needs great developers seriously!
  • 37. OSS and Developers • "OSS Committer", not "OSS Committing Company" • the initiative by developer, not company • Commit log shows everything about common things • Who did contribute to that software? • Who did develop that feature? • Who did fix that problem? • People can know who is a good software engineer • it makes good developers happy!
  • 38. Developers love OSS Company • OSS Company: a kind of Tech Companies • easy to find it: see committers/contributors • Developers love: • challenging "technical" tasks/issues to be solved • great coworkers, like committers of great software • nice salary brought by taking risk :P