O slideshow foi denunciado.
Utilizamos seu perfil e dados de atividades no LinkedIn para personalizar e exibir anúncios mais relevantes. Altere suas preferências de anúncios quando desejar.
TALEND OPEN STUDIO
                          OR, HOW I LEARNED TO RELAX AND ENJOY ETL




Thursday, March 5, 2009
TIM BERGLUND



Thursday, March 5, 2009
Thursday, March 5, 2009
Thursday, March 5, 2009
Thursday, March 5, 2009
DOSUG




Thursday, March 5, 2009
Thursday, March 5, 2009
DBA

Thursday, March 5, 2009
Thursday, March 5, 2009
Thursday, March 5, 2009
ACTUAL DATA

                             DISPERSED
                           IDIOSYNCRATIC
                             ...
DATA THE BUSINESS
                        LIKES
                             CENTRALIZED
                             CONS...
OUR SOFTWARE




Thursday, March 5, 2009
OUR REFERENCE




Thursday, March 5, 2009
ABOUT


                          AN OPEN-SOURCE STARTUP



Thursday, March 5, 2009
ABOUT


                           BASED IN FRANCE



Thursday, March 5, 2009
ABOUT


                          ENVISIONS LOW-COST ETL
                           (EXTRACT, TRANSFORM, AND LOAD)




Thu...
ABOUT


                          FREE AND SUBSCRIPTION
                               PRODUCTS




Thursday, March 5, 2009
ABOUT


                            ECLIPSE-BASED



Thursday, March 5, 2009
ABOUT


                 THEREFORE JAVA-BASED, BUT HAS
                     STRANGE PERL OPTION




Thursday, March 5, 2009
ABOUT


                          HAS A FRENCH ACCENT



Thursday, March 5, 2009
ABOUT


                          http://www.talend.com



Thursday, March 5, 2009
BASIC COMPONENTS



Thursday, March 5, 2009
Business Modeler


                            Data Processing
                                “Jobs”


                  ...
BUSINESS MODELER




Thursday, March 5, 2009
JOB DESIGNER

                             VISUAL WORKSPACE
                          WHERE DEVELOPMENT TAKES
            ...
JOB DESIGNER




Thursday, March 5, 2009
METADATA REPOSITORY

                          DATABASE CONNECTIONS
                               INPUT FILES
           ...
DEMO, PLX



Thursday, March 5, 2009
AND NOW,
                          SOME THEORY


Thursday, March 5, 2009
THE SYSTEM OF
                            RECORD


Thursday, March 5, 2009
THE SYSTEM OF
                            RECORD


                           TRANSACTIONAL



Thursday, March 5, 2009
THE SYSTEM OF
                            RECORD

                          HEAVILY NORMALIZED
                           ...
THE SYSTEM OF
                            RECORD


                          PRODUCTION SYSTEM



Thursday, March 5, 2009
THE DATA
                          WAREHOUSE


Thursday, March 5, 2009
THE DATA
                          WAREHOUSE


                            DERIVED



Thursday, March 5, 2009
THE DATA
                          WAREHOUSE


                          NONTRANSACTIONAL



Thursday, March 5, 2009
THE DATA
                          WAREHOUSE


                           DENORMALIZED



Thursday, March 5, 2009
THE DATA
                            WAREHOUSE


                          IMPORTANT, BUT OFFLINE



Thursday, March 5, 20...
HOW TO DO IT



Thursday, March 5, 2009
Thursday, March 5, 2009
Thursday, March 5, 2009
EXTRACT, TRANSFORM,
                       AND LOAD



                          EXTRACTING



Thursday, March 5, 2009
EXTRACT, TRANSFORM,
                       AND LOAD



                          CLEANING



Thursday, March 5, 2009
EXTRACT, TRANSFORM,
                       AND LOAD



                          CONFORMING



Thursday, March 5, 2009
EXTRACT, TRANSFORM,
                       AND LOAD



                          THE SNOWFLAKE SCHEMA



Thursday, March 5...
THE SNOWFLAKE
                             SCHEMA

                  ONE FACT REFERENCING MANY
                         DI...
THE FACT TABLE
                                  order_fact
                                    user_id (FK)
             ...
THE DIMENSION TABLE
                          user_dimension
                                  id (PK)
                   ...
ion

                          PUT TOGETHER
ey)




                                                       user_dimension
...
LET’S SEE ANOTHER
                          DEMO!


Thursday, March 5, 2009
NOW WHAT?

                                REPORTING
                               OLAP TOOLS
                           ...
A CRITIQUE


                          DOES VISUAL PROGRAMMING
                               REALLY WORK?




Thursday, M...
A CRITIQUE


                          WHY NOT JUST USE GROOVY?



Thursday, March 5, 2009
COMMERCIAL OPTIONS



                          THERE ARE MANY



Thursday, March 5, 2009
COMMERCIAL OPTIONS


                          THEY ALL PROBABLY INVOLVE
                                    GOLF




Thur...
COMMERCIAL OPTIONS


                  THIS IS DOSUG, SO WE’LL MOVE
                              ON




Thursday, March 5...
ACKNOWLEDGEMENTS


                             THANKS TO
                     www.intellidata.net FOR THE
               ...
THANK YOU!

                          TIM BERGLUND
                          AUGUST TECHNOLOGY GROUP, LLC
                ...
PHOTO CREDITS

                            OIL DRUMS: HTTP://WWW.FLICKR.COM/PHOTOS/THE_JUSTIFIED_SINNER/2720599186/
      ...
Próximos SlideShares
Carregando em…5
×

Talend Open Studio: How I Learned To Relax And Enjoy ETL

A presentation given at the Denver Open Source Users Group on Tuesday, March 3, 2009.

  • Entre para ver os comentários

Talend Open Studio: How I Learned To Relax And Enjoy ETL

  1. 1. TALEND OPEN STUDIO OR, HOW I LEARNED TO RELAX AND ENJOY ETL Thursday, March 5, 2009
  2. 2. TIM BERGLUND Thursday, March 5, 2009
  3. 3. Thursday, March 5, 2009
  4. 4. Thursday, March 5, 2009
  5. 5. Thursday, March 5, 2009
  6. 6. DOSUG Thursday, March 5, 2009
  7. 7. Thursday, March 5, 2009
  8. 8. DBA Thursday, March 5, 2009
  9. 9. Thursday, March 5, 2009
  10. 10. Thursday, March 5, 2009
  11. 11. ACTUAL DATA DISPERSED IDIOSYNCRATIC MESSY Thursday, March 5, 2009
  12. 12. DATA THE BUSINESS LIKES CENTRALIZED CONSISTENT ANSWERS QUESTIONS Thursday, March 5, 2009
  13. 13. OUR SOFTWARE Thursday, March 5, 2009
  14. 14. OUR REFERENCE Thursday, March 5, 2009
  15. 15. ABOUT AN OPEN-SOURCE STARTUP Thursday, March 5, 2009
  16. 16. ABOUT BASED IN FRANCE Thursday, March 5, 2009
  17. 17. ABOUT ENVISIONS LOW-COST ETL (EXTRACT, TRANSFORM, AND LOAD) Thursday, March 5, 2009
  18. 18. ABOUT FREE AND SUBSCRIPTION PRODUCTS Thursday, March 5, 2009
  19. 19. ABOUT ECLIPSE-BASED Thursday, March 5, 2009
  20. 20. ABOUT THEREFORE JAVA-BASED, BUT HAS STRANGE PERL OPTION Thursday, March 5, 2009
  21. 21. ABOUT HAS A FRENCH ACCENT Thursday, March 5, 2009
  22. 22. ABOUT http://www.talend.com Thursday, March 5, 2009
  23. 23. BASIC COMPONENTS Thursday, March 5, 2009
  24. 24. Business Modeler Data Processing “Jobs” Custom Java and Groovy Scripts Metadata Repository Thursday, March 5, 2009
  25. 25. BUSINESS MODELER Thursday, March 5, 2009
  26. 26. JOB DESIGNER VISUAL WORKSPACE WHERE DEVELOPMENT TAKES PLACE Thursday, March 5, 2009
  27. 27. JOB DESIGNER Thursday, March 5, 2009
  28. 28. METADATA REPOSITORY DATABASE CONNECTIONS INPUT FILES OUTPUT FILES SCHEMAS WEB SERVICES Thursday, March 5, 2009
  29. 29. DEMO, PLX Thursday, March 5, 2009
  30. 30. AND NOW, SOME THEORY Thursday, March 5, 2009
  31. 31. THE SYSTEM OF RECORD Thursday, March 5, 2009
  32. 32. THE SYSTEM OF RECORD TRANSACTIONAL Thursday, March 5, 2009
  33. 33. THE SYSTEM OF RECORD HEAVILY NORMALIZED (IDEALLY) Thursday, March 5, 2009
  34. 34. THE SYSTEM OF RECORD PRODUCTION SYSTEM Thursday, March 5, 2009
  35. 35. THE DATA WAREHOUSE Thursday, March 5, 2009
  36. 36. THE DATA WAREHOUSE DERIVED Thursday, March 5, 2009
  37. 37. THE DATA WAREHOUSE NONTRANSACTIONAL Thursday, March 5, 2009
  38. 38. THE DATA WAREHOUSE DENORMALIZED Thursday, March 5, 2009
  39. 39. THE DATA WAREHOUSE IMPORTANT, BUT OFFLINE Thursday, March 5, 2009
  40. 40. HOW TO DO IT Thursday, March 5, 2009
  41. 41. Thursday, March 5, 2009
  42. 42. Thursday, March 5, 2009
  43. 43. EXTRACT, TRANSFORM, AND LOAD EXTRACTING Thursday, March 5, 2009
  44. 44. EXTRACT, TRANSFORM, AND LOAD CLEANING Thursday, March 5, 2009
  45. 45. EXTRACT, TRANSFORM, AND LOAD CONFORMING Thursday, March 5, 2009
  46. 46. EXTRACT, TRANSFORM, AND LOAD THE SNOWFLAKE SCHEMA Thursday, March 5, 2009
  47. 47. THE SNOWFLAKE SCHEMA ONE FACT REFERENCING MANY DIMENSIONS Thursday, March 5, 2009
  48. 48. THE FACT TABLE order_fact user_id (FK) shipping_location_id (FK) billing_location_id (FK) payment_method_id (FK) line_item_group_id (FK) order_timestamp (fact) total (fact) subtotal (fact) shipping_cost (fact) Thursday, March 5, 2009
  49. 49. THE DIMENSION TABLE user_dimension id (PK) user_id (business key) username first_name last_name company show_only_same_mfg show_nonzero_inventory mailing_list_opt_in Thursday, March 5, 2009
  50. 50. ion PUT TOGETHER ey) user_dimension order_fact id (PK) user_id (FK) internal_user_id (business key) mfg shipping_location_id (FK) username tory billing_location_id (FK) first_name in payment_method_id (FK) last_name line_item_group_id (FK) sion company order_timestamp (fact) city total (fact) key) state subtotal (fact) mailing_list_opt_in shipping_cost (fact) mfg ntory in Thursday, March 5, 2009
  51. 51. LET’S SEE ANOTHER DEMO! Thursday, March 5, 2009
  52. 52. NOW WHAT? REPORTING OLAP TOOLS BUSINESS DASHBOARDS DATA LIFE CYCLE OPTIONS Thursday, March 5, 2009
  53. 53. A CRITIQUE DOES VISUAL PROGRAMMING REALLY WORK? Thursday, March 5, 2009
  54. 54. A CRITIQUE WHY NOT JUST USE GROOVY? Thursday, March 5, 2009
  55. 55. COMMERCIAL OPTIONS THERE ARE MANY Thursday, March 5, 2009
  56. 56. COMMERCIAL OPTIONS THEY ALL PROBABLY INVOLVE GOLF Thursday, March 5, 2009
  57. 57. COMMERCIAL OPTIONS THIS IS DOSUG, SO WE’LL MOVE ON Thursday, March 5, 2009
  58. 58. ACKNOWLEDGEMENTS THANKS TO www.intellidata.net FOR THE TEST DATA! Thursday, March 5, 2009
  59. 59. THANK YOU! TIM BERGLUND AUGUST TECHNOLOGY GROUP, LLC http://www.augusttechgroup.com tim.berglund@augusttechgroup.com @tlberglund Thursday, March 5, 2009
  60. 60. PHOTO CREDITS OIL DRUMS: HTTP://WWW.FLICKR.COM/PHOTOS/THE_JUSTIFIED_SINNER/2720599186/ JUNGLE: HTTP://WWW.FLICKR.COM/PHOTOS/LOLLYKNIT/1155225799/ FRENCH GARDEN: HTTP://WWW.FLICKR.COM/PHOTOS/NOMAD-PHOTOGRAPHY/23295537/ SNOWFLAKE: HTTP://WWW.FLICKR.COM/PHOTOS/JOHNCHARLTON/360919818/ Thursday, March 5, 2009

×