SlideShare uma empresa Scribd logo
1 de 7
Baixar para ler offline
Ingestion 101

                                            Presenter: Oleg Krook

                                                           September 29-30, 2012
                                                                Boston, MA



Contains Company Confidential Material – Do Not Disclose
Ingestion Pipeline Overview


        Landing Zone provides an entry point for data

        Input data is defined in Ed-Fi format
        Found at http://www.ed-fi.org/technical-documentation/

        Two input methods supported:
        •XML files followed by a control file
        •compressed ZIP file containing above files




 Contains Company Confidential Material – Do Not Disclose
Anatomy of an ingestion job Control files, Ed-Fi



      Control File Format


      The control file will be used solely as to define the set of inbound data files, and to perform
      basic integrity checking on these files. It contains a row of comma-separated values for each
      data file. Leading/trailing spaces are considered part of the values and will not be trimmed. The
      last value in any row must not be followed by a comma.


      The row format is:
      <file format>,<file type>,<file name>,<file checksum>


      , where
      <file format> Specifies the file format.
      At this time, edfi-xml is the only supported file format
      <file type> Represents the type of object(s) found in the file.
      In the case of Ed-Fi XML, the file type maps to the name of the appropriate interchange
      schema.




 Contains Company Confidential Material – Do Not Disclose
Anatomy of an ingestion job Control files, Ed-Fi Cont.



              <file name> Specifies the file's name.
              File names are case sensitive. This field may or may not be enclosed in double quotes.
              File names containing double quotes and/or commas should be enclosed in double-
              quotes.
              A double-quote appearing inside a field must be escaped by preceding it with another
              double quote.


              <file checksum> Is the file's MD5 checksum.
              The MD5 checksum is expressed as 32 hexadecimal digits with alphabetic characters
              always in lowercase.




 Contains Company Confidential Material – Do Not Disclose
Anatomy of an ingestion job Control files, Ed-Fi Cont.



      The control file format allows for specification of job-level parameters. These are
      specified in the control file as line entries preceded with the @ symbol.

      The following table describes the parameters that are currently supported in the control
      file:
      @dry-run
      Indicates that the results of ingestion processing should not be written to the core data store.
      @purge
      Deletes all previously ingested data from this tenant. All other content of the control file is
      ignored.


      A job control file may look as follows:


      @dry-run
      edfi-xml,StudentEnrollment,data.xml,756a5e96e330082424b83902908b070a




 Contains Company Confidential Material – Do Not Disclose
Error/Status Logs



      In the course of ingestion several log files are created and placed in the landing zone.
      These files are used to capture warning and errors at job level (per control file) or at
      resource level (per XML file within job).

      job-<jobId>.log                  Once for every job        INFO <jobId information>
                                                                 INFO [file] <resourceId> (<internalschema>)
                                                                 INFO [file] <resourceId> records considered: <#>
                                                                 INFO [file] <resourceId> records ingested successfully: <#>
                                                                 INFO [file] <resourceId> records failed: <#>
                                                                 INFO [configProperty] <list of config parameters>
                                                                 INFO <All|#> records process successfully
                                                                 INFO Processed <#> records


      job_warn-<jobId>.log             Job-level (non-resource   WARN <warning detail>
                                       specific)
                                       warnings present

      job_error-<jobId>.log            Job-level (non-resource   ERROR <error detail>
                                       specific)
                                       errors present

      warn.<resourceId>-               Resource-level            WARN <warning detail>
      <jobId>.log                      warnings present

      error.<resourceId>-              Resource-level            ERROR <error detail>
      <jobId>.log                      errors present




 Contains Company Confidential Material – Do Not Disclose
Offline Validation Tool



      Offline Validation Tool is an open sourced tool, to provide a
      way to check the format of the ingestion files for Ed-Fi
      format compliance before they get transmitted for ingestion.

      This provide an opportunity to check the file format on the
      spot instead of waiting to transmit and process the file on
      the SLI side.

      This tool only checks for structure, XML compliance, but
      does not check for referential integrity of data.




 Contains Company Confidential Material – Do Not Disclose

Mais conteúdo relacionado

Destaque

Une sécurité basée sur la confiance et non pas sur la peur
Une sécurité basée sur la confiance et non pas sur la peurUne sécurité basée sur la confiance et non pas sur la peur
Une sécurité basée sur la confiance et non pas sur la peurBee_Ware
 
10 features to check out in your subscription management solution
10 features to check out in your subscription management solution10 features to check out in your subscription management solution
10 features to check out in your subscription management solutionTechcello
 
Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013
Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013
Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013Sonera
 
Jam 2012 решение: vito цвет
Jam 2012 решение: vito цветJam 2012 решение: vito цвет
Jam 2012 решение: vito цветJOHNNY JOKER
 
USAID Module 1: Understanding Value Chains Presentation
USAID Module 1:  Understanding Value Chains PresentationUSAID Module 1:  Understanding Value Chains Presentation
USAID Module 1: Understanding Value Chains PresentationMarket Development Training
 
Serial total video converter
Serial total video converterSerial total video converter
Serial total video convertersandro_junior
 
Gizarte lana aymar apirilak 12an entregatua
Gizarte lana  aymar apirilak 12an entregatuaGizarte lana  aymar apirilak 12an entregatua
Gizarte lana aymar apirilak 12an entregatuaTeresa Olabarria
 
Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...
Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...
Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...Clio - Cloud-Based Legal Technology
 
Class Notes for Friday, November 26
Class Notes for Friday, November 26Class Notes for Friday, November 26
Class Notes for Friday, November 26Eric Olander
 

Destaque (14)

Une sécurité basée sur la confiance et non pas sur la peur
Une sécurité basée sur la confiance et non pas sur la peurUne sécurité basée sur la confiance et non pas sur la peur
Une sécurité basée sur la confiance et non pas sur la peur
 
10 features to check out in your subscription management solution
10 features to check out in your subscription management solution10 features to check out in your subscription management solution
10 features to check out in your subscription management solution
 
Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013
Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013
Tehosta kiinteistöjen hallintaa! - Jari Kenttä – Sonera – Kuntamarkkinat 2013
 
Winter Survival Kit for Lawyers
Winter Survival Kit for LawyersWinter Survival Kit for Lawyers
Winter Survival Kit for Lawyers
 
La sexualidad
La sexualidadLa sexualidad
La sexualidad
 
Jam 2012 решение: vito цвет
Jam 2012 решение: vito цветJam 2012 решение: vito цвет
Jam 2012 решение: vito цвет
 
P5hflowbravo
P5hflowbravoP5hflowbravo
P5hflowbravo
 
Open Data Apps Contest
Open Data Apps ContestOpen Data Apps Contest
Open Data Apps Contest
 
USAID Module 1: Understanding Value Chains Presentation
USAID Module 1:  Understanding Value Chains PresentationUSAID Module 1:  Understanding Value Chains Presentation
USAID Module 1: Understanding Value Chains Presentation
 
Day 3 Skills Building Training
Day 3 Skills Building Training Day 3 Skills Building Training
Day 3 Skills Building Training
 
Serial total video converter
Serial total video converterSerial total video converter
Serial total video converter
 
Gizarte lana aymar apirilak 12an entregatua
Gizarte lana  aymar apirilak 12an entregatuaGizarte lana  aymar apirilak 12an entregatua
Gizarte lana aymar apirilak 12an entregatua
 
Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...
Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...
Cloud, Mobile and Social: Three Industry Trends Of Which Every Lawyer Should ...
 
Class Notes for Friday, November 26
Class Notes for Friday, November 26Class Notes for Friday, November 26
Class Notes for Friday, November 26
 

Semelhante a Slc ingestion presentation-boston_sep2012

Writing Swift code with great testability
Writing Swift code with great testabilityWriting Swift code with great testability
Writing Swift code with great testabilityJohn Sundell
 
Pf cs102 programming-8 [file handling] (1)
Pf cs102 programming-8 [file handling] (1)Pf cs102 programming-8 [file handling] (1)
Pf cs102 programming-8 [file handling] (1)Abdullah khawar
 
basics of file handling
basics of file handlingbasics of file handling
basics of file handlingpinkpreet_kaur
 
Basics of file handling
Basics of file handlingBasics of file handling
Basics of file handlingpinkpreet_kaur
 
Auditing security of Oracle DB (Karel Miko)
Auditing security of Oracle DB (Karel Miko)Auditing security of Oracle DB (Karel Miko)
Auditing security of Oracle DB (Karel Miko)DCIT, a.s.
 
Secure Programming
Secure ProgrammingSecure Programming
Secure Programmingalpha0
 
Salesforce Admin's guide : the data loader from the command line
Salesforce Admin's guide : the data loader from the command lineSalesforce Admin's guide : the data loader from the command line
Salesforce Admin's guide : the data loader from the command lineCyrille Coeurjoly
 
BGOUG 2012 - Drag & drop and other stuff - Using your database as a file server
BGOUG 2012 - Drag & drop and other stuff - Using your database as a file serverBGOUG 2012 - Drag & drop and other stuff - Using your database as a file server
BGOUG 2012 - Drag & drop and other stuff - Using your database as a file serverMarco Gralike
 
DevOpsDays InSpec Workshop
DevOpsDays InSpec WorkshopDevOpsDays InSpec Workshop
DevOpsDays InSpec WorkshopMandi Walls
 
Designing Tools and Implementing Workflows to Enhance Serials EDI
Designing Tools and Implementing Workflows to Enhance Serials EDIDesigning Tools and Implementing Workflows to Enhance Serials EDI
Designing Tools and Implementing Workflows to Enhance Serials EDIChristian Burris
 
java programming languageThe attached A12.txt file which has 2 col.pdf
java programming languageThe attached A12.txt file which has 2 col.pdfjava programming languageThe attached A12.txt file which has 2 col.pdf
java programming languageThe attached A12.txt file which has 2 col.pdfnitinarora01
 
Chapter 10.1
Chapter 10.1Chapter 10.1
Chapter 10.1sotlsoc
 
pointer, structure ,union and intro to file handling
 pointer, structure ,union and intro to file handling pointer, structure ,union and intro to file handling
pointer, structure ,union and intro to file handlingRai University
 
UKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File Server
UKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File ServerUKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File Server
UKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File ServerMarco Gralike
 
Firebird Security (in English): The Past and The Future
Firebird Security (in English): The Past and The FutureFirebird Security (in English): The Past and The Future
Firebird Security (in English): The Past and The FutureAlexey Kovyazin
 

Semelhante a Slc ingestion presentation-boston_sep2012 (20)

Writing Swift code with great testability
Writing Swift code with great testabilityWriting Swift code with great testability
Writing Swift code with great testability
 
Pf cs102 programming-8 [file handling] (1)
Pf cs102 programming-8 [file handling] (1)Pf cs102 programming-8 [file handling] (1)
Pf cs102 programming-8 [file handling] (1)
 
basics of file handling
basics of file handlingbasics of file handling
basics of file handling
 
Basics of file handling
Basics of file handlingBasics of file handling
Basics of file handling
 
Data file handling
Data file handlingData file handling
Data file handling
 
Auditing security of Oracle DB (Karel Miko)
Auditing security of Oracle DB (Karel Miko)Auditing security of Oracle DB (Karel Miko)
Auditing security of Oracle DB (Karel Miko)
 
File in cpp 2016
File in cpp 2016 File in cpp 2016
File in cpp 2016
 
Secure Programming
Secure ProgrammingSecure Programming
Secure Programming
 
Salesforce Admin's guide : the data loader from the command line
Salesforce Admin's guide : the data loader from the command lineSalesforce Admin's guide : the data loader from the command line
Salesforce Admin's guide : the data loader from the command line
 
BGOUG 2012 - Drag & drop and other stuff - Using your database as a file server
BGOUG 2012 - Drag & drop and other stuff - Using your database as a file serverBGOUG 2012 - Drag & drop and other stuff - Using your database as a file server
BGOUG 2012 - Drag & drop and other stuff - Using your database as a file server
 
Linux Internals Part - 2
Linux Internals Part - 2Linux Internals Part - 2
Linux Internals Part - 2
 
DevOpsDays InSpec Workshop
DevOpsDays InSpec WorkshopDevOpsDays InSpec Workshop
DevOpsDays InSpec Workshop
 
Designing Tools and Implementing Workflows to Enhance Serials EDI
Designing Tools and Implementing Workflows to Enhance Serials EDIDesigning Tools and Implementing Workflows to Enhance Serials EDI
Designing Tools and Implementing Workflows to Enhance Serials EDI
 
java programming languageThe attached A12.txt file which has 2 col.pdf
java programming languageThe attached A12.txt file which has 2 col.pdfjava programming languageThe attached A12.txt file which has 2 col.pdf
java programming languageThe attached A12.txt file which has 2 col.pdf
 
Chapter 10.1
Chapter 10.1Chapter 10.1
Chapter 10.1
 
SOFTCAMP SHIELDEX AD
SOFTCAMP SHIELDEX AD SOFTCAMP SHIELDEX AD
SOFTCAMP SHIELDEX AD
 
pointer, structure ,union and intro to file handling
 pointer, structure ,union and intro to file handling pointer, structure ,union and intro to file handling
pointer, structure ,union and intro to file handling
 
UKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File Server
UKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File ServerUKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File Server
UKOUG 2011 - Drag, Drop and other Stuff. Using your Database as a File Server
 
File handaling
File handalingFile handaling
File handaling
 
Firebird Security (in English): The Past and The Future
Firebird Security (in English): The Past and The FutureFirebird Security (in English): The Past and The Future
Firebird Security (in English): The Past and The Future
 

Mais de SLC is now inBloom!

Cognitive Maps by Gagan Annamreddy
Cognitive Maps by Gagan AnnamreddyCognitive Maps by Gagan Annamreddy
Cognitive Maps by Gagan AnnamreddySLC is now inBloom!
 
Herding Cats 2.0 by Team Blended Cats
Herding Cats 2.0 by Team Blended CatsHerding Cats 2.0 by Team Blended Cats
Herding Cats 2.0 by Team Blended CatsSLC is now inBloom!
 
Shared Learning Collaborative (SLC) Overview
Shared Learning Collaborative (SLC) OverviewShared Learning Collaborative (SLC) Overview
Shared Learning Collaborative (SLC) OverviewSLC is now inBloom!
 
Concept: Data Driven Professional Development by Team: Double Ds
Concept: Data Driven Professional Development by Team: Double DsConcept: Data Driven Professional Development by Team: Double Ds
Concept: Data Driven Professional Development by Team: Double DsSLC is now inBloom!
 
Concept: Intelli-Seats by The A-Team
Concept: Intelli-Seats by The A-Team Concept: Intelli-Seats by The A-Team
Concept: Intelli-Seats by The A-Team SLC is now inBloom!
 
Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...
Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...
Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...SLC is now inBloom!
 
Concept: Herding Cats by Team: Cool Cats
Concept: Herding Cats by Team: Cool CatsConcept: Herding Cats by Team: Cool Cats
Concept: Herding Cats by Team: Cool CatsSLC is now inBloom!
 
Concept: Personalizing Mastery by Team: Runnin' Rebels
Concept: Personalizing Mastery by Team: Runnin' RebelsConcept: Personalizing Mastery by Team: Runnin' Rebels
Concept: Personalizing Mastery by Team: Runnin' RebelsSLC is now inBloom!
 
Slc technology101 boston-sep2012
Slc technology101 boston-sep2012Slc technology101 boston-sep2012
Slc technology101 boston-sep2012SLC is now inBloom!
 
Slc dashboard presentation-boston_sep2012
Slc dashboard presentation-boston_sep2012Slc dashboard presentation-boston_sep2012
Slc dashboard presentation-boston_sep2012SLC is now inBloom!
 
Slc camp technology getting started and api deep dive-boston_sep2012
Slc camp technology getting started and api deep dive-boston_sep2012Slc camp technology getting started and api deep dive-boston_sep2012
Slc camp technology getting started and api deep dive-boston_sep2012SLC is now inBloom!
 
SLC tagging content-Chicago_sept2012
SLC tagging content-Chicago_sept2012SLC tagging content-Chicago_sept2012
SLC tagging content-Chicago_sept2012SLC is now inBloom!
 
Slc entity fam diagrams v4 06072012_lrg_map
Slc entity fam diagrams v4 06072012_lrg_mapSlc entity fam diagrams v4 06072012_lrg_map
Slc entity fam diagrams v4 06072012_lrg_mapSLC is now inBloom!
 
Slc data mapping supplement v5 06252012
Slc data mapping supplement v5 06252012Slc data mapping supplement v5 06252012
Slc data mapping supplement v5 06252012SLC is now inBloom!
 

Mais de SLC is now inBloom! (20)

3R radar
3R radar3R radar
3R radar
 
Note e-fi team case nex
Note e-fi team case nexNote e-fi team case nex
Note e-fi team case nex
 
Rapid RtL - Forefront Math
Rapid RtL - Forefront MathRapid RtL - Forefront Math
Rapid RtL - Forefront Math
 
Kidget
KidgetKidget
Kidget
 
Intelliseats 2012
Intelliseats 2012Intelliseats 2012
Intelliseats 2012
 
Cognitive Maps by Gagan Annamreddy
Cognitive Maps by Gagan AnnamreddyCognitive Maps by Gagan Annamreddy
Cognitive Maps by Gagan Annamreddy
 
Herding Cats 2.0 by Team Blended Cats
Herding Cats 2.0 by Team Blended CatsHerding Cats 2.0 by Team Blended Cats
Herding Cats 2.0 by Team Blended Cats
 
Shared Learning Collaborative (SLC) Overview
Shared Learning Collaborative (SLC) OverviewShared Learning Collaborative (SLC) Overview
Shared Learning Collaborative (SLC) Overview
 
SLC Camp Denver Bounty Rules
SLC Camp Denver Bounty RulesSLC Camp Denver Bounty Rules
SLC Camp Denver Bounty Rules
 
Concept: Data Driven Professional Development by Team: Double Ds
Concept: Data Driven Professional Development by Team: Double DsConcept: Data Driven Professional Development by Team: Double Ds
Concept: Data Driven Professional Development by Team: Double Ds
 
Concept: Intelli-Seats by The A-Team
Concept: Intelli-Seats by The A-Team Concept: Intelli-Seats by The A-Team
Concept: Intelli-Seats by The A-Team
 
Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...
Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...
Concept: Refining RTI: A systems approach to unifying scores by Team: The Ble...
 
Concept: Herding Cats by Team: Cool Cats
Concept: Herding Cats by Team: Cool CatsConcept: Herding Cats by Team: Cool Cats
Concept: Herding Cats by Team: Cool Cats
 
Concept: Personalizing Mastery by Team: Runnin' Rebels
Concept: Personalizing Mastery by Team: Runnin' RebelsConcept: Personalizing Mastery by Team: Runnin' Rebels
Concept: Personalizing Mastery by Team: Runnin' Rebels
 
Slc technology101 boston-sep2012
Slc technology101 boston-sep2012Slc technology101 boston-sep2012
Slc technology101 boston-sep2012
 
Slc dashboard presentation-boston_sep2012
Slc dashboard presentation-boston_sep2012Slc dashboard presentation-boston_sep2012
Slc dashboard presentation-boston_sep2012
 
Slc camp technology getting started and api deep dive-boston_sep2012
Slc camp technology getting started and api deep dive-boston_sep2012Slc camp technology getting started and api deep dive-boston_sep2012
Slc camp technology getting started and api deep dive-boston_sep2012
 
SLC tagging content-Chicago_sept2012
SLC tagging content-Chicago_sept2012SLC tagging content-Chicago_sept2012
SLC tagging content-Chicago_sept2012
 
Slc entity fam diagrams v4 06072012_lrg_map
Slc entity fam diagrams v4 06072012_lrg_mapSlc entity fam diagrams v4 06072012_lrg_map
Slc entity fam diagrams v4 06072012_lrg_map
 
Slc data mapping supplement v5 06252012
Slc data mapping supplement v5 06252012Slc data mapping supplement v5 06252012
Slc data mapping supplement v5 06252012
 

Último

Comparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptxComparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptxAvaniJani1
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Celine George
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
DiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfDiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfChristalin Nelson
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17Celine George
 
4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptxmary850239
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
DBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfDBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfChristalin Nelson
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptxmary850239
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...Nguyen Thanh Tu Collection
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEMISSRITIMABIOLOGYEXP
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfChristalin Nelson
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 

Último (20)

Comparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptxComparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptx
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
DiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfDiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdf
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
DBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfDBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdf
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdf
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 

Slc ingestion presentation-boston_sep2012

  • 1. Ingestion 101 Presenter: Oleg Krook September 29-30, 2012 Boston, MA Contains Company Confidential Material – Do Not Disclose
  • 2. Ingestion Pipeline Overview Landing Zone provides an entry point for data Input data is defined in Ed-Fi format Found at http://www.ed-fi.org/technical-documentation/ Two input methods supported: •XML files followed by a control file •compressed ZIP file containing above files Contains Company Confidential Material – Do Not Disclose
  • 3. Anatomy of an ingestion job Control files, Ed-Fi Control File Format The control file will be used solely as to define the set of inbound data files, and to perform basic integrity checking on these files. It contains a row of comma-separated values for each data file. Leading/trailing spaces are considered part of the values and will not be trimmed. The last value in any row must not be followed by a comma. The row format is: <file format>,<file type>,<file name>,<file checksum> , where <file format> Specifies the file format. At this time, edfi-xml is the only supported file format <file type> Represents the type of object(s) found in the file. In the case of Ed-Fi XML, the file type maps to the name of the appropriate interchange schema. Contains Company Confidential Material – Do Not Disclose
  • 4. Anatomy of an ingestion job Control files, Ed-Fi Cont. <file name> Specifies the file's name. File names are case sensitive. This field may or may not be enclosed in double quotes. File names containing double quotes and/or commas should be enclosed in double- quotes. A double-quote appearing inside a field must be escaped by preceding it with another double quote. <file checksum> Is the file's MD5 checksum. The MD5 checksum is expressed as 32 hexadecimal digits with alphabetic characters always in lowercase. Contains Company Confidential Material – Do Not Disclose
  • 5. Anatomy of an ingestion job Control files, Ed-Fi Cont. The control file format allows for specification of job-level parameters. These are specified in the control file as line entries preceded with the @ symbol. The following table describes the parameters that are currently supported in the control file: @dry-run Indicates that the results of ingestion processing should not be written to the core data store. @purge Deletes all previously ingested data from this tenant. All other content of the control file is ignored. A job control file may look as follows: @dry-run edfi-xml,StudentEnrollment,data.xml,756a5e96e330082424b83902908b070a Contains Company Confidential Material – Do Not Disclose
  • 6. Error/Status Logs In the course of ingestion several log files are created and placed in the landing zone. These files are used to capture warning and errors at job level (per control file) or at resource level (per XML file within job). job-<jobId>.log Once for every job INFO <jobId information> INFO [file] <resourceId> (<internalschema>) INFO [file] <resourceId> records considered: <#> INFO [file] <resourceId> records ingested successfully: <#> INFO [file] <resourceId> records failed: <#> INFO [configProperty] <list of config parameters> INFO <All|#> records process successfully INFO Processed <#> records job_warn-<jobId>.log Job-level (non-resource WARN <warning detail> specific) warnings present job_error-<jobId>.log Job-level (non-resource ERROR <error detail> specific) errors present warn.<resourceId>- Resource-level WARN <warning detail> <jobId>.log warnings present error.<resourceId>- Resource-level ERROR <error detail> <jobId>.log errors present Contains Company Confidential Material – Do Not Disclose
  • 7. Offline Validation Tool Offline Validation Tool is an open sourced tool, to provide a way to check the format of the ingestion files for Ed-Fi format compliance before they get transmitted for ingestion. This provide an opportunity to check the file format on the spot instead of waiting to transmit and process the file on the SLI side. This tool only checks for structure, XML compliance, but does not check for referential integrity of data. Contains Company Confidential Material – Do Not Disclose