Alexander (‘Sasha’) Schwarzman, AGU Extreme Markup Languages 2006, Montréal, Canada Page 1 of 2
(sschwarzman@agu.org) August 7 – 11, 2006
Developing an STM DTD/Schema:
Strategic Design Choices
Alexander (‘Sasha’) Schwarzman, AGU (sschwarzman@agu.org)
Extreme Markup Languages 2006, Montréal, Canada
August 7 – 11, 2006
Requirements
 Does an agreed-upon Requirements document exist? (Get one!)
 What is your XML’s role?
 Archival copy-of-record (preserving scientific content)?
 Means of producing a pretty PDF?
 Both?
 Much more?
Architecture
 When during production is XML created? How is accuracy checked at each stage?
 Dummy empty elements for not-yet-assigned metadata plus use of configurable
production-stage-specific Business Rules Checker / Validator / QC Tool?
 Multiple DTDs: a separate one for each production stage?
 XML “layering”: What “layer” to use for enforcing editorial style and business
rules?
 DTD / parser?
 Validator / Schematron?
 Human editors?
 Revisable unit (what is the elemental unit?)
 Article?
 Issue?
 Arbitrary / cross-journal article collection?
 Volume / year?
 Journal?
 More than one of these?
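
One way to picture the "dummy empty elements plus a production-stage-specific checker" option from the Architecture questions is the minimal sketch below. The element names (`title`, `doi`, `volume`), the stage names, and the DOI value are hypothetical placeholders, not taken from any real DTD or workflow; the rule table stands in for whatever a configurable QC tool would load.

```python
import xml.etree.ElementTree as ET

# Hypothetical rules: metadata elements that must be filled in by each stage.
REQUIRED_BY_STAGE = {
    "copyedit": ["title"],
    "issue-assignment": ["title", "doi", "volume"],
}

def check_stage(xml_text, stage):
    """Return messages for required elements that are missing or still empty."""
    root = ET.fromstring(xml_text)
    problems = []
    for name in REQUIRED_BY_STAGE[stage]:
        el = root.find(f".//{name}")
        if el is None:
            problems.append(f"{name}: missing")
        elif not (el.text or "").strip():
            problems.append(f"{name}: empty placeholder")
    return problems

# <volume/> is a dummy empty element: structurally present, not yet assigned.
ARTICLE = (
    "<article><meta><title>Storm surge</title>"
    "<doi>10.0000/example</doi><volume/></meta></article>"
)

print(check_stage(ARTICLE, "copyedit"))          # []
print(check_stage(ARTICLE, "issue-assignment"))  # ['volume: empty placeholder']
```

The same instance passes at an early stage and fails at a later one, so a single DTD can stay permissive while the stage-specific rules tighten outside it.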
Scope
 For what material?
 Current?
 Future-only?
 Legacy?
 All of the above or some combination?
 What is the extent of an article / book?
 Does it include supplementary material, like datasets and computable spreadsheets?
 Do you model “extra stuff” as just another structured section or is it something different?
 Special links (“related links”) section?
Modeling Language Choices
 Which constraint language is primary?
 DTD?
 XSD?
 RELAX NG?
 How many DTDs / schemas (purpose of each)?
 Authoring?
 Conversion / Transformation?
 Production?
 Archiving?
 Separate or shared: If your content includes journal article, newspaper article, book
chapter, book, case study, lecture notes, etc., should you use:
 Distinct DTD / schema for each?
 A large shared structure?
 A DTD / schema suite with common modules?
 “Off-the-shelf, Altered-to-fit, or Bespoke?” (T. Usdin)
 If altered, what public model?
 “compatible with” or “informed by” (subset or superset)?
 If bespoke, do you use any public models at all (for tables and math, for instance)?
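
For the "subset or superset" question, a rough first test is whether a document uses only element names that the public model also defines. The sketch below assumes a hypothetical element inventory (`PUBLIC_MODEL_ELEMENTS`) that is illustrative only and not copied from any real public DTD; it compares vocabulary, not content models, so it can only rule compatibility out, never confirm it.

```python
import xml.etree.ElementTree as ET

# Hypothetical element inventory of a public model (illustrative only).
PUBLIC_MODEL_ELEMENTS = {"article", "front", "body", "sec", "title", "p"}

def elements_outside_model(xml_text, allowed=PUBLIC_MODEL_ELEMENTS):
    """Return the sorted element names used in xml_text that the model lacks."""
    root = ET.fromstring(xml_text)
    used = {el.tag for el in root.iter()}  # iter() includes the root element
    return sorted(used - allowed)

SUBSET_DOC = (
    "<article><front><title>T</title></front>"
    "<body><sec><p>x</p></sec></body></article>"
)
SUPERSET_DOC = (
    "<article><body><sec><p>x</p>"
    "<dataset-ref>d1</dataset-ref></sec></body></article>"
)

print(elements_outside_model(SUBSET_DOC))    # []
print(elements_outside_model(SUPERSET_DOC))  # ['dataset-ref']
```

An empty result is consistent with a subset ("compatible with") design; any extra names mean the local model is a superset and documents would need a transform before exchange.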
Modeling Design Choices
 “Prussian” or “Californian”: prescriptive or descriptive? Flexible or enforcing?
 Generated or Explicit text? (depends on XML’s role)
 Preserve generation / rendition rules?
 Different approach for text and bibliographic references?
 How to model bibliographic references?
 Mixed content?
 Genre-specific “strict models” (with an escape hatch provided)?
 “Tag abuse” tolerance?
 How to reference non-XML components, e.g., figures, in XML?
 By an ID that maps to a set of multiple images in an archive?
 By naming a specific file from the set? Which one is “the mother of all images”?
 Which components to store / migrate? Is “storing cheaper than thinking”? (D. Lapeyre)
 How to model math?
 MathML presentation versus content (computation)?
 How to ensure that the same math symbols look identical in different browsers (the same Unicode
code points can render differently across browsers, e.g., epsilon and varepsilon)?
 LaTeX plus GIFs?
 How to ensure that special characters that occur both in a displayed formula and inline look
identical?
 Just GIFs?
 “Just because you can, doesn’t mean you should” (D. Lapeyre)
 The lure of modeling for its own sake. Simplicity maintains better over time
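
The "reference by ID" option for non-XML components can be sketched as follows. The `fig` element, its `ref` attribute, the manifest layout, and the file names are all hypothetical; the point is only that the XML names a figure once and the choice of rendition is deferred to whatever consumes it.

```python
import xml.etree.ElementTree as ET

# Hypothetical archive manifest: one figure ID maps to several renditions.
ARCHIVE = {
    "fig01": {"tiff": "fig01.tif", "gif": "fig01.gif", "eps": "fig01.eps"},
}

def resolve_figures(xml_text, rendition):
    """Map each <fig ref="..."/> to the file for the requested rendition."""
    root = ET.fromstring(xml_text)
    return [ARCHIVE[fig.get("ref")][rendition] for fig in root.iter("fig")]

DOC = '<article><fig ref="fig01"/></article>'

print(resolve_figures(DOC, "gif"))   # ['fig01.gif']
print(resolve_figures(DOC, "tiff"))  # ['fig01.tif']
```

Naming a specific file in the XML instead would remove the lookup step, at the cost of hard-coding one rendition into the copy-of-record.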

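On the epsilon question: Unicode has two distinct epsilon code points, and one part of the consistency problem is detecting when a displayed formula and the inline text use different ones for the same symbol. The sketch below uses only the standard `unicodedata` module; the sample strings are invented for illustration. It does not address font-dependent rendering of a single code point, which is a separate issue.

```python
import unicodedata

EPSILON = "\u03b5"         # GREEK SMALL LETTER EPSILON
LUNATE_EPSILON = "\u03f5"  # GREEK LUNATE EPSILON SYMBOL

def describe(text):
    """List (code point, Unicode name) for every non-ASCII character in text."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch))
        for ch in text
        if ord(ch) > 127
    ]

def epsilon_variants(text):
    """Return the set of epsilon code points that text uses."""
    return {ch for ch in text if ch in (EPSILON, LUNATE_EPSILON)}

# Invented samples: a displayed formula and the inline text that refers to it.
displayed = "\u03f5 = 0.1"
inline = "where \u03b5 is small"

print(describe(displayed))  # [('U+03F5', 'GREEK LUNATE EPSILON SYMBOL')]
print(describe(inline))     # [('U+03B5', 'GREEK SMALL LETTER EPSILON')]
print(len(epsilon_variants(displayed + inline)) > 1)  # True: the two forms are mixed
```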