SlideShare uma empresa Scribd logo
1 de 21
Baixar para ler offline
• An abstract data set used to describe information contained in an
  (well-formed) XML document
• Provide a consistent set of definitions for use in other specifications
  that need to refer to the information in a well-formed XML document


• Not exhaustive; Include only those that are expected to be useful in
  future specifications
• Not minimum set of information that must be returned by an XML
  processor
• Analogous to tree
 Each XML document has an information set if it is well-formed
  and satisfies some namespace constraints
  • Not require to be valid
  • May be created by methods other than parsing an XML document
 XML document’s infoset
  • Consists of a a number of information items
  • At least a document information item and several others

 Information item
  • An abstraction description of some part of an XML document
  • Has a set of acossiated named properties

 Have 11 types of information items
 Information set is same as a tree
 Information item is same as a node of tree
 Have 11 types of information items
  1.   Document                          7. Comment
  2.   Element                           8. The Document Type Declaration
  3.   Attribute                         9. Unparsed Entity
  4.   Processing Instruction            10. Notation
  5.   Unexpanded Entity Reference       11. Namespace
  6.   Character

 Each information item has properties
  • Property named ‘xyz’ is indicated by [xyz]
 There is exactly one document information item in the infoset of
  an XML document
 All other information items are accessible from the properties of
  the document information item, either directly or indirectly
  through the properties of other information items
 Has properties
  • [children]           • [unparsed entities]           • [standalone]
  • [document element]   • [baseURI]                     • [version]
  • [notations]          • [character encoding scheme]   • [all declarations processed]
 There is an element information item for each element
  appearing in the XML document
  • One of the element information items is the value of the [document element]
    property of the document information item, corresponding to the root of the element
    tree, and

  • All other element information items are accessible by recursively following its
    [children] property

 Has properties
  • [namespace name]       • [children]                      • [in-scope namespaces]
  • [local name]           • [attributes]                    • [base URI]
  • [prefix]               • [namespace attributes]          • [parent]
 There is an attribute information item for each attribute
  (specified or defaulted) of each element in the document
  • including those which are namespace declarations

  • The latter however appear as members of an element's [namespace attributes]
    property rather than its [attributes] property

 Has properties
  • [namespace name]         • [normailized value]     • [references]
  • [local name]             • [specified]             • [owner element]
  • [prefix]                 • [attribute type]
 There is a processing instruction information item for each
  processing instruction in the document

 The XML declaration and text declarations for external parsed
  entities are not considered processing instructions

 Has properties
       • [target]     • [notation]
       • [content]    • [parent]
       • [base URI]
 A unexpanded entity reference information item serves as a
  placeholder by which an XML processor can indicate that it has
  not expanded an external parsed entity
 A validating XML processor, or a non-validating processor that
  reads all external general entities, will never generate
  unexpanded entity reference information items for a valid
  document.
 Has properties
       • [name]                • [declaration base URI]
       • [system identifier]   • [parent]
       • [public identifier]
 There is a character information item for each data character
  that appears in the document, whether literally, as a character
  reference, or within a CDATA section

 Each character is a logically separate information item, but XML
  applications are free to chunk characters into larger groups as
  necessary or desirable

 Has properties
       • [character code]               • [parent]
       • [element content whitespace]
 There is a comment information item for each XML comment
  in the original document, except for those appearing in the DTD
  (which are not represented)

 Has properties
       • [content]          • [parent]
 If the XML document has a document type declaration, then the
  information set contains a single document type declaration
  information item
 Note that entities and notations are provided as properties of
  the document information item, not the document type
  declaration information item
 Has properties
       • [system identifier]   • [children]
       • [public identifier]   • [parent]
 There is an unparsed entity information item for each
  unparsed general entity declared in the DTD

 Has properties
    • [name]                • [declaration base URI]
    • [system identifier]   • [notation name]
    • [public identifier]   • [notation]
 There is a notation information item for each notation
  declared in the DTD

 Has properties
       • [name]                • [public identifier]
       • [system identifier]   • [declaration base URI]
 Each element in the document has a namespace information
  item for each namespace that is in scope for that element

 Has properties
       • [prefix]        • [namespace name]
 Information Sets are extensible

 New recommendations can associate properties with info items
  by adding properties

 For example, XML Schema adds properties to the infoset to
  record the results of validation
  • Post-Schema -Validation Infoset (PSVI)

 Proprietary software can add their own properties too
1. The content models of elements, from ELEMENT declarations in the DTD.
2. The grouping and ordering of attribute declarations in ATTLIST declarations.
3. The order of attributes within a start-tag.
4. The document type name.
5. White space outside the document element.
6. White space immediately following the target name of a PI.
7. Whether characters are represented by character references.
8. White space within start-tags (other than significant white space in attribute
   values) and end-tags.
9. The difference between the two forms of an empty element: <foo/> and
   <foo></foo>.
10. The difference between CR, CR-LF, and LF line termination.
11. The order of declarations within the DTD.
12. The boundaries of conditional sections in the DTD.
13. The boundaries of parameter entities in the DTD.
14. The boundaries of general parsed entities.
15. The boundaries of CDATA marked sections.
16. Comments in the DTD.
17. The location of declarations (whether in internal or external subset or
    parameter entities).
18. Any ignored declarations, including those within an IGNORE conditional
    section, as well as entity and attribute declarations ignored because
    previous declarations override them.
19. The kind of quotation marks (single or double) used to quote attribute values.
20. The default value of attributes declared in the DTD.
2. Used in other specifications that need to refer to the
   information in a well-formed XML document
1. XML Information Set (Second Edition)

Mais conteúdo relacionado

Mais procurados

object oriented programing lecture 1
object oriented programing lecture 1object oriented programing lecture 1
object oriented programing lecture 1Geophery sanga
 
hidden surface elimination using z buffer algorithm
hidden surface elimination using z buffer algorithmhidden surface elimination using z buffer algorithm
hidden surface elimination using z buffer algorithmrajivagarwal23dei
 
Computer graphics(parametric cubic curves)
Computer graphics(parametric cubic curves)Computer graphics(parametric cubic curves)
Computer graphics(parametric cubic curves)Sinthia Sarker
 
Proxy design pattern (Class Ambassador)
Proxy design pattern (Class Ambassador)Proxy design pattern (Class Ambassador)
Proxy design pattern (Class Ambassador)Sameer Rathoud
 
DDA-line-drawing-algorithm.pptx
DDA-line-drawing-algorithm.pptxDDA-line-drawing-algorithm.pptx
DDA-line-drawing-algorithm.pptxSujataSonawane11
 
Issues in knowledge representation
Issues in knowledge representationIssues in knowledge representation
Issues in knowledge representationSravanthi Emani
 
Image colorization
Image colorizationImage colorization
Image colorizationPankti Fadia
 
OOPS with C++ | Concepts of OOPS | Introduction
OOPS with C++ | Concepts of OOPS | IntroductionOOPS with C++ | Concepts of OOPS | Introduction
OOPS with C++ | Concepts of OOPS | IntroductionADITYATANDONKECCSE
 
computer graphics
computer graphicscomputer graphics
computer graphicsMegabi Mamo
 
Graphics software standards
Graphics software standardsGraphics software standards
Graphics software standardsAnkit Garg
 

Mais procurados (20)

Basic image processing techniques
Basic image processing techniquesBasic image processing techniques
Basic image processing techniques
 
object oriented programing lecture 1
object oriented programing lecture 1object oriented programing lecture 1
object oriented programing lecture 1
 
Core java complete notes - Contact at +91-814-614-5674
Core java complete notes - Contact at +91-814-614-5674Core java complete notes - Contact at +91-814-614-5674
Core java complete notes - Contact at +91-814-614-5674
 
DDA algorithm
DDA algorithmDDA algorithm
DDA algorithm
 
hidden surface elimination using z buffer algorithm
hidden surface elimination using z buffer algorithmhidden surface elimination using z buffer algorithm
hidden surface elimination using z buffer algorithm
 
DART
DARTDART
DART
 
Computer graphics(parametric cubic curves)
Computer graphics(parametric cubic curves)Computer graphics(parametric cubic curves)
Computer graphics(parametric cubic curves)
 
Proxy design pattern (Class Ambassador)
Proxy design pattern (Class Ambassador)Proxy design pattern (Class Ambassador)
Proxy design pattern (Class Ambassador)
 
DDA-line-drawing-algorithm.pptx
DDA-line-drawing-algorithm.pptxDDA-line-drawing-algorithm.pptx
DDA-line-drawing-algorithm.pptx
 
Issues in knowledge representation
Issues in knowledge representationIssues in knowledge representation
Issues in knowledge representation
 
Proxy pattern
Proxy patternProxy pattern
Proxy pattern
 
Image colorization
Image colorizationImage colorization
Image colorization
 
PSO.ppt
PSO.pptPSO.ppt
PSO.ppt
 
Object representations
Object representationsObject representations
Object representations
 
Oops concepts
Oops conceptsOops concepts
Oops concepts
 
OOPS with C++ | Concepts of OOPS | Introduction
OOPS with C++ | Concepts of OOPS | IntroductionOOPS with C++ | Concepts of OOPS | Introduction
OOPS with C++ | Concepts of OOPS | Introduction
 
computer graphics
computer graphicscomputer graphics
computer graphics
 
Graphics software standards
Graphics software standardsGraphics software standards
Graphics software standards
 
Unit 1
Unit 1Unit 1
Unit 1
 
Proxy design pattern
Proxy design patternProxy design pattern
Proxy design pattern
 

Semelhante a XML Information set (20)

Web Technology Part 4
Web Technology Part 4Web Technology Part 4
Web Technology Part 4
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml and webdata
Xml and webdataXml and webdata
Xml and webdata
 
Xml basics
Xml basicsXml basics
Xml basics
 
Ch2 neworder
Ch2 neworderCh2 neworder
Ch2 neworder
 
1 xml fundamentals
1 xml fundamentals1 xml fundamentals
1 xml fundamentals
 
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
M.FLORENCE DAYANA WEB DESIGN -Unit 5   XMLM.FLORENCE DAYANA WEB DESIGN -Unit 5   XML
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
 
01 xml document structure
01 xml document structure01 xml document structure
01 xml document structure
 
XML
XMLXML
XML
 
XML, DTD & XSD Overview
XML, DTD & XSD OverviewXML, DTD & XSD Overview
XML, DTD & XSD Overview
 
Introduction to XML
Introduction to XMLIntroduction to XML
Introduction to XML
 
chapter 4 web authoring unit 4 xml.pptx
chapter 4 web authoring  unit 4 xml.pptxchapter 4 web authoring  unit 4 xml.pptx
chapter 4 web authoring unit 4 xml.pptx
 
Systems Analyst and Design - Data Dictionary
Systems Analyst and Design -  Data DictionarySystems Analyst and Design -  Data Dictionary
Systems Analyst and Design - Data Dictionary
 
XML DTD DOCUMENT TYPE DEFINITION
XML DTD DOCUMENT TYPE DEFINITIONXML DTD DOCUMENT TYPE DEFINITION
XML DTD DOCUMENT TYPE DEFINITION
 
Xml
XmlXml
Xml
 

Mais de Hoang Nguyen

GANs and Applications
GANs and ApplicationsGANs and Applications
GANs and ApplicationsHoang Nguyen
 
Scrum - An introduction
Scrum - An introductionScrum - An introduction
Scrum - An introductionHoang Nguyen
 
Introduction to Cross-platform App Development
Introduction to Cross-platform App DevelopmentIntroduction to Cross-platform App Development
Introduction to Cross-platform App DevelopmentHoang Nguyen
 
Conistency of random forests
Conistency of random forestsConistency of random forests
Conistency of random forestsHoang Nguyen
 
Trust - Digital Signature
Trust - Digital SignatureTrust - Digital Signature
Trust - Digital SignatureHoang Nguyen
 
SOME SECURITY CHALLENGES IN CLOUD COMPUTING
SOME SECURITY CHALLENGES  IN CLOUD COMPUTINGSOME SECURITY CHALLENGES  IN CLOUD COMPUTING
SOME SECURITY CHALLENGES IN CLOUD COMPUTINGHoang Nguyen
 
Information, Data and Decision Making
Information, Data and Decision MakingInformation, Data and Decision Making
Information, Data and Decision MakingHoang Nguyen
 
Multiple processor systems
Multiple processor systemsMultiple processor systems
Multiple processor systemsHoang Nguyen
 
Multiprocessor Systems
Multiprocessor SystemsMultiprocessor Systems
Multiprocessor SystemsHoang Nguyen
 
Introduction to AOS course
Introduction to AOS courseIntroduction to AOS course
Introduction to AOS courseHoang Nguyen
 
Background Knowledge
Background KnowledgeBackground Knowledge
Background KnowledgeHoang Nguyen
 
Introduction to Information Security Course
Introduction to Information Security CourseIntroduction to Information Security Course
Introduction to Information Security CourseHoang Nguyen
 
Introduction to CNS Course
Introduction to CNS CourseIntroduction to CNS Course
Introduction to CNS CourseHoang Nguyen
 

Mais de Hoang Nguyen (20)

GANs and Applications
GANs and ApplicationsGANs and Applications
GANs and Applications
 
Scrum - An introduction
Scrum - An introductionScrum - An introduction
Scrum - An introduction
 
ORM in Django
ORM in DjangoORM in Django
ORM in Django
 
Introduction to Cross-platform App Development
Introduction to Cross-platform App DevelopmentIntroduction to Cross-platform App Development
Introduction to Cross-platform App Development
 
Conistency of random forests
Conistency of random forestsConistency of random forests
Conistency of random forests
 
Trust - Digital Signature
Trust - Digital SignatureTrust - Digital Signature
Trust - Digital Signature
 
Key Exchange
Key ExchangeKey Exchange
Key Exchange
 
SOME SECURITY CHALLENGES IN CLOUD COMPUTING
SOME SECURITY CHALLENGES  IN CLOUD COMPUTINGSOME SECURITY CHALLENGES  IN CLOUD COMPUTING
SOME SECURITY CHALLENGES IN CLOUD COMPUTING
 
Stream ciphers
Stream ciphersStream ciphers
Stream ciphers
 
Classical ciphers
Classical ciphersClassical ciphers
Classical ciphers
 
Confidentiality
ConfidentialityConfidentiality
Confidentiality
 
Information, Data and Decision Making
Information, Data and Decision MakingInformation, Data and Decision Making
Information, Data and Decision Making
 
Multiple processor systems
Multiple processor systemsMultiple processor systems
Multiple processor systems
 
Multiprocessor Systems
Multiprocessor SystemsMultiprocessor Systems
Multiprocessor Systems
 
Introduction to AOS course
Introduction to AOS courseIntroduction to AOS course
Introduction to AOS course
 
Background Knowledge
Background KnowledgeBackground Knowledge
Background Knowledge
 
Introduction to Information Security Course
Introduction to Information Security CourseIntroduction to Information Security Course
Introduction to Information Security Course
 
Introduction to CNS Course
Introduction to CNS CourseIntroduction to CNS Course
Introduction to CNS Course
 
Dynamic Testing
Dynamic TestingDynamic Testing
Dynamic Testing
 
Nosql intro
Nosql introNosql intro
Nosql intro
 

Último

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Último (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

XML Information set

  • 1.
  • 2. • An abstract data set used to describe information contained in an (well-formed) XML document • Provide a consistent set of definitions for use in other specifications that need to refer to the information in a well-formed XML document • Not exhaustive; Include only those that are expected to be useful in future specifications • Not minimum set of information that must be returned by an XML processor • Analogous to tree
  • 3.  Each XML document has an information set if it is well-formed and satisfies some namespace constraints • Not require to be valid • May be created by methods other than parsing an XML document
  • 4.  XML document’s infoset • Consists of a a number of information items • At least a document information item and several others  Information item • An abstraction description of some part of an XML document • Has a set of acossiated named properties  Have 11 types of information items  Information set is same as a tree  Information item is same as a node of tree
  • 5.  Have 11 types of information items 1. Document 7. Comment 2. Element 8. The Document Type Declaration 3. Attribute 9. Unparsed Entity 4. Processing Instruction 10. Notation 5. Unexpanded Entity Reference 11. Namespace 6. Character  Each information item has properties • Property named ‘xyz’ is indicated by [xyz]
  • 6.  There is exactly one document information item in the infoset of an XML document  All other information items are accessible from the properties of the document information item, either directly or indirectly through the properties of other information items  Has properties • [children] • [unparsed entities] • [standalone] • [document element] • [baseURI] • [version] • [notations] • [character encoding scheme] • [all declarations processed]
  • 7.  There is an element information item for each element appearing in the XML document • One of the element information items is the value of the [document element] property of the document information item, corresponding to the root of the element tree, and • All other element information items are accessible by recursively following its [children] property  Has properties • [namespace name] • [children] • [in-scope namespaces] • [local name] • [attributes] • [base URI] • [prefix] • [namespace attributes] • [parent]
  • 8.  There is an attribute information item for each attribute (specified or defaulted) of each element in the document • including those which are namespace declarations • The latter however appear as members of an element's [namespace attributes] property rather than its [attributes] property  Has properties • [namespace name] • [normailized value] • [references] • [local name] • [specified] • [owner element] • [prefix] • [attribute type]
  • 9.  There is a processing instruction information item for each processing instruction in the document  The XML declaration and text declarations for external parsed entities are not considered processing instructions  Has properties • [target] • [notation] • [content] • [parent] • [base URI]
  • 10.  A unexpanded entity reference information item serves as a placeholder by which an XML processor can indicate that it has not expanded an external parsed entity  A validating XML processor, or a non-validating processor that reads all external general entities, will never generate unexpanded entity reference information items for a valid document.  Has properties • [name] • [declaration base URI] • [system identifier] • [parent] • [public identifier]
  • 11.  There is a character information item for each data character that appears in the document, whether literally, as a character reference, or within a CDATA section  Each character is a logically separate information item, but XML applications are free to chunk characters into larger groups as necessary or desirable  Has properties • [character code] • [parent] • [element content whitespace]
  • 12.  There is a comment information item for each XML comment in the original document, except for those appearing in the DTD (which are not represented)  Has properties • [content] • [parent]
  • 13.  If the XML document has a document type declaration, then the information set contains a single document type declaration information item  Note that entities and notations are provided as properties of the document information item, not the document type declaration information item  Has properties • [system identifier] • [children] • [public identifier] • [parent]
  • 14.  There is an unparsed entity information item for each unparsed general entity declared in the DTD  Has properties • [name] • [declaration base URI] • [system identifier] • [notation name] • [public identifier] • [notation]
  • 15.  There is a notation information item for each notation declared in the DTD  Has properties • [name] • [public identifier] • [system identifier] • [declaration base URI]
  • 16.  Each element in the document has a namespace information item for each namespace that is in scope for that element  Has properties • [prefix] • [namespace name]
  • 17.  Information Sets are extensible  New recommendations can associate properties with info items by adding properties  For example, XML Schema adds properties to the infoset to record the results of validation • Post-Schema -Validation Infoset (PSVI)  Proprietary software can add their own properties too
  • 18. 1. The content models of elements, from ELEMENT declarations in the DTD. 2. The grouping and ordering of attribute declarations in ATTLIST declarations. 3. The order of attributes within a start-tag. 4. The document type name. 5. White space outside the document element. 6. White space immediately following the target name of a PI. 7. Whether characters are represented by character references. 8. White space within start-tags (other than significant white space in attribute values) and end-tags. 9. The difference between the two forms of an empty element: <foo/> and <foo></foo>. 10. The difference between CR, CR-LF, and LF line termination.
  • 19. 11. The order of declarations within the DTD. 12. The boundaries of conditional sections in the DTD. 13. The boundaries of parameter entities in the DTD. 14. The boundaries of general parsed entities. 15. The boundaries of CDATA marked sections. 16. Comments in the DTD. 17. The location of declarations (whether in internal or external subset or parameter entities). 18. Any ignored declarations, including those within an IGNORE conditional section, as well as entity and attribute declarations ignored because previous declarations override them. 19. The kind of quotation marks (single or double) used to quote attribute values. 20. The default value of attributes declared in the DTD.
  • 20. 2. Used in other specifications that need to refer to the information in a well-formed XML document
  • 21. 1. XML Information Set (Second Edition)