Dynamic Chunking of Component-Authored Information
Ben Colborn, Manager, Technical Publications
Owen Richter, Web Application Architect
2
Web-scale converged infrastructure
› Converged compute and storage
› All intelligence in software
› Distributed everything
› Self-healing system
› Automation and rich analytics
3
Technical publications responsibilities
› Software documentation
› Release documentation
› Hardware documentation
› Support knowledge base
› Education collaboration
› Localization
4
Problem
Ben didn’t like any available options for publishing documentation
5
Monolithic documentation
6
Fragmented documentation
7
Advantages
Monolithic
• Easy to produce
• Familiar for audience
• Portable
Fragmented
• Easy to link
• Short page load time
• Familiar for authors
8
Opportunity
Growing company; development of new support portal
9
Every page is page one
› Every page is a potential entry point
› Sometimes hierarchy and sequence are relevant
› Often hierarchy and sequence are not relevant
› Multiplicity of navigation options is required
10
Information foraging behavior
› Information scent: Users estimate a given hunt’s likely
success from … assessing whether their path
exhibits cues related to the desired outcome.
› Informavores will keep clicking as long as they sense that
they're “getting warmer”—the scent must keep getting
stronger and stronger, or people give up.
› Progress must seem rapid enough to be worth the
predicted effort required to reach the destination.
› As users drill down the site, … provide feedback about
the current location and how it relates to users' tasks.
11
Documentation use cases
1. A new user may want to browse a complete high-level
document.
2. A developing user may want an intermediate-sized chunk
that has subject/sequence affinity.
3. An experienced user may want a small chunk with a
particular item of information.
4. A support technician may need to provide a chunk scoped
at an intermediate level to a customer so they are not
overloaded with too much information, but also not given
too little.
12
Document levels
Document
Part
Chapter
Section
Topic
13
DITA gets us halfway there
› Authoring and management are done at the topic level
› Chunking exists as an approach
but
› Chunking control is manual
› Chunks are static
14
Ben’s magical solution
If I had an infinite number of monkeys, I could
chunk all topics in all possible combinations
15
Cross-disciplinary thinking to the rescue
› We need a recursive document!
› A document is:
1. A title
2. A globally unique key (document name + sub document ID)
3. A locally unique key (sub document ID)
4. A list of tags
5. A (recursive) list of documents
› DITA is recursive but none of the existing presentation
mechanisms are recursive.
› JSON is a natural way to represent a recursive document.
› XSLT is a natural way to generate such a JSON document.
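The recursive definition above can be sketched in a few lines of Python. This is only an illustration, not the XSLT the slide mentions: the class name, the field names, the `make_doc` helper, and the `<document name>/<sub-document ID>` key format are all assumptions made for the example.

```python
from __future__ import annotations

import json
from dataclasses import asdict, dataclass, field


@dataclass
class Document:
    """A recursive document: every node has the same shape as the whole."""
    title: str
    global_key: str                 # document name + sub-document ID
    local_key: str                  # sub-document ID
    tags: list[str] = field(default_factory=list)
    documents: list[Document] = field(default_factory=list)


def make_doc(doc_name, sub_id, title, tags=(), children=()):
    # Hypothetical key format; the slides do not say how the two parts are joined.
    return Document(
        title=title,
        global_key=f"{doc_name}/{sub_id}",
        local_key=sub_id,
        tags=list(tags),
        documents=list(children),
    )


guide = make_doc(
    "admin-guide", "root", "Admin Guide", tags=["admin"],
    children=[
        make_doc("admin-guide", "install", "Installation", children=[
            make_doc("admin-guide", "prereqs", "Prerequisites"),
        ]),
    ],
)

# asdict() walks nested dataclasses, so the JSON nests as deeply as the document.
as_json = json.dumps(asdict(guide), indent=2)
```

Because a sub-document has the same shape as its parent, any node can be serialized on its own and still be a valid document.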
16
JSON generation process
DITA Source → HTML → JSON
17
Theoretical document: Complete
Document
1. Chapter
1.1 Section
2. Chapter
2.1 Section
2.1.1 Topic
2.2 Section
2.2.1 Topic
3. Chapter
18
Theoretical document: Chunks
1. Chapter
1.1 Section
2. Chapter
2.1 Section
2.1.1 Topic
2.2 Section
2.2.1 Topic
3. Chapter
2.1 Section
2.1.1 Topic
2.2 Section
2.2.1 Topic
2.1.1 Topic
2.2.1 Topic
1.1 Section
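One way to read the chunk list above is that every node in the hierarchy, together with everything below it, is a chunk. The sketch below enumerates those subtrees for the theoretical document. The tuple representation and the function names are assumptions made for the example.

```python
# Each node is (label, children); this mirrors the theoretical document above.
TREE = ("Document", [
    ("1. Chapter", [("1.1 Section", [])]),
    ("2. Chapter", [
        ("2.1 Section", [("2.1.1 Topic", [])]),
        ("2.2 Section", [("2.2.1 Topic", [])]),
    ]),
    ("3. Chapter", []),
])


def outline(node):
    """Return the labels of a node and all of its descendants, in document order."""
    label, children = node
    lines = [label]
    for child in children:
        lines.extend(outline(child))
    return lines


def chunks(node):
    """Yield one chunk per node: the node plus everything below it."""
    yield outline(node)
    for child in node[1]:
        yield from chunks(child)


all_chunks = list(chunks(TREE))
```

The number of chunks equals the number of nodes, so the set grows linearly with the document instead of requiring every possible combination of topics.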
19
DITA to JSON 1: DITAMAP
Document properties
Topic references
20
DITA to JSON 2: HTML index
Document properties
Topic references
21
DITA to JSON 3: JSON
Document
Properties
Topic
Topic
22
DITA to JSON 4: Sub-document
Field        Source
Title        Topic title
ID           Topic filename
Unique key   Top-level document filename + topic filename
Ancestors    List of ancestor topics at all levels
Summary*     Topic shortdesc
Body         Topic body
HREF         Topic path + topic filename
Documents*   List of sub-documents
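The field mapping in the table can be illustrated with a short Python sketch that reads a DITA topic directly. This is a simplification: the deck describes a DITA to HTML to JSON pipeline built with XSLT, which is not reproduced here. The function name, the lowercase key names, the `:` and `/` separators, and the sample topic are all assumptions.

```python
import xml.etree.ElementTree as ET

# A tiny, made-up DITA task topic used only for illustration.
TOPIC_XML = """<task id="replace-disk">
  <title>Replacing a disk</title>
  <shortdesc>Swap a failed disk.</shortdesc>
  <taskbody><context>Power down the node first.</context></taskbody>
</task>"""


def sub_document(topic_xml, top_level_file, topic_path, topic_file,
                 ancestors=(), children=()):
    """Build one sub-document dict using the Field/Source mapping above."""
    root = ET.fromstring(topic_xml)
    # The body element name depends on the topic type (body, conbody, taskbody, refbody).
    body = next((el for el in root if el.tag.endswith("body")), None)
    return {
        "title": root.findtext("title", default=""),
        "id": topic_file,
        # Hypothetical separator; the slide only says the two filenames are combined.
        "unique_key": f"{top_level_file}:{topic_file}",
        "ancestors": list(ancestors),
        "summary": root.findtext("shortdesc", default=""),
        "body": ET.tostring(body, encoding="unicode").strip() if body is not None else "",
        "href": f"{topic_path}/{topic_file}",
        "documents": list(children),
    }


doc = sub_document(
    TOPIC_XML,
    top_level_file="hardware-guide.ditamap",
    topic_path="hardware/tasks",
    topic_file="replace-disk.dita",
    ancestors=["hardware-guide.ditamap", "maintenance.dita"],
)
```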
23
Document Loading Process
1. Flatten each node
2. Create unique ID
3. Establish ancestry
4. Convert relative image and cross references to absolute links
5. Create a standalone document of each node
6. Load to DB
7. Load to search index
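The loading process above might look roughly like the following Python sketch. It is an assumption-laden illustration: the node shape (`id`, `title`, `images`, `links`, `documents`), the `<document name>/<id>` key format, and the base URL are hypothetical, and a plain dict stands in for the database. A real loader would send the same standalone records to the search index as well.

```python
from urllib.parse import urljoin


def load(node, doc_name, base_url, ancestors=(), db=None):
    """Flatten a nested document into standalone records keyed by unique ID."""
    if db is None:
        db = {}
    unique_id = f"{doc_name}/{node['id']}"          # hypothetical key format
    db[unique_id] = {
        "_id": unique_id,
        "title": node["title"],
        "ancestors": list(ancestors),               # IDs from the root down to the parent
        # Relative image and cross-reference paths become absolute links.
        "images": [urljoin(base_url, src) for src in node.get("images", [])],
        "links": [urljoin(base_url, href) for href in node.get("links", [])],
        "children": [f"{doc_name}/{c['id']}" for c in node.get("documents", [])],
    }
    for child in node.get("documents", []):
        load(child, doc_name, base_url, ancestors + (unique_id,), db)
    return db


tree = {
    "id": "root", "title": "Hardware Guide",
    "documents": [{
        "id": "hw", "title": "Maintenance",
        "documents": [{
            "id": "disk", "title": "Replacing a disk",
            "images": ["images/disk.png"],
            "links": ["../other/topic.html"],
        }],
    }],
}

db = load(tree, "guide", "https://docs.example.com/guide/")
```

Storing the ancestry on every record lets a page show where a chunk sits in the hierarchy without loading the whole document.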
24
Search
25
Task Topic
26
Chapter
27
Document
28
TOC
29
Multi-modality
30
DITA output targets
1. PDF: monolithic
2. ePUB: monolithic
3. HTML: fragmented
4. JSON: dynamically chunked
31
Conventions
› Images
› All image paths need to be converted to absolute paths. Having all of them in a
flat folder called “images” is one easy way to accomplish this.
› Cross References
› Cross reference links within the JSON are all relative. Like images, they need to
be converted to absolute links.
› JSON Tag Recursion
› It is tedious to add tags to all levels of the JSON Document, so most tags are
programmatically pulled through to all sub documents. Tags can be overridden
in children if desired.
› Permissions – can be set in source
› Anchors not supported
› Our current single-page app makes anchors difficult to support, but they are
somewhat irrelevant because each level is available as an independent link.
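The tag pull-through convention can be sketched as a recursive merge in which a child's own tags win over inherited ones. The slides describe tags only as a list and say that most (not all) tags are pulled through, so the key/value tag shape and the `propagate_tags` name below are assumptions made for the example.

```python
def propagate_tags(node, inherited=None):
    """Return a copy of the tree in which each node carries its ancestors' tags.

    Tags set on a node override inherited tags with the same name.
    """
    effective = {**(inherited or {}), **node.get("tags", {})}
    return {
        **node,
        "tags": effective,
        "documents": [
            propagate_tags(child, effective) for child in node.get("documents", [])
        ],
    }


source = {
    "title": "Admin Guide",
    "tags": {"product": "X", "audience": "admin"},
    "documents": [{
        "title": "Troubleshooting",
        "tags": {"audience": "support"},            # overrides the inherited value
        "documents": [{"title": "Collecting logs"}],
    }],
}

tagged = propagate_tags(source)
```

Authors tag only the top level, plus any exceptions, and every standalone chunk still ends up with a complete set of tags.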
32
What’s next?
› More publishing automation
› Publishing is currently a two-step process: JSON publication followed by document loading.
A one-step process controlled by the document publisher would be better.
› Holistic approach
› Search cultivation
› Search analytics
› Chat
› Case Deflection Analysis driving documentation.
› Tag-based navigation
33
Ben is less dissatisfied
Problems solved
• Apparently dynamic presentation
• Satisfactory context-sensitive help targets
• CMS/search loading
Problems not solved
• Static transformations
Problems created
• Content removal
• Proofing
• Custom software
Dynamic chunking of component-authored information

Mais conteúdo relacionado

Semelhante a Dynamic chunking of component-authored information

MongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 MeetupMongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 Meetupikanow
 
How IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problemsHow IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problemsikanow
 
Designing and Implementing Search Solutions
Designing and Implementing Search SolutionsDesigning and Implementing Search Solutions
Designing and Implementing Search SolutionsFindwise
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...IMPACT Centre of Competence
 
CQRS recipes or how to cook your architecture
CQRS recipes or how to cook your architectureCQRS recipes or how to cook your architecture
CQRS recipes or how to cook your architectureThomas Jaskula
 
Storing eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBay
Storing eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBayStoring eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBay
Storing eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBayMongoDB
 
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...MongoDB
 
Mimi Yin: Getting Things Done: Technology and Practice
Mimi Yin: Getting Things Done: Technology and PracticeMimi Yin: Getting Things Done: Technology and Practice
Mimi Yin: Getting Things Done: Technology and PracticeSteve Williams
 
Big Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-LandBig Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-LandAndrew Brust
 
OrientDB: Unlock the Value of Document Data Relationships
OrientDB: Unlock the Value of Document Data RelationshipsOrientDB: Unlock the Value of Document Data Relationships
OrientDB: Unlock the Value of Document Data RelationshipsFabrizio Fortino
 
MongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPR
MongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPRMongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPR
MongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPRMongoDB
 
Lessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePointLessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePointGareth Davies
 
IIIF and Linked Open Data: LODLAM 2020
IIIF and Linked Open Data: LODLAM 2020IIIF and Linked Open Data: LODLAM 2020
IIIF and Linked Open Data: LODLAM 2020David Newbury
 
mongodb-120401144140-phpapp01 claud camputing
mongodb-120401144140-phpapp01 claud camputingmongodb-120401144140-phpapp01 claud camputing
mongodb-120401144140-phpapp01 claud camputingmoeincanada007
 
The technology of the Human Protein Reference Database (draft, 2003)
The technology of the Human Protein Reference Database (draft, 2003)The technology of the Human Protein Reference Database (draft, 2003)
The technology of the Human Protein Reference Database (draft, 2003)Kiran Jonnalagadda
 
Quick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ PanoraysQuick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ PanoraysDemi Ben-Ari
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8dallemang
 
Cool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDBCool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDBJan Hentschel
 
Revolutionizing the hypatia metadata experience
Revolutionizing the hypatia metadata experienceRevolutionizing the hypatia metadata experience
Revolutionizing the hypatia metadata experienceKat Chuang
 

Semelhante a Dynamic chunking of component-authored information (20)

MongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 MeetupMongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 Meetup
 
How IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problemsHow IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problems
 
Designing and Implementing Search Solutions
Designing and Implementing Search SolutionsDesigning and Implementing Search Solutions
Designing and Implementing Search Solutions
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
 
CQRS recipes or how to cook your architecture
CQRS recipes or how to cook your architectureCQRS recipes or how to cook your architecture
CQRS recipes or how to cook your architecture
 
Storing eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBay
Storing eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBayStoring eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBay
Storing eBay's Media Metadata on MongoDB, by Yuri Finkelstein, Architect, eBay
 
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
 
Mimi Yin: Getting Things Done: Technology and Practice
Mimi Yin: Getting Things Done: Technology and PracticeMimi Yin: Getting Things Done: Technology and Practice
Mimi Yin: Getting Things Done: Technology and Practice
 
Big Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-LandBig Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-Land
 
OrientDB: Unlock the Value of Document Data Relationships
OrientDB: Unlock the Value of Document Data RelationshipsOrientDB: Unlock the Value of Document Data Relationships
OrientDB: Unlock the Value of Document Data Relationships
 
MongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPR
MongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPRMongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPR
MongoDB World 2019: MongoDB Cluster Design: From Redundancy to GDPR
 
Lessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePointLessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePoint
 
IIIF and Linked Open Data: LODLAM 2020
IIIF and Linked Open Data: LODLAM 2020IIIF and Linked Open Data: LODLAM 2020
IIIF and Linked Open Data: LODLAM 2020
 
mongodb-120401144140-phpapp01 claud camputing
mongodb-120401144140-phpapp01 claud camputingmongodb-120401144140-phpapp01 claud camputing
mongodb-120401144140-phpapp01 claud camputing
 
The technology of the Human Protein Reference Database (draft, 2003)
The technology of the Human Protein Reference Database (draft, 2003)The technology of the Human Protein Reference Database (draft, 2003)
The technology of the Human Protein Reference Database (draft, 2003)
 
Quick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ PanoraysQuick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
Relational vs. Non-Relational
Relational vs. Non-RelationalRelational vs. Non-Relational
Relational vs. Non-Relational
 
Cool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDBCool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDB
 
Revolutionizing the hypatia metadata experience
Revolutionizing the hypatia metadata experienceRevolutionizing the hypatia metadata experience
Revolutionizing the hypatia metadata experience
 

Mais de Ben Colborn

Terminology and taxonomy as foundations of content strategy
Terminology and taxonomy as foundations of content strategyTerminology and taxonomy as foundations of content strategy
Terminology and taxonomy as foundations of content strategyBen Colborn
 
Language quality as foundation of enterprise content strategy
Language quality as foundation of enterprise content strategyLanguage quality as foundation of enterprise content strategy
Language quality as foundation of enterprise content strategyBen Colborn
 
DITA as Interchange Format for Crowdsourcing and Acquisitions
DITA as Interchange Format for Crowdsourcing and AcquisitionsDITA as Interchange Format for Crowdsourcing and Acquisitions
DITA as Interchange Format for Crowdsourcing and AcquisitionsBen Colborn
 
Reference material: Topics or databases?
Reference material: Topics or databases?Reference material: Topics or databases?
Reference material: Topics or databases?Ben Colborn
 
Improving product usability with task complexity metrics
Improving product usability with task complexity metricsImproving product usability with task complexity metrics
Improving product usability with task complexity metricsBen Colborn
 
Automated QA of DITA content sets
Automated QA of DITA content setsAutomated QA of DITA content sets
Automated QA of DITA content setsBen Colborn
 

Mais de Ben Colborn (6)

Terminology and taxonomy as foundations of content strategy
Terminology and taxonomy as foundations of content strategyTerminology and taxonomy as foundations of content strategy
Terminology and taxonomy as foundations of content strategy
 
Language quality as foundation of enterprise content strategy
Language quality as foundation of enterprise content strategyLanguage quality as foundation of enterprise content strategy
Language quality as foundation of enterprise content strategy
 
DITA as Interchange Format for Crowdsourcing and Acquisitions
DITA as Interchange Format for Crowdsourcing and AcquisitionsDITA as Interchange Format for Crowdsourcing and Acquisitions
DITA as Interchange Format for Crowdsourcing and Acquisitions
 
Reference material: Topics or databases?
Reference material: Topics or databases?Reference material: Topics or databases?
Reference material: Topics or databases?
 
Improving product usability with task complexity metrics
Improving product usability with task complexity metricsImproving product usability with task complexity metrics
Improving product usability with task complexity metrics
 
Automated QA of DITA content sets
Automated QA of DITA content setsAutomated QA of DITA content sets
Automated QA of DITA content sets
 

Último

Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceDelhi Call girls
 
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋nirzagarg
 
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...SUHANI PANDEY
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查ydyuyu
 
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...tanu pandey
 
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...SUHANI PANDEY
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...singhpriety023
 
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Real Escorts in Al Nahda +971524965298 Dubai Escorts Service
Real Escorts in Al Nahda +971524965298 Dubai Escorts ServiceReal Escorts in Al Nahda +971524965298 Dubai Escorts Service
Real Escorts in Al Nahda +971524965298 Dubai Escorts ServiceEscorts Call Girls
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445ruhi
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirtrahman018755
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge GraphsEleniIlkou
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableSeo
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...tanu pandey
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
 
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls DubaiDubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubaikojalkojal131
 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...roncy bisnoi
 

Último (20)

Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
 
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
 
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
 
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
 
Real Escorts in Al Nahda +971524965298 Dubai Escorts Service
Real Escorts in Al Nahda +971524965298 Dubai Escorts ServiceReal Escorts in Al Nahda +971524965298 Dubai Escorts Service
Real Escorts in Al Nahda +971524965298 Dubai Escorts Service
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
 
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
Russian Call Girls in %(+971524965298  )#  Call Girls in DubaiRussian Call Girls in %(+971524965298  )#  Call Girls in Dubai
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls DubaiDubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
 

Dynamic chunking of component-authored information

  • 1. Dynamic Chunking of Component-Authored Information Ben Colborn Owen Richter Manager, Technical Publications Web Application Architect
  • 3. 3 Technical publications responsibilities › Software documentation › Release documentation › Hardware documentation › Support knowledge base › Education collaboration › Localization
  • 4. 4 Problem Ben didn’t like any available options for publishing documentation
  • 9. 9 Every page is page one › Every page is a potential entry point › Sometimes hierarchy and sequence are relevant › Often hierarchy and sequence are not relevant › Multiplicity of navigation options is required
  • 10. 10 Information foraging behavior › Information scent: Users estimate a given hunt’s likely success from … assessing whether their path exhibits cues related to the desired outcome. › Informavores will keep clicking as long as they sense that they're “getting warmer”—the scent must keep getting stronger and stronger, or people give up. › Progress must seem rapid enough to be worth the predicted effort required to reach the destination. › As users drill down the site, … provide feedback about the current location and how it relates to users' tasks.
  • 11. 11 Documentation use cases 1. A new user may want to browse a complete high level document. 2. A developing user may want an intermediate-sized chunk that has subject/sequence affinity. 3. An experienced user may want a small chunk with a particular item of information. 4. A support technician may need to provide a chunk scoped at an intermediate level to a customer so they are not overloaded with too much information, but also not given too little.
  • 13. 13 DITA gets us halfway there  Authoring and management is done at the topic level  Chunking exists as an approach but  Chunking control is manual  Chunks are static
  • 14. 14 Ben’s magical solution If I had an infinite number of monkeys, I could chunk all topics in all possible combinations
  • 15. 15 Cross-disciplinary thinking to the rescue › We need a recursive document! › A document is: 1. A title 2. A globally unique key (document name + sub document ID) 3. A locally unique key (sub document ID) 4. A list of tags 5. A (recursive) list of documents › DITA is recursive but none of the existing presentation mechanisms are recursive. › JSON is a natural way to represent a recursive document. › XSLT is a natural way to generate such a JSON document.
  • 17. 17 Theoretical document: Complete Document 1. Chapter 1.1 Section 2. Chapter 2.1 Section 2.1.1 Topic 2.2 Section 2.2.1 Topic 3. Chapter
  • 18. 18 Theoretical document: Chunks 1. Chapter 1.1 Section 2. Chapter 2.1 Section 2.1.1 Topic 2.2 Section 2.2.1 Topic 3. Chapter 2.1 Section 2.1.1 Topic 2.2 Section 2.2.1 Topic 2.1.1 Topic 2.2.1 Topic 1.1 Section
  • 19. 19 DITA to JSON 1: DITAMAP Document Properties Topic References
  • 20. 20 DITA to JSON 2: HTML index Document Properties Topic References
  • 21. 21 DITA to JSON 3: JSON Document Properties Topic Topic
  • 22. 22 DITA to JSON 4: Sub-document Field Source Title Topic title ID Topic filename Unique key Top-level document filename + topic filename Ancestors List of ancestor topics at all levels Summary* Topic shortdesc Body Topic body HREF Topic path + topic filename Documents* List of sub-documents
  • 23. 23 Document Loading Process Flatten each node Create Unique ID Establish ancestry Convert relative image and cross references to absolute links Create a standalone document of each node Load to DB Load to search index
  • 30. 30 DITA output targets 1. PDF: monolithic 2. ePUB: monolithic 3. HTML: fragmented 4. JSON: dynamically chunked
  • 31. 31 Conventions › Images › All image paths need to be converted to absolute paths. Having all of them in a flat folder called “images” is one easy way to accomplish this. › Cross References › Cross reference links within the JSON are all relative. Like images, they need to be converted to absolute links. › JSON Tag Recursion › It is tedious to add tags to all levels of the JSON Document, so most tags are programmatically pulled through to all sub documents. Tags can be overridden in children if desired. › Permissions – can be set in source › Anchors not supported › We currently have a single page app making anchors difficult, but somewhat irrelevant since each level is available as an independent link.
  • 32. 32 What’s next? › More publishing automation › Publishing is currently a 2 step process. JSON Publication followed by document loading. It would be better to provide a 1 step process controlled by the document publisher. › Holistic approach › Search cultivation › Search analytics › Chat › Case Deflection Analysis driving documentation. › Tag-based navigation
  • 33. 33 Ben is less dissatisfied
    Problems solved
    • Apparently dynamic presentation
    • Satisfactory context-sensitive help targets
    • CMS/search loading
    Problems not solved
    • Static transformations
    Problems created
    • Content removal
    • Proofing
    • Custom software

Editor's Notes

  1. Key Points: At its core, Nutanix eliminates complexity in the datacenter. One of the root causes of complexity is the data storage architecture, specifically the storage network. The Nutanix Virtual Computing Platform gets rid of the SAN and brings compute and storage together for virtualized environments. This approach eliminates network bottlenecks and simplifies the architecture. This is particularly important with flash storage because the network can become a chokepoint for the system. With a Nutanix solution, customers can easily add compute and storage by adding nodes on the go.
  2. Software documentation: feature and task; text, image, video; context-sensitive help. Release documentation: release notes; upgrade instructions. Hardware documentation: replacement procedures; system specifications; text, image, video.
  3. We were publishing in PDF, which is bad for findability. Then we also published in WebHelp, which created a silo per document. A web CMS (e.g. Drupal) is difficult to use as a publishing endpoint because import and update are complicated.
  4. High page count. Deep nesting and poor scoping of pages. Mismatch between page (8.5x11) and topic (standalone piece of information, variable length).
  5. Alignment between page and topic. Small pieces without a clear scope of relationships, shown only in a TOC with the same deep nesting.
  6. From Mark Baker
  7. From Nielsen Norman Group: http://www.nngroup.com/articles/information-scent/ Information foraging uses the analogy of wild animals gathering food to analyze how humans collect information online. Information foraging's most famous concept is information scent: users estimate a given hunt's likely success from the spoor: assessing whether their path exhibits cues related to the desired outcome. Informavores will keep clicking as long as they sense (to mix metaphors) that they're "getting warmer" -- the scent must keep getting stronger and stronger, or people give up. Progress must seem rapid enough to be worth the predicted effort required to reach the destination. Secondly, as users drill down the site, each page should clearly indicate that they're still on the path to the food. In other words, provide feedback about the current location and how it relates to users' tasks.
  8. Would like to be able to present a page at any of these levels. With the standard tools, only document (monolithic) and topic (fragmented) levels are possible.
  9. Want to keep the granular authoring and management. Manual chunking (using @chunk) is of limited value.
  10. Chunking is static. It’s possible to envision how to have multiple chunk outputs but not how to handle them.
  11. Over to Owen.
  12. Is using XSLT too hard? No, the OT already uses it for all output types. Under 300 lines to read HTML2 output and create a single JSON file. New XSLT for each doc type? No, processing is generic. Publish JSON, PDF, ePUB
  13. Analyze into 8 pages
  14. Process all possible chunk combinations
  15. A single JSON document is loaded into a DB and a search index. The recursive list of subdocuments is flattened, and a single monolithic document is created for each sub-document. Each recursive node contains ancestry information to create breadcrumbs. Table of contents: The table of contents is created only for the top-level document, not scoped for each subdocument. Because siblings are shown in scope, a TOC becomes less relevant. On mobile devices, we can look at TOC or content, saving space. Links and images: The JSON document is published with relative links. The loading process converts these into absolute links. Your automated loader is your infinite number of monkeys.
  16. Demo hierarchy.ditamap
  17. CSH: The link target isn’t just the obvious item but provides more context. Content removal: inconsistency between search results and available docs. Productize?