SlideShare uma empresa Scribd logo
1 de 36
The Web Comes Alive with Data!
Schema.org and Structured Data on
the Web: Past, Present, Potential
@jaymyers
Google DevFest Twin Cities
February 8, 2014
• Early adopter
• Semantic
Web, Linked &
Open data
enthusiast
• Speaker
Web of Today
•
•
•
•

25 million web sites
Trillions of web pages
5 billion web pages change every day
1000x more web pages on the “deep web”
Structured data

Transform

User
Users
Transform
Structured data

Machines
“1980-10-21”
“Actress”*
birthDate
jobTitle
jobTitle

Kim Kardashian

“Director”*
“HollywoodLife”

provider

provider

“TooFab”

* questionable, but we’ll go with it
Goals
• Create a web for both humans and machines
• Entice webmasters to make metadata
available through structured HTML
• Gain access to the meaning of web sites
Early Attempts
• Meta Content
Framework
• RDF
• OWL
Semantic Web

“A new form of Web content that is meaningful to computers
will unleash a revolution of new possibilities” - TBL
Microformats (‘03)
Addresses, geo, blog posts, media
(images/ video), news, products,
recipes, reviews and more!
Microformats example
<div class="item hproduct">
<ol>
<li class="lister vcard"><a class="url fn" href="http://storename.com">Magers and Quinn</a></li>
<li class="category"><a href="http://storename.com/categories/books">Books</a></li>
</ol>
<img src="http://images.storename.com/products/ramsay-fast-food.jpg" class="photo" alt="gordon ramsay fast food
book" />
<p><span class="condition">New:</span> <span class="price">$27.99</span></p>
<p>Pub price: $35.00</p>
<p>Hardcover</p>
<p class="availability">Out of stock</p>
<h1 class="fn">Gordon Ramsay's Fast Food Recipes from the F Word</h1>
<p>By Ramsay, Gordon</p>
<dl class="identifier">
<dt>ISBN:</dt>
<dd>1554700647</dd>
</dl>
<dl class="identifier">
<dt>Publisher:</dt>
<dd>Amer Youth Hostels</dd>
</dl>
<h4>Publishers Comments</h4>
<p class="description">A celebrity host of Hell's Kitchen features more than one hundred accessible recipes that are
organized in accordance with everyday needs and special occasions, in a volume that places an emphasis on fast
preparation and features complementary tips on stocking a pantry.</p>
</div>
Ontology Models:FOAF
<mailto:jmyers1551@gmail.com>

<mailto:cledwyn@gmail.com>

foaf:mbox
“Jaydog”

foaf:knows

foaf:nick

foaf:homepage

<http://jaymmyers.tumblr.com>

foaf:mbox

Jay Myers
a foaf:Person

foaf:nick

Lloyd Cledwyn
a foaf:Person

“Professor
Lloyd”

foaf:homepage

<http://stthomas.edu>
Ontology Models:SKOS
category:Boston_Celtics
category:Minnesota_
Timberwolves

category:National_Basketball_Association_
franchise_relocations

category:Atlanta_Hawks

skos:broader
skos:broader

skos:broader

skos:broader
skos:broader

skos:prefLabel

National Basketball
Association teams
a skos:Concept
category:Defunct_National_
Basketball_Association_teams

“National Basketball Association teams”
Ontology Models:GoodRelations

“Make a delicious breakfast treat…”
“Euro Cuisine”
gr:description
gr:hasManufacturer

gr:includesObject
a gr:Offering

gr:hasMPN

Euro Cuisine – gr:category
8" Heart-Shape
Waffle Maker
a gr:ProductOrService

“WM520”

“Waffle_Makers”
RDFa
<html xmlns=“http://www.w3.org/1999/xhtml” xmlns:rdfs=“Http://www.w3.org/2000/01/rdf-schema#”
xmlns:dc=“http://purl.org/dc/elements/1.1/” xmlns:xsd=http://www.w3.org/2001/XMLSchema#
xmlns:foaf=“http://xmlns.com/foaf/0.1/” xmlns:gr=http://purl.org/goodrelations/v1#
xmlns:geo=http://www.w3.org/2003/01/geo/wgs84_pos# xmlns:v=http://www.w3.org/2006/vcard/ns#
xmlns:r=http://rdf.data-vocabulary.org/#>
<div class="vcard" typeof="gr:LocationOfSalesOrServiceProvisioning" about="#store_201">
<h1 id="site_title" property="geo:lat_long" content="29.521643, -98.493599">
<a href="http://stores.bestbuy.com/201">Best Buy - San Antonio</a>
</h1>
</div>
<div rel="v:adr">
<p class="geo" typeof="v:Address v:Work”>
<strong><span property="v:street-address">125 Nw Loop 410 Ste 201</span></strong><br />
<strong> <span property="v:locality">San Antonio, </span> <span property="v:region">TX</span>
<span property="v:postal-code">78216</span> </strong> <br />
Phone: <span property="v:tel"><span typeof="v:Tel v:Home">888-229-3770</span></span><br />
Email: <a href="mailto:Keith.Allen2@bestbuy.com" rel="v:email">Keith.Allen2@bestbuy.com</a></p>
<span rel="v:geo">GEO: <span property="v:latitude">29.521643</span>, <span property="v:longitude">98.493599</span></span></p>
</div>
Why?
Circa 2007
Additional content on
SERPs

Data automagically extracted from
HTML
Value prop:
“Give us your data in a machinereadable format and we’ll make
your stuff more attractive in search
results”
Results
• 1000x increase in structured markup
• Increases in user engagement (click throughs)
for SERP objects created from structured
markup
• Small number of interesting applications built
on top of structured data
But…
• Too many choices
(syntax, ontology, etc.),
fragmented
• A lot of bad markup –
up to 40%
• Not easy enough for
your average “Joe
Webmaster”
2010
schema.org
• Common vocabularies that search engines
can understand
• Lower the bar for webmasters to publish
data on the web
• Improve user experience through data
Introducing: Microdata
<div id="pagecontent" itemscope itemtype="http://schema.org/Person">
<a href="/media/rm974696448/nm2578007?ref_=nm_ov_ph"> <img id="name-poster"
alt="Kim Kardashian Picture" title="Kim Kardashian Picture"
src="http://ia.mediaimdb.com/images/M/MV5BMTc0MjkzOTAxNV5BMl5BanBnXkFtZTcwNTk1NjcyNw@@._V1_SX214_CR0,
0,214,317_.jpg" itemprop="image"/>
</a>
<h1 class="header"> <span class="itemprop" itemprop="name">Kim
Kardashian</span></h1>
<div class="infobar" id="name-job-categories">
<span class="itemprop" itemprop="jobTitle">Actress</span>
<span class="itemprop" itemprop="jobTitle">Producer</span>
</div>
<div class="inline" itemprop="description">
TV star, entrepreneur, fashion designer, and author (New York Times bestseller - "Kardashian Konfidential"), Kim Kardashian first burst onto the scene in 2007, after the premiere
of her hit E! Entertainment reality series ...
</div>
<time datetime="1980-10-21" itemprop="birthDate">
<a href="/search/name?birth_monthday=1021&refine=birth_monthday&ref_=nm_ov_bth_monthday" >October 21</a>,
<a href="/search/name?birth_year=1980&ref_=nm_ov_bth_year" >1980</a>
</time>
</div>
Looks Like We’ve Got Something Here!
• 15% of all sites contain schema.org markup
• Many major sites
• Adoption by content systems like Drupal and
Wordpress
• Around 1200 object types and growing
• Significant reduction in error rates
Practical Applications in Search
Yahoo! Related Entities
Practical Applications in Search
Yandex Islands
Practical Applications in Search
Google Knowledge Graph

Additional content driven
by schema.org derived
data
Practical Applications in Search
Google Knowledge Graph

Additional content driven
by schema.org derived
data
Other Applications
Pinterest Rich Pins
Other Applications
Gmail “Actions in the Inbox”
• Actions – rent a movie, buy something
• Orders – post transaction order
confirmation, shipping status
• Reservations – restaurant, travel, tickets
Other Applications
JSON-LD
{
"@context": "http://schema.org",
"@type": "Person",
"name": "John Doe",
"jobTitle": ”Technologist",
"affiliation": ”Big Boxen ‘R’ Us",
"additionalName": "Johnny",
"url": "http://www.example.com",
"address": {
"@type": "PostalAddress",
"streetAddress": "1234 Freeze Drive",
"addressLocality": ”Icebox",
"addressRegion": ”Minnesota"
}
}
Thank you
@jaymyers
Credits
Guha, Ramanathan V. “Light at the End of the Tunnel.” 12th International Semantic Web Conference
(ISWC), Sydney, NSW, Australia. 23 October 2013. Keynote Address.

Mais conteúdo relacionado

Mais procurados

SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
Micah Altman
 
Discover the invisible web
Discover the invisible webDiscover the invisible web
Discover the invisible web
drakowski
 
Drupal Open Source Everything
Drupal Open Source EverythingDrupal Open Source Everything
Drupal Open Source Everything
librarywebchic
 
Andrew Hoppin, CIO, NY State Senate
Andrew Hoppin, CIO, NY State SenateAndrew Hoppin, CIO, NY State Senate
Andrew Hoppin, CIO, NY State Senate
Acquia
 

Mais procurados (20)

Structured Data: It's All About the Graph!
Structured Data: It's All About the Graph!Structured Data: It's All About the Graph!
Structured Data: It's All About the Graph!
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Schema.org where did that come from?
Schema.org where did that come from?Schema.org where did that come from?
Schema.org where did that come from?
 
Search search search
Search search searchSearch search search
Search search search
 
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
 
Demand, Media, and Search Analytics at AOL
Demand, Media, and Search Analytics at AOLDemand, Media, and Search Analytics at AOL
Demand, Media, and Search Analytics at AOL
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data Foundation
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information Science
 
Tracking TC QAI
Tracking TC QAITracking TC QAI
Tracking TC QAI
 
Evaluating search engines
Evaluating search enginesEvaluating search engines
Evaluating search engines
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & How
 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!
 
Semantic Web and Schema.org
Semantic Web and Schema.orgSemantic Web and Schema.org
Semantic Web and Schema.org
 
Discover the invisible web
Discover the invisible webDiscover the invisible web
Discover the invisible web
 
Optimizing Content with SEO and Social Media
Optimizing Content with SEO and Social MediaOptimizing Content with SEO and Social Media
Optimizing Content with SEO and Social Media
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental Scan
 
Drupal Open Source Everything
Drupal Open Source EverythingDrupal Open Source Everything
Drupal Open Source Everything
 
Andrew Hoppin, CIO, NY State Senate
Andrew Hoppin, CIO, NY State SenateAndrew Hoppin, CIO, NY State Senate
Andrew Hoppin, CIO, NY State Senate
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending Benefits
 
Three Linked Data choices for Libraries
Three Linked Data choices for LibrariesThree Linked Data choices for Libraries
Three Linked Data choices for Libraries
 

Semelhante a The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Past, Present, Potential

[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)
[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)
[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)
Christopher Schmitt
 
Building Things Fast - and getting approval
Building Things Fast - and getting approvalBuilding Things Fast - and getting approval
Building Things Fast - and getting approval
Simon Willison
 

Semelhante a The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Past, Present, Potential (20)

WTF is Semantic Web?
WTF is Semantic Web?WTF is Semantic Web?
WTF is Semantic Web?
 
Make your website 2 times faster
Make your website 2 times fasterMake your website 2 times faster
Make your website 2 times faster
 
Linked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsLinked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI Mpls
 
[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)
[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)
[HEWEBAR 2012] Beyond Desktop Browsing (HTML5)
 
Slow kinda sucks
Slow kinda sucksSlow kinda sucks
Slow kinda sucks
 
Plone Interactivity
Plone InteractivityPlone Interactivity
Plone Interactivity
 
WEB I - 06 - CSS
WEB I - 06 - CSSWEB I - 06 - CSS
WEB I - 06 - CSS
 
2015 rubyconf - 百大媒體網站從 Wordpress 到 Rails 的大小事
2015 rubyconf - 百大媒體網站從 Wordpress 到 Rails 的大小事2015 rubyconf - 百大媒體網站從 Wordpress 到 Rails 的大小事
2015 rubyconf - 百大媒體網站從 Wordpress 到 Rails 的大小事
 
Css essentials
Css essentialsCss essentials
Css essentials
 
Diazo: Bridging Designers and Programmers
Diazo: Bridging Designers and ProgrammersDiazo: Bridging Designers and Programmers
Diazo: Bridging Designers and Programmers
 
HTML5 & CSS3 Flag
HTML5 & CSS3 FlagHTML5 & CSS3 Flag
HTML5 & CSS3 Flag
 
Building Things Fast - and getting approval
Building Things Fast - and getting approvalBuilding Things Fast - and getting approval
Building Things Fast - and getting approval
 
Advanced CSS Troubleshooting & Efficiency
Advanced CSS Troubleshooting & EfficiencyAdvanced CSS Troubleshooting & Efficiency
Advanced CSS Troubleshooting & Efficiency
 
Curtin University Frontend Web Development
Curtin University Frontend Web DevelopmentCurtin University Frontend Web Development
Curtin University Frontend Web Development
 
jQuery Makes Writing JavaScript Fun Again (for HTML5 User Group)
jQuery Makes Writing JavaScript Fun Again (for HTML5 User Group)jQuery Makes Writing JavaScript Fun Again (for HTML5 User Group)
jQuery Makes Writing JavaScript Fun Again (for HTML5 User Group)
 
File Upload 2015
File Upload 2015File Upload 2015
File Upload 2015
 
Fronteers 2012 - Lessons learned from building a SAAS app
Fronteers 2012 - Lessons learned from building a SAAS appFronteers 2012 - Lessons learned from building a SAAS app
Fronteers 2012 - Lessons learned from building a SAAS app
 
Responsive content
Responsive contentResponsive content
Responsive content
 
JavaScript front end performance optimizations
JavaScript front end performance optimizationsJavaScript front end performance optimizations
JavaScript front end performance optimizations
 
OOCSS in the Real World: A Case Study from the CBC
OOCSS in the Real World: A Case Study from the CBCOOCSS in the Real World: A Case Study from the CBC
OOCSS in the Real World: A Case Study from the CBC
 

Mais de Jay Myers

RDFa -- search engines and beyond!
RDFa -- search engines and beyond!RDFa -- search engines and beyond!
RDFa -- search engines and beyond!
Jay Myers
 
MIT Sloan Linked Data Ventures - Jay Myers
MIT Sloan Linked Data Ventures  - Jay MyersMIT Sloan Linked Data Ventures  - Jay Myers
MIT Sloan Linked Data Ventures - Jay Myers
Jay Myers
 
Increasing product and service visibility through front-end semantic web
Increasing product and service visibility through front-end semantic webIncreasing product and service visibility through front-end semantic web
Increasing product and service visibility through front-end semantic web
Jay Myers
 

Mais de Jay Myers (14)

Practical Applications of Semantic Web in Retail -- Semtech 2014
Practical Applications of Semantic Web in Retail -- Semtech 2014 Practical Applications of Semantic Web in Retail -- Semtech 2014
Practical Applications of Semantic Web in Retail -- Semtech 2014
 
SMX Advanced Seattle -- Structured Web of Data
SMX Advanced Seattle -- Structured Web of DataSMX Advanced Seattle -- Structured Web of Data
SMX Advanced Seattle -- Structured Web of Data
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked Data
 
GS1: Better retailing through linked data
GS1: Better retailing through linked dataGS1: Better retailing through linked data
GS1: Better retailing through linked data
 
Better Retailing through Linked Data
Better Retailing through Linked DataBetter Retailing through Linked Data
Better Retailing through Linked Data
 
RDFa -- search engines and beyond!
RDFa -- search engines and beyond!RDFa -- search engines and beyond!
RDFa -- search engines and beyond!
 
Semantic Web Presentation at University of St Thomas,ust seis752
Semantic Web Presentation at University of St Thomas,ust seis752Semantic Web Presentation at University of St Thomas,ust seis752
Semantic Web Presentation at University of St Thomas,ust seis752
 
The Offspring of SEO and Semantic Web: SEO++
The Offspring of SEO  and Semantic Web: SEO++ The Offspring of SEO  and Semantic Web: SEO++
The Offspring of SEO and Semantic Web: SEO++
 
Semantic Web and Linked Data at TechMaine Conference
Semantic Web and Linked Data at TechMaine ConferenceSemantic Web and Linked Data at TechMaine Conference
Semantic Web and Linked Data at TechMaine Conference
 
NYC Semantic Web Meetup, Nov 2010
NYC Semantic Web Meetup, Nov 2010NYC Semantic Web Meetup, Nov 2010
NYC Semantic Web Meetup, Nov 2010
 
Sem web summit boston 2010
Sem web summit boston 2010Sem web summit boston 2010
Sem web summit boston 2010
 
MIT Sloan Linked Data Ventures - Jay Myers
MIT Sloan Linked Data Ventures  - Jay MyersMIT Sloan Linked Data Ventures  - Jay Myers
MIT Sloan Linked Data Ventures - Jay Myers
 
Increasing product and service visibility through front-end semantic web
Increasing product and service visibility through front-end semantic webIncreasing product and service visibility through front-end semantic web
Increasing product and service visibility through front-end semantic web
 
SES Chicago "Developments in Information Retrieval on the Web"
SES Chicago "Developments in Information Retrieval on the Web"SES Chicago "Developments in Information Retrieval on the Web"
SES Chicago "Developments in Information Retrieval on the Web"
 

The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Past, Present, Potential

Notas do Editor

  1. Who am I
  2. examples
  3. What’s in it for me?