What is the current status quo of the Semantic Web as first mentioned by Tim Berners Lee in 2001?
Not only 10 blue links can drive you traffic anymore, Google has added many so called Knowlegde cards and panels to answer the specific informational need of their users. Sounds complicated, but it isn’t. If you ask for information, Google will try to answer it within the result pages.
I'll share my research from a theoretical point of view through exploring patents and papers, and actual testing cases in the live indices of Google. Getting your site listed as the source of an Answer Card can result in an increase of CTR as much as 16%. How to get listed? Come join my session and I'll shine some light on the factors that come into play when optimizing for Google's Knowledge graph.
4. ―The Semantic Web is a collaborative
movement led by international standards
body the World Wide Web
Consortium (W3C). The standard promotes
common data formats on the World Wide
Web‖
5. ―The Semantic Web provides a common
framework that allows data to be shared
and reused across application, enterprise,
and community boundaries‖
57. International Freelance SEO
SEO Consultant Metapeople
/ Netbooster Group
Brand Ambassador Majestic
Cycling & Skating
Science: Physics in particular
58.
59.
60. 1. Make data available
2. Use specific markup languages
3. Data is available for everyone
61. ―The Open Graph protocol enables any web
page to become a rich object in a social
graph. For instance, this is used on
Facebook to allow any web page to have
the same functionality as any other object
on Facebook.‖
68. 1. Schema.org microdata
2. Open Graph protocol
3. Title + metadescription element
4. Best guess from page content
Use: https://developers.google.com/+/web/snippet/
79. se·man·tics [si-man-tiks]
noun
the branch of linguistics that deals with the
study of meaning, changes in meaning,
and the principles that govern the
relationship between sentences or words
and their meanings
82. ―Microdata is a set of tags, introduced with
HTML5, that allows you to do this.‖
83.
84. • Is separated from the HTML
• Which gives more flexibility and scalabilty
options
• Used in more software, like the washing
machine I showed earlier
• But… Google hasn’t integrated everything
yet
94. • https://developers.google.com/structured-data/rich-
snippets/
• Schema Creator by Raven http://schema-creator.org/
• Schema.org Generator
http://www.microdatagenerator.com/
• Rich Snippets Testing Tool Bookmarklet
• http://www.blindfiveyearold.com/rich-snippets-testing-tool-bookmarklet
• Everything you need to know to generate
rich snippets: http://seogadget.com/micro-data-schema-org-
guide-to-generating-rich-snippets/
95. 1. You have specific data points available
2. SE’s accept specific markup language
3. SE’s accept certain snippets
4. Information within the SERPs is correct
• Implement code and check with the SE’s:
https://developers.google.com/structured-data/testing-tool/?hl=it
96.
97. • Make sure all items are structured and
nested in the correct way.
• Google Testing tool only shows errors
based on missing elements, not on wrong
coding!
103. ―Google doesn’t use markup for ranking
purposes at this time—but rich snippets can
make your web pages appear more
prominently in search results, so you may
see an increase in traffic.‖
Source:
https://support.google.com/webmasters/answer/1211158?hl=en
111. 406
368
288
248
228
182
177
148
135
Artificial Intelligence and Machine Learning
Algorithms and Theory
Human-Computer Interaction and Visualization
Natural Language Processing
Machine Perception
Information Retrieval and the Web
Security, Cryptography, and Privacy
Data Mining
Software Systems
Top 10 Research fields per # Publications
129. Four different methods to extract triples from web content
Natural Language
Processing tools
Entity recognition
Entity linkage
Entity verification
against Freebase
Source: https://www.cs.cmu.edu/~nlao/publication/2014.kdd.pdf
Document Object
Model
Either text or
database driven
―deep web‖ sources
Think of quering
HTML forms
570M tables on the
web
Relations are difficult
to extract
Schema matching
methods
Entity verification
against Freebase
Schema.org
Mostly people
related
Products & Events
are not stored
Mapping
Schema.org to
Freebase for
predicates
132. Exploring the power of tables on the Web
https://research.google.com/tables
133.
134.
135. The papers share some insights about the factors relevant to Google Tables results
Sources of data Google uses according to the paper
Optimise the
surrounding content
with relevant
captions and texts.
Use <th> table
headings to add
labels to specific
columns
Add relevant
attributes to your
table headings
focusing on the
queries used
Only add useful
content to the table.
Boilerplate content
is filtered out.
http://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper3.pdf
136.
137. ―Extraction errors are far more prevalent than
source errors. Ignoring this distinction can
cause us to incorrectly distrust a website‖
138. Back to the basics for Google (and probably the other search engines too)
Links still tell something about
relationships between pages but also
between entities.
Simply search in the indices you already
have. In the case of Google, they already
have ―everything‖.
Simply gather user feedback from within
the search results.
147. One in 20 searches is health related according to Google.
148.
149.
150. Use Web based Fact
extraction, like DOM, tables
and annotated data
(Schema.org)
Text based extractors
adding more triples to the
datasets
Systems like described in the Biperpedia
paper. Data is enriched and quality
control takes place. Use partnerships for
trusted resources.
Use existing datasets like
Freebase / Wikidata to verify
extracted data and calculate
probability
164. Add schema.org Organization markup to your official website
Find example JSON-LD at
https://developers.google.com/structured-data/customize/overview
165. What about the localised Google search indices?
?
?
?
?
?
?
166.
167.
168. Contains the main
subject of the required
answer
Contains the main
subject of the required
answer
Within the content, the
question is answered in
a single sentence
No, Euro NCAP is more
authoritative in the EU
for car safety levels.
NHTSA for the US
175. Since not many are focusing on the getting into the Direct Answers yet, grab the positions first!
176. 95% of the cases had increased traffic - including movements within top 10 normal blue links.
Less than
expected, probably
because of quality of the
answer: results between -
5% and +6% traffic.
Results varied between -3%
and +11% depending on
previous position in the
SERPs
These were performing the
best, increases between 6
and 14%
Depending on the
topic, complicated topics
tend to get more clicks.
Average results between -
2% and 16% increase