SlideShare uma empresa Scribd logo
1 de 29
1
More Meaning. Better Results.
1
Building the Inform Semantic Publishing Ecosystem:
from Author to Audience
Marc Hadfield
VP, Research & Development
marc@inform.com
2
Marc Hadfield
• Semantic Technology, Computer Science
• Inform Technologies (Head of R&D)
‣ Semantic Technologies applied to Content Analysis & Distribution
• Alitora Systems (Co-Founder / CTO)
‣ Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC
‣ Life Science Natural Language Processing
• Columbia Genome Center
‣ NLP applied to Life Science Research Articles
• LCconnect (CTO)
‣ Letter-of-Credit Exchange
2
3
Semantics in Publishing…
3
• Ongoing Theme at ISWC 2010…
‣ NY Times
‣ Facebook (OpenGraph)
‣ Elsevier
‣ BBC
4
What is Inform?
4
• Inform is a content enrichment solution designed to increase consumer
engagement, page views and revenue.
• We provide a hosted Semantic Web Service for content publishers that:
1. Reads your article before you publish it
2. Turns main topics and entities (people, places, companies, organizations) into links
3. Provides feeds of related web content when you publish it
• New Direction: Optimizing Content Distribution via Direct Channels
• Web users moving away from destination web sites, but still want the destination web
site content.
• Companies utilizing Inform include:
Connecting your content
55
Audio, Video & Blogs
from the Web
Articles from
the Web
Content from Inform
Your Affiliates’ Content
Your Content
Affiliated
Content
Your
Content
Licensed
Content
Google Street View Topic 0.90
Google Company 1.00
Ireland Place 0.70
Norway Place 0.70
South Africa Place 0.70
Sweden Place 0.70
Brian McClendon Person 0.80
Mountain View, California Place 0.60
Wi-Fi Topic 0.50
6
Related Content Widgets
6
7
Inform Topic Pages, Micro Sites
7
8
My Job: Building the Semantic Platform…
8
• “Silo”-ed Semantic Technology  Semantic Web
‣ Aligned with Wikipedia, Leverage Linked Data for Mash-Ups
‣ RDFa, SKOS, Semantic SEO
• Semantic / NLP Engine
‣ Improve Features, Quality
• Semantic Data Infrastructure
‣ Scalable Infrastructure
• Semantic Data Analysis
‣ Algorithms (Topology of Graphs), Inference
‣ “PageRank” on semantic data
• Personalization, Usage Analysis
• Micro Sites
‣ Clusters of Topics, Generating Rich Content Experience
• Distributing to Social Platforms
‣ i.e. Facebook
9
Inform: Author to Audience
9
10
Leverage Inform Taxonomy
10
1111
Author 
‣ Content Creation Services
‣ Semantic Data Repository
‣ Semantic Data Analysis
‣ Content Selection Algorithms
‣ Webservices
‣ Content Distribution Services
 Audience
Inside the
Semantic
System
Architecture
12
Content Creation
12
• Article Creation Tool (ACT)
‣ Author Tools
‣ Embed in CMS, Tumblr / Wordpress Plugin
• Publisher Portal
‣ Editorial Tool
‣ Content Feeds
• Web Crawl
• Summarizer
‣ Create smart “blurbs” to advertise article
• LinkedData
‣ Freebase, Wikipedia, DBPedia, et cetera.
13
ACT Tool
13
14
ACT Tool
14
15
ACT Tool, Tumblr, Wordpress
15
16
Publisher Portal
16
17
Summarizer
17
18
Semantic Data Repository
18
• Data Master / Data Node
‣ Federated Semantic Data Managers
‣ SPARQL Triplestore (scalable cluster)
‣ Semantic Search
‣ Search Indexes (Semi-Structured and Full-Text Search)
‣ Lucene/Siren (Sindice)
‣ Facets, Frequency Counts
‣ Cache (In-Memory)
‣ Blob Store (Voldemort)
‣ Listener to Activity (Flume)
‣ User Activity (clicks)
‣ Content Activity (content updates)
‣ Near Real-Time Trends, Analysis
‣ Compute Algorithms (Stored Procedures in Groovy)
‣ Long Term Content Archive (offline)
19
Semantic Data Analysis
19
• Natural Language Processing
‣ Rules & Machine Learning, Training
‣ 500K articles per day, 4,000 unique sites
‣ Text Extraction, Section/Sentence Extraction
‣ Tokenization, Part-of-Speech, Noun/Verb Phrases
‣ Entity Extraction, Entity Normalization
‣ Topic Extraction, Summarization, Clustering
• User Activity
‣ User Model (Personalization)
• Semantic Inference
‣ F-Logic, Multi-Domain
‣ Linked Data Mash-Ups
• Semantic Graph Topology
‣ Entity / Property Importance Metrics, Ranking, “PageRank”
‣ Which triples in LinkedData are interesting?
20
Content Selection Algorithms
20
• Model of User, Personalization
‣ Social Networks provide Context
• Semantic Analysis of Content
• Algorithms
‣ Maximize Relevancy / Relatedness (Meets Editorial Criteria)
‣ Maximize Click-Through
‣ Cute Kitten vs. Engagement Issue
‣ Maximize Monetization
Goal: Content Exchange
21
Webservices
21
• REST
‣ Outputs RDF / JSON Data
• Natural Language Processing
‣ Article to Semantic MetaData
• Related Content
‣ Inputs: Content, Personalization, Algorithm
‣ Articles
‣ Semantic Mash-Ups
‣ Topics
‣ Entities
• Semantic Query, Site Search
• Storage, Content Repository
22
Content Distribution Services
22
• Customer Destinations (Traditional Business)
‣ Deep Integration
• Publisher Widgets
‣ Levels of Lightweight Integration
‣ Example: Related-Content-Widget in JavaScript
• Inform.com
‣ Topic Pages
• Micro Sites
‣ Several Thousand Owned-and-Operated Domains/Sites, Topic Driven
• Social Networks
‣ Facebook
Tools:
• Semantic SEO
‣ RDFa, SKOS
23
Semantic MetaData, RDFa
23
http://inspector.sindice.com
24
Facebook App
24
25
Using Facebook OpenGraph
25
Relevancy Algorithm:
Combine:
•Trending / Popular Topics
•Trending / Popular Articles
•Personalization “Liked” Topics
•Personalization “Liked” Articles
•User Profiles (“Users like you…”)
26
Facebook “Liked” Topics
26
27
Facebook Article Stream
27
28
Inform: Author to Audience via Semantics
28
29
Thanks for your attention!
29
Questions?
Contact Information:
Marc Hadfield
marc@inform.com

Mais conteúdo relacionado

Mais procurados

Structured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itStructured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itRichard Wallis
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics togetherJeff Fried
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataDimitri van Hees
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationChi-Yi Kuan
 
Focused Crawling for Structured Data
Focused Crawling for Structured DataFocused Crawling for Structured Data
Focused Crawling for Structured DataRobert Meusel
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowRichard Wallis
 
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...Lucidworks
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015Neo4j
 
DWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItDWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItMarc D Anderson
 
Real-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studyReal-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studydeep.bi
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesLinked Enterprise Date Services
 
Understanding voice of the member via text mining
Understanding voice of the member via text miningUnderstanding voice of the member via text mining
Understanding voice of the member via text miningChi-Yi Kuan
 
Instant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootInstant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootRemy Champion
 
S4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteS4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteMarin Dimitrov
 
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Yongzheng (Tiger) Zhang
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Linked Enterprise Date Services
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLDATAVERSITY
 
O365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottO365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottNCCOMMS
 
Semantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comSemantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comBernhard Schandl
 

Mais procurados (20)

Structured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itStructured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for it
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics together
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked Data
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePoint
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data Visualization
 
Focused Crawling for Structured Data
Focused Crawling for Structured DataFocused Crawling for Structured Data
Focused Crawling for Structured Data
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & How
 
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
 
DWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItDWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose It
 
Real-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studyReal-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case study
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
 
Understanding voice of the member via text mining
Understanding voice of the member via text miningUnderstanding voice of the member via text mining
Understanding voice of the member via text mining
 
Instant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootInstant Security and User Management in Spring Boot
Instant Security and User Management in Spring Boot
 
S4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteS4: The Self-Service Semantic Suite
S4: The Self-Service Semantic Suite
 
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
 
O365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottO365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermott
 
Semantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comSemantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.com
 

Semelhante a Building the Inform Semantic Publishing Ecosystem: from Author to Audience

2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1Don Miller
 
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyOracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyMark Rittman
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Mark Rittman
 
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceStructuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceNick Inglis
 
Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarConcept Searching, Inc
 
Big problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionBig problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionJean-Claude Sotto
 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceKarl Seiler
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...Dr. Haxel Consult
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Open Analytics
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenChristopher Whitaker
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & AnalysisScott Sanders
 
Big problems Big Data, simple solutions
Big problems Big Data, simple solutionsBig problems Big Data, simple solutions
Big problems Big Data, simple solutionsClaudio Pontili
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked DataJay Myers
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationPerficient, Inc.
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopPeter Skomoroch
 

Semelhante a Building the Inform Semantic Publishing Ecosystem: from Author to Audience (20)

Semantics and Machine Learning
Semantics and Machine LearningSemantics and Machine Learning
Semantics and Machine Learning
 
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
 
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyOracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
 
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-PremiseWebinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
 
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceStructuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
 
Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations Webinar
 
Big problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionBig problems Big data, simple AWS solution
Big problems Big data, simple AWS solution
 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a Service
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe Olsen
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & Analysis
 
Big problems Big Data, simple solutions
Big problems Big Data, simple solutionsBig problems Big Data, simple solutions
Big problems Big Data, simple solutions
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked Data
 
Semantic Web For Dummies
Semantic Web For DummiesSemantic Web For Dummies
Semantic Web For Dummies
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data Visualization
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With Hadoop
 

Mais de Vital.AI

Optimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceOptimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceVital.AI
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data ModelingVital.AI
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect WorldVital.AI
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest GraphVital.AI
 

Mais de Vital.AI (6)

Optimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceOptimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data Science
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent Apps
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect World
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest Graph
 

Último

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Último (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Building the Inform Semantic Publishing Ecosystem: from Author to Audience

  • 1. 1 More Meaning. Better Results. 1 Building the Inform Semantic Publishing Ecosystem: from Author to Audience Marc Hadfield VP, Research & Development marc@inform.com
  • 2. 2 Marc Hadfield • Semantic Technology, Computer Science • Inform Technologies (Head of R&D) ‣ Semantic Technologies applied to Content Analysis & Distribution • Alitora Systems (Co-Founder / CTO) ‣ Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC ‣ Life Science Natural Language Processing • Columbia Genome Center ‣ NLP applied to Life Science Research Articles • LCconnect (CTO) ‣ Letter-of-Credit Exchange 2
  • 3. 3 Semantics in Publishing… 3 • Ongoing Theme at ISWC 2010… ‣ NY Times ‣ Facebook (OpenGraph) ‣ Elsevier ‣ BBC
  • 4. 4 What is Inform? 4 • Inform is a content enrichment solution designed to increase consumer engagement, page views and revenue. • We provide a hosted Semantic Web Service for content publishers that: 1. Reads your article before you publish it 2. Turns main topics and entities (people, places, companies, organizations) into links 3. Provides feeds of related web content when you publish it • New Direction: Optimizing Content Distribution via Direct Channels • Web users moving away from destination web sites, but still want the destination web site content. • Companies utilizing Inform include:
  • 5. Connecting your content 55 Audio, Video & Blogs from the Web Articles from the Web Content from Inform Your Affiliates’ Content Your Content Affiliated Content Your Content Licensed Content Google Street View Topic 0.90 Google Company 1.00 Ireland Place 0.70 Norway Place 0.70 South Africa Place 0.70 Sweden Place 0.70 Brian McClendon Person 0.80 Mountain View, California Place 0.60 Wi-Fi Topic 0.50
  • 7. 7 Inform Topic Pages, Micro Sites 7
  • 8. 8 My Job: Building the Semantic Platform… 8 • “Silo”-ed Semantic Technology  Semantic Web ‣ Aligned with Wikipedia, Leverage Linked Data for Mash-Ups ‣ RDFa, SKOS, Semantic SEO • Semantic / NLP Engine ‣ Improve Features, Quality • Semantic Data Infrastructure ‣ Scalable Infrastructure • Semantic Data Analysis ‣ Algorithms (Topology of Graphs), Inference ‣ “PageRank” on semantic data • Personalization, Usage Analysis • Micro Sites ‣ Clusters of Topics, Generating Rich Content Experience • Distributing to Social Platforms ‣ i.e. Facebook
  • 9. 9 Inform: Author to Audience 9
  • 11. 1111 Author  ‣ Content Creation Services ‣ Semantic Data Repository ‣ Semantic Data Analysis ‣ Content Selection Algorithms ‣ Webservices ‣ Content Distribution Services  Audience Inside the Semantic System Architecture
  • 12. 12 Content Creation 12 • Article Creation Tool (ACT) ‣ Author Tools ‣ Embed in CMS, Tumblr / Wordpress Plugin • Publisher Portal ‣ Editorial Tool ‣ Content Feeds • Web Crawl • Summarizer ‣ Create smart “blurbs” to advertise article • LinkedData ‣ Freebase, Wikipedia, DBPedia, et cetera.
  • 15. 15 ACT Tool, Tumblr, Wordpress 15
  • 18. 18 Semantic Data Repository 18 • Data Master / Data Node ‣ Federated Semantic Data Managers ‣ SPARQL Triplestore (scalable cluster) ‣ Semantic Search ‣ Search Indexes (Semi-Structured and Full-Text Search) ‣ Lucene/Siren (Sindice) ‣ Facets, Frequency Counts ‣ Cache (In-Memory) ‣ Blob Store (Voldemort) ‣ Listener to Activity (Flume) ‣ User Activity (clicks) ‣ Content Activity (content updates) ‣ Near Real-Time Trends, Analysis ‣ Compute Algorithms (Stored Procedures in Groovy) ‣ Long Term Content Archive (offline)
  • 19. 19 Semantic Data Analysis 19 • Natural Language Processing ‣ Rules & Machine Learning, Training ‣ 500K articles per day, 4,000 unique sites ‣ Text Extraction, Section/Sentence Extraction ‣ Tokenization, Part-of-Speech, Noun/Verb Phrases ‣ Entity Extraction, Entity Normalization ‣ Topic Extraction, Summarization, Clustering • User Activity ‣ User Model (Personalization) • Semantic Inference ‣ F-Logic, Multi-Domain ‣ Linked Data Mash-Ups • Semantic Graph Topology ‣ Entity / Property Importance Metrics, Ranking, “PageRank” ‣ Which triples in LinkedData are interesting?
  • 20. 20 Content Selection Algorithms 20 • Model of User, Personalization ‣ Social Networks provide Context • Semantic Analysis of Content • Algorithms ‣ Maximize Relevancy / Relatedness (Meets Editorial Criteria) ‣ Maximize Click-Through ‣ Cute Kitten vs. Engagement Issue ‣ Maximize Monetization Goal: Content Exchange
  • 21. 21 Webservices 21 • REST ‣ Outputs RDF / JSON Data • Natural Language Processing ‣ Article to Semantic MetaData • Related Content ‣ Inputs: Content, Personalization, Algorithm ‣ Articles ‣ Semantic Mash-Ups ‣ Topics ‣ Entities • Semantic Query, Site Search • Storage, Content Repository
  • 22. 22 Content Distribution Services 22 • Customer Destinations (Traditional Business) ‣ Deep Integration • Publisher Widgets ‣ Levels of Lightweight Integration ‣ Example: Related-Content-Widget in JavaScript • Inform.com ‣ Topic Pages • Micro Sites ‣ Several Thousand Owned-and-Operated Domains/Sites, Topic Driven • Social Networks ‣ Facebook Tools: • Semantic SEO ‣ RDFa, SKOS
  • 25. 25 Using Facebook OpenGraph 25 Relevancy Algorithm: Combine: •Trending / Popular Topics •Trending / Popular Articles •Personalization “Liked” Topics •Personalization “Liked” Articles •User Profiles (“Users like you…”)
  • 28. 28 Inform: Author to Audience via Semantics 28
  • 29. 29 Thanks for your attention! 29 Questions? Contact Information: Marc Hadfield marc@inform.com