SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
Diana Tamabayeva & Dan Delany

API’s, Freebase, and the 
Collaborative Semantic Web
Dan Delany
dan.delany@gmail.com
 cognitiveharmony.net
     970‐309‐8598
Object‐Oriented Representation

                             “John
Smith
is
a
five‐
                               foot‐tall
male
who

                             weighs
one‐hundred

                              eighty
pounds.
He’s

                            forty‐two
years
old
and

                                 he
loves
dogs.”
   sortByHeight(users);
  isADude(“John
Smith”);


Structured data in         String of characters 
object ‐ useful to          ‐ useful to people
    computers  
Object‐Oriented Representation
  class
Dog<Mammal                       class
Place
    @home,
@age,
@human                    @latlong,
@elevation,
    def
goHome                             @address,
@name
      go(@home);                           def
isBelowSeaLevel
                          Home
    end                                      @elevation
<
0;
                                           end
                              Pet
                                          Home
                  Human
                                                             Spouse
O‐O instance variables and          class
Human<Mammal       Friends
methods represent                     @home,
@age,
@pets
structured semantic                   @spouse,
@friends

                                      def
divorce
relationships between 
                                        @spouse
=
nil;
pieces of information.                end
Structured Semantic 
Relationships Among Objects



We visualize this with a semantic graph.
The (hypertext) Web Today



            Web
Servers

                          WWW
     LAMP




User
Computer
The (semantic) Web Tomorrow?
                                             Servers
                      Servers




User
Computer
    Structured
Data
    (OWL/RDF)
                                     Data
Aggregator/
    Document
Data
    (HTML/CSS)
                                        Visualizer
    Hyperlinks
                           Servers
    (Old
www
links)
A Cloud that Talks To Itself.




            vs.
Why? Searchability.
With semantic graphs, you can perform semantic searches
 by traversing the graph:




 “Where does the woman who lives at 2408 Walnut work?”
Why? So You Can Mash it Up.



    twittermap.com




                       mapmash.googlepages.com/gaza.html



                     Mashups create meaning from data.
housingmaps.com
Why? Context‐Aware AI
Today’s AI is limited by the domain‐
specificity of its input data
                  Example: Image Analysis
              Context‐Unaware: Blob Tracking



                                       Useful in it’s domain (in this 
                                       case, touch detection), but 
                                       ultimately has no 
                       Blob2           ‘intelligence.’
          Blob1
Why? Context‐Aware AI
  Today’s AI is limited by the domain‐
  specificity of its input data
                      Example: Image Analysis
                   Context‐Aware: Mine the semantic 
                     graph for heuristics and clues
     Trees
             Sky


                                           You’re in the mountains 
                                           near Aspen, CO. It is the 
                                           fall. Your friends Taifur 
                                           and Rachel are at
                                           the same location. Etc.
Rachel   Taifur     Dan
Why? Context‐Aware AI


 I have a dream for the Web [in which computers] become capable of analyzing 
 all the data on the Web ‐ the content, links, and transactions between people 
 and computers. A ‘Semantic Web’, which should make this possible, has yet to 
 emerge, but when it does, the day‐to‐day mechanisms of trade, bureaucracy 
 and our daily lives will be handled by machines talking to machines.

  The ‘intelligent agents’ people have 
 touted for ages will finally materialize.
                                                           Tim Berners‐Lee, 1999
Challenges ‐ Motivation/Critical Mass

                       Value comes from ubiquity, 
                       and ubiquity comes from value.
                       How do we encourage adoption of technology
                       that does not yet provide value?



More importantly, how will it provide value to content producers? 
How is giving away your data’s meaning (your secret sauce) 
instead of presenting it alongside ads valuable?

Should semantic feeds be monetized?
“Information As A Service”
                                                                     14
Challenges ‐ Privacy/Security




• Reduced
anonymity
on
the
Web
• Increased
invasion
of
privacy
• SoluSon:
access
privileges
must

  be
controlled
by
infrastructure‐
  level
security
The State of the Semantic Web
Top‐Down: Content Structuring
•SemanSc
search
engines
generate
and
leverage
an
internal
semanSc
database
•Data
sSll
returned
as
HTML,
no
API










not
helping
create
the
semanSc
web!
Top‐Down: Dapper
•Dapper
‐
UI
for
seXng
up
rules
to

scrape
semi‐structured
data
(CSV,
RSS,
XML)

                                                         RDF
from
any
set
of
HTML
documents!                CSV
                                                     to
semanSc
web

                                               RSS       OWL


WWW
Top‐Down: Yahoo! Pipes
•Yahoo!
Pipes
‐
UI
for
transforming,

reformaXng,
and
combining
data
feeds
into

                                                        RDF
more
useful
data
feeds.
                                             JSON
                                                    to
semanSc
web

  CSV                                         RSS       OWL


JSON




       RSS
Bottom‐Up: Publishing Standards

                                         RDF and OWL standards
•W3C‐sponsored
standards
for
defining
semanSc
relaSonships
and
resources
•Powerful,
but
complex
and
hard
for
humans
to
read/create
•No
mo@va@on
for
developers
to
create
•No
W3C‐sponsored
universal
ontologies
       <rdf:Description                      owl:Class rdf:ID=quot;Carquot;
       rdf:about=quot;http://.../DansCarquot;>        rdfs:subClassOf rdf:resource=quot;#Vehiclequot;
        <car:color>Red</car:color>            rdfs:subClassOf [a owl:Restriction;
        <car:make>Honda</make>               owl:cardinality quot;4quot;^^xsd:nonNegativeInteger;
        <car:year>1999</make>                owl:onProperty <#Wheel> ]


         RDF Data Node                             OWL Ontology
               Dan’s                                              hasA
                                                   isA
                                                         Car             Color
                                         Vehicle
       color              make
                Car
                                                             HasMul*ple(4)
                   year

               1999
API’s: I dream of a RESTful tomorrow

Thousands of content creators are already sharing structured data 
with API’s!
Bottom‐Up: Knowledge Bases
                                 Domain
Experts
•CollaboraSve
a_empt
to
build

factual
knowledge
base
•“Object‐Oriented
Wikipedia”
Bottom‐Up: Knowledge Bases
                                   Domain
Experts
•Content
contributed
by

experts
manually,
or
by
dataset

owners
automaScally.



Seman@c
Analysis
Freebase: The Everything Graph
Metaweb Freebase Statistics

                 •5.3
million
topics
today
                   •Growing
by
~15,000/month
                   •Pulled
from
public
data
                   •By
comparison,
Wikipedia
has

                   2.64
million
English
arScles
                 •25,379
users
today.
                   •Growing
by
600‐800/month




                 •Freebase
Launch,
March
2007
                 •LLC
Founded,
July
2005
The Freebase Approach
Crea@ve
Commons                Content
ALribu@on
License              Publishers                         WWW


                               Open
API


                        PI
                       RDF
A
                                (MQL)                  App Developers
Node
Editor
GUI
                                                       Data Modelers

                                                       Expert Users &
                                                       Dataset Owners



       Casual
       Collaborators                        Exis@ng
                                            Datasets              WWW
Freebase Data Policy
Crea@ve
Commons
                                 •Requires
A_ribuSon
of
Source
ALribu@on
License
                                                MQL
Query:
                            Open
API


                     PI
                                          [{   quot;albumquot; : {




                    RDF
A
                                                  quot;artistquot; : [],
                                                  quot;namequot; : null,

                             (MQL)                quot;release_datequot; : null
                                               },
                                               quot;limitquot; : 25,
                                               quot;namequot; : null,
                                               quot;name~=quot; : quot;Love*quot;,
                                               quot;typequot; : quot;/music/trackquot; }]

                                                   Returns:
    Data                                 [{ quot;albumquot; : {
                                                quot;artistquot; : [quot;Massive Attackquot;],
                                                quot;namequot; : quot;Blue Linesquot;,
                                                quot;release_datequot; : quot;1991-04-08quot;




  Freedom!
                                             },
                                             quot;namequot; : quot;One Lovequot;,
                                             quot;typequot; : quot;/music/trackquot;
                                         },{quot;albumquot; : {
                                                quot;artistquot; : [“Squirrel Nut
                                         Zippersquot;],
                                                quot;namequot; : quot;The Inevitablequot;,
                                                quot;release_datequot; : quot;1995-03-17quot;
                                             },
                                             quot;namequot; : quot;Anything But Lovequot;,
The Freebase Approach
Crea@ve
Commons                Content
ALribu@on
License              Publishers                         WWW


                               Open
API


                        PI
                       RDF
A
                                (MQL)                  App Developers
Node
Editor
GUI
                                                       Data Modelers

                                                       Expert Users &
                                                       Dataset Owners



       Casual
       Collaborators                        Exis@ng
                                            Datasets              WWW
Freebase Community

                         App
Developers
List

                          Data
Modelers
List

                                                          App Developers
IRC
Chat
(open
to
all)
                                                          Data Modelers
                   Discussion
Threads
                   on
Individual
Topics                  Expert Users &
                                                         Dataset Owners

                                               •No
Central
Forum
                                               •No
Backlog
of
Mailing
List
        Casual
                                               •No
Friends
        Collaborators
                                               •No
Private
Messages
Freebase Community Tools
                                       “Acre,
the
Freebase
applicaSon

                                       development
plaqorm,
lets
anyone

                                       mashup
Freebase
data
using

                                       Javascript
and
have
it
hosted
for

   Employees                           free.”
‐ Shawn Simister, developer



                                                                                                  App Developers
“We
have
been
working
         “[Data
Modeling
is
hard
because
new

hard
recently
to
provide
      schemas
are
a
slow
process,
and
they

bulk
import
tools
             can
break
users’
code.
Let’s
have
a

                               “Sloppy
Freebase”
that
allows
                                     Data Modelers
for
Freebase.
While
such

tools
exist
internally,
the
   users
to
enter
unstructured
data
unSl

reconciliaSon
process
has
     new
schemas
are
defined.]”
‐
Jack Alves, 
                                                                                                  Expert Users &
thus
far
been
too
             former Metaweb Director of Engineering
                                                                                                  Dataset Owners
complicated
for
public

release.”
‐ Brian 
Culbertson, Metaweb 
                                                        “As
a
programmer,
I
feel
that
I'm
most

Engineer
                                                        effecSve
when
I'm
contribuSng

                                                        large
data
sets
…
faciliSes

          Casual                                        built
into
Freebase
that
let
users

          Collaborators                                 upload
lists
of
topics
are
limited
to

                                                        specific
situaSons.”
‐ Shawn Simister
“Sloppy” Data Modeling
“[Data
Modeling
is
hard
because
new

                                           Currently, Freebase users cannot submit data if there is not 
schemas
are
a
slow
process,
and
they

                                           already a data structure + ontology built for that data type.
can
break
users’
code.
Let’s
have
a

“Sloppy
Freebase”
that
allows

                                           “Sloppy” data creation allows users to create their own data 
users
to
enter
unstructured
data
unSl

new
schemas
are
defined.]”
‐
Jack Alves, 
                                           types, which will later be cleaned and standardized.
former Metaweb Director of Engineering

      User‐Generated, Semi‐
      structured “sloppy” data                                               Clean, structured data
               Bob Jones                                                  John Green
                                                                                                 RDF
              major: English                                             area of study:
                                                                           Sociology

                                                                                             to
semanSc
web
John Green                      Eric Bradley
                                                                             Bob Jones
  focus:                         studying:
                                                                            area of study:
 Sociology                           CS
                                                                                                 OWL
                                                                               English

            Casual                               Automated Data             Eric Bradley
            Collaborators                        Cleaner‐Upper             area of study:
                                                                                 CS
Conclusion

 We
are
here.   • Progress
                  • RDF/OWL
                  • Freebase
‐
5.3
mil.
topics!
                  • Dapper,
Yahoo!
Pipes,
other

                    data
abstractors
                  • API’s
out
the
wazoo
                • Issues
                  •   AdopSon
                  •   Privacy/Security
                  •   Intellectual
Property
                  •   We’re
good
at
making

                      content,
but
we
suck
at

                      mining
it
and
describing
it.

Mais conteúdo relacionado

Semelhante a API's, Freebase, and the Collaborative Semantic web

The Yahoo Open Stack
The Yahoo Open StackThe Yahoo Open Stack
The Yahoo Open Stack
Megan Eskey
 
Facebook Hadoop Data & Applications
Facebook Hadoop Data & ApplicationsFacebook Hadoop Data & Applications
Facebook Hadoop Data & Applications
dzhou
 
Cwinters Intro To Rest And JerREST and Jersey Introductionsey
Cwinters Intro To Rest And JerREST and Jersey IntroductionseyCwinters Intro To Rest And JerREST and Jersey Introductionsey
Cwinters Intro To Rest And JerREST and Jersey Introductionsey
elliando dias
 
Yahoo Pipes Middleware In The Cloud
Yahoo Pipes Middleware In The CloudYahoo Pipes Middleware In The Cloud
Yahoo Pipes Middleware In The Cloud
ConSanFrancisco123
 
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
jward5519
 
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
jward5519
 
Text Mining and SEASR
Text Mining and SEASRText Mining and SEASR
Text Mining and SEASR
Loretta Auvil
 

Semelhante a API's, Freebase, and the Collaborative Semantic web (20)

When To Use Ruby On Rails
When To Use Ruby On RailsWhen To Use Ruby On Rails
When To Use Ruby On Rails
 
The Yahoo Open Stack
The Yahoo Open StackThe Yahoo Open Stack
The Yahoo Open Stack
 
Gates Toorcon X New School Information Gathering
Gates Toorcon X New School Information GatheringGates Toorcon X New School Information Gathering
Gates Toorcon X New School Information Gathering
 
Facebook Hadoop Data & Applications
Facebook Hadoop Data & ApplicationsFacebook Hadoop Data & Applications
Facebook Hadoop Data & Applications
 
Cwinters Intro To Rest And JerREST and Jersey Introductionsey
Cwinters Intro To Rest And JerREST and Jersey IntroductionseyCwinters Intro To Rest And JerREST and Jersey Introductionsey
Cwinters Intro To Rest And JerREST and Jersey Introductionsey
 
Introduction To R
Introduction To RIntroduction To R
Introduction To R
 
Some news about the SW
Some news about the SWSome news about the SW
Some news about the SW
 
Yahoo Pipes Middleware In The Cloud
Yahoo Pipes Middleware In The CloudYahoo Pipes Middleware In The Cloud
Yahoo Pipes Middleware In The Cloud
 
Hello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperHello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic Developer
 
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
 
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1Creating Semantic Mashups  Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
 
Text Mining and SEASR
Text Mining and SEASRText Mining and SEASR
Text Mining and SEASR
 
Graph and Amazon Neptune - Bill Baldwin
Graph and Amazon Neptune - Bill BaldwinGraph and Amazon Neptune - Bill Baldwin
Graph and Amazon Neptune - Bill Baldwin
 
Roll-out of the NYU HSL Website and Drupal CMS
Roll-out of the NYU HSL Website and Drupal CMSRoll-out of the NYU HSL Website and Drupal CMS
Roll-out of the NYU HSL Website and Drupal CMS
 
SEASR Overview
SEASR OverviewSEASR Overview
SEASR Overview
 
OpenSRF and Evergreen
OpenSRF and EvergreenOpenSRF and Evergreen
OpenSRF and Evergreen
 
Graph and Amazon Neptune
Graph and Amazon NeptuneGraph and Amazon Neptune
Graph and Amazon Neptune
 
SEO for the Semantic Web
SEO for the Semantic WebSEO for the Semantic Web
SEO for the Semantic Web
 
Graph & Neptune
Graph & NeptuneGraph & Neptune
Graph & Neptune
 
Blueprint talk at Open Hackday London 2009
Blueprint talk at Open Hackday London 2009Blueprint talk at Open Hackday London 2009
Blueprint talk at Open Hackday London 2009
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

API's, Freebase, and the Collaborative Semantic web