SlideShare uma empresa Scribd logo
1 de 8
Baixar para ler offline
Session: Social Semantics (9:30 – 11:10)



          n    Mark Greaves: Crowdsourcing Semantics

          n    Denny Vrandečić, Elena Simperl, Rudi Studer:
                Diversity and the Semantic Web

          n    Andreas Harth: Beyond Privacy

          n    Denny Vrandečić: Shortipedia



1
Crowdsourcing Semantic Information

            Mark Greaves
             Vulcan Inc.
          markg@vulcan.com

                                 © 2011 Vulcan Inc.
Crowdsourcing Knowledge with the Semantic Web
    n    Scalable knowledge acquisition (KA) is a grand challenge in AI
    n    There are several known factors which impact KA success
           –    The skill of the KA team and the amount of training/coordination required
           –    The expressive power and ease of learning of the KR formalism
           –    The usability and power of the KA tools
           –    The formal complexity of an adequate domain model

    n    Semantic web implies a KA strategy using crowdsourcing
           –    Encoder = DBA or webmaster SME, with minimal training and interaction
           –    Huge numbers of authors and the web as a global publication fabric
           –    KR with low expressive power = RDF or (sometimes) OWL
           –    A set of tools and syntaxes
           –    Modeling a domain of straightforward facts


     Is crowdsourced KA via semantic web likely to succeed?
            Is pay-as-you-go integration likely to work?
3
Is embedded semantic markup a promising KA strategy?
    n    Can crowdsourced KA via page-embedded semantic markup succeed?
           –  The original Semantic Web use case
           –  Combined structured and unstructured knowledge in the same place, with (hopefully)
              synchronized update
           –  Machines could “read the web” without NLP
           –  Best incentive ended up being to support SEO and publishing social data

    n    The situation for powerful page-based semantic markup is not hopeful
           –  Despite 430M web pages with RDFa, Google said at SemTech that webmaster
              authoring in RDFa was too difficult, and this is probably right
           –  Facebook said that >10% of OG markup is syntactically incorrect or incoherent

    n    The KA answer so far: Schema.org and Facebook OG
           –  Data publishers are required to use a single common global ontology and vocabulary
           –  Formal KR complexity is almost the lowest possible
           –  Lowering the bar to achieve success

                       Clay Shirkey will be shown wrong…
            … but it is difficult to take much satisfaction in this
4
Is Linked Data a Promising KA Crowdsourcing Strategy?

     n    Can crowdsourced KA via Linked Data succeed?
           –  “Evolved” Semantic Web use case
           –  Best incentive ended up being sharing and (government) data distribution

     n    The situation for Linked Data markup is more hopeful
           –  Much more productive, yielding 10s of billions of semantic assertions
           –  Many organizations are successfully publishing Linked Data
           –  Overall semantic cohesion will increase as more data is mapped together

     n    The KA answer so far: faith in Pay-As-You-Go (PAYGO)
           –  Building from a core set of known vocabularies and ontologies
           –  PAYGO should yield a set of evolving, partial agreements on semantic models
              and terminology
           –  What is the incentive to create and maintain high-quality PAYGO models?
           –  There should be a business here… but no one has found it yet.


    The community is better at publishing data than integrating it
5
How Can We Make PAYGO Succeed?

    n    Is the PAYGO authorship just the familiar KA problem?
          –  PAYGO implies distributed authoring and managing a set of useful integration
             mappings
          –  Data integration is much harder than just asserting links

    n    PAYGO experiences
          –  How do companies succeed at “traditional” database PAYGO integration?
          –  Powerful commercial reasons to differentiate products (brands, trademarks, etc.),
             so there are very high costs in creating product mappings
          –  Achieving PAYGO in neuroscience has been exceptionally difficult

    n    Mobile-social is next big tranche of Semantic Web data
          –  How can PAYGO work here given the fuzziness of the data collection?
          –  Can we use machine learning for PAYGO in this area?

    What PAYGO lessons can Semantic Web researchers learn
           from the database integration community?
6
Can Semantic Web Crowdsourced KA Yield Useful KBs?

    n    The broad KA war is over and we won (sort of)
          –  RDF/OWL is the most important AI KR system on the planet
          –  Semantic Web can create communities of engaged data publishers who take
             ownership of the integrated data
          –  PAYGO and Linked Data is the vision for cost-effective KA and maintenance of
             specialized content

    n    Is the semantic web KB technically useful?
          –  Data is often impossible to cache
          –  Data at this scale always includes significant percentage of mistakes
          –  Schema.org and Facebook OG markup has a very weak semantics

    n    Responding to Watson: Most human questions are not precise
          –  Replacement of deduction by evidence gathering in large data sets
          –  “Popular shopping areas in London”

                           Mark’s Challenge to the EC:
                          Continue on the LarKC Vision!
7
Thank You


                   Disclaimer: The preceding slides represent the views of the author only.
    Any brands, logos and products are trademarks or registered trademarks of their respective companies.
8

Mais conteúdo relacionado

Destaque (10)

STI Summit 2011 - Dynamic web
STI Summit 2011 - Dynamic webSTI Summit 2011 - Dynamic web
STI Summit 2011 - Dynamic web
 
Teaching
TeachingTeaching
Teaching
 
STI Summit 2011 - Linked services
STI Summit 2011 - Linked servicesSTI Summit 2011 - Linked services
STI Summit 2011 - Linked services
 
STI2 Budget 2012
STI2 Budget 2012STI2 Budget 2012
STI2 Budget 2012
 
STI Summit 2011 - Making linked data work
STI Summit 2011 - Making linked data workSTI Summit 2011 - Making linked data work
STI Summit 2011 - Making linked data work
 
STI2 Board Meeting 2011 - STI Americas
STI2 Board Meeting 2011 - STI AmericasSTI2 Board Meeting 2011 - STI Americas
STI2 Board Meeting 2011 - STI Americas
 
STI2 Board Meeting 2011 - US strategy
STI2 Board Meeting 2011 - US strategySTI2 Board Meeting 2011 - US strategy
STI2 Board Meeting 2011 - US strategy
 
Summit2013 choi - kaist-cs-intro
Summit2013   choi - kaist-cs-introSummit2013   choi - kaist-cs-intro
Summit2013 choi - kaist-cs-intro
 
Summit2013 ho-jin choi - summit2013
Summit2013   ho-jin choi - summit2013Summit2013   ho-jin choi - summit2013
Summit2013 ho-jin choi - summit2013
 
Bullying in school
Bullying in schoolBullying in school
Bullying in school
 

Semelhante a STI Summit 2011 - Social semantics

From semantic platforms to semantic apps
From semantic platforms to semantic appsFrom semantic platforms to semantic apps
From semantic platforms to semantic appsscroisier
 
Network of Networks Discussion and Charter Document
Network of Networks Discussion and Charter DocumentNetwork of Networks Discussion and Charter Document
Network of Networks Discussion and Charter DocumentLora Cecere
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trendsAlan Morrison
 
Scaling the mirrorworld with knowledge graphs
Scaling the mirrorworld with knowledge graphsScaling the mirrorworld with knowledge graphs
Scaling the mirrorworld with knowledge graphsAlan Morrison
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008Blogtalk 2008
 
WebGUI And The Semantic Web
WebGUI And The Semantic WebWebGUI And The Semantic Web
WebGUI And The Semantic WebWilliam McKee
 
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Integrating Semantic Web with the Real World  - A Journey between Two Cities ...Integrating Semantic Web with the Real World  - A Journey between Two Cities ...
Integrating Semantic Web with the Real World - A Journey between Two Cities ...Juan Sequeda
 
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios SpagoWorld
 
Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...
Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...
Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...Kristin Wolff
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfAlan Morrison
 
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?Martin Hepp
 
Big data connection overview by aibdp.org
Big data connection overview by aibdp.orgBig data connection overview by aibdp.org
Big data connection overview by aibdp.orgAIBDP
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphCambridge Semantics
 
Big Data in Education Sector
Big Data in Education SectorBig Data in Education Sector
Big Data in Education SectorKaran Sachdeva
 
Webinar: BI in the Sky - The New Rules of Cloud Analytics
Webinar: BI in the Sky - The New Rules of Cloud AnalyticsWebinar: BI in the Sky - The New Rules of Cloud Analytics
Webinar: BI in the Sky - The New Rules of Cloud AnalyticsSnapLogic
 

Semelhante a STI Summit 2011 - Social semantics (20)

The Future of LOD
The Future of LODThe Future of LOD
The Future of LOD
 
Semantic web
Semantic webSemantic web
Semantic web
 
Hak intis2013
Hak intis2013Hak intis2013
Hak intis2013
 
From semantic platforms to semantic apps
From semantic platforms to semantic appsFrom semantic platforms to semantic apps
From semantic platforms to semantic apps
 
Network of Networks Discussion and Charter Document
Network of Networks Discussion and Charter DocumentNetwork of Networks Discussion and Charter Document
Network of Networks Discussion and Charter Document
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trends
 
Scaling the mirrorworld with knowledge graphs
Scaling the mirrorworld with knowledge graphsScaling the mirrorworld with knowledge graphs
Scaling the mirrorworld with knowledge graphs
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008
 
WebGUI And The Semantic Web
WebGUI And The Semantic WebWebGUI And The Semantic Web
WebGUI And The Semantic Web
 
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Integrating Semantic Web with the Real World  - A Journey between Two Cities ...Integrating Semantic Web with the Real World  - A Journey between Two Cities ...
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
 
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
 
Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...
Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...
Handout for Workforce Session at the HUD Sustainable Communities Grantee Conv...
 
Data aware apps
Data aware appsData aware apps
Data aware apps
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdf
 
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
 
Big data connection overview by aibdp.org
Big data connection overview by aibdp.orgBig data connection overview by aibdp.org
Big data connection overview by aibdp.org
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
 
Big Data in Education Sector
Big Data in Education SectorBig Data in Education Sector
Big Data in Education Sector
 
Webinar: BI in the Sky - The New Rules of Cloud Analytics
Webinar: BI in the Sky - The New Rules of Cloud AnalyticsWebinar: BI in the Sky - The New Rules of Cloud Analytics
Webinar: BI in the Sky - The New Rules of Cloud Analytics
 
Semantic Web Nature
Semantic Web NatureSemantic Web Nature
Semantic Web Nature
 

Mais de Semantic Technology Institute International

Mais de Semantic Technology Institute International (20)

Summit2013 sw in russian universities
Summit2013   sw in russian universitiesSummit2013   sw in russian universities
Summit2013 sw in russian universities
 
Summit2013 semantic web in russia
Summit2013   semantic web in russiaSummit2013   semantic web in russia
Summit2013 semantic web in russia
 
Summit2013 john domingue - introduction
Summit2013   john domingue - introductionSummit2013   john domingue - introduction
Summit2013 john domingue - introduction
 
Summit2013 john domingue - horizon2020
Summit2013   john domingue - horizon2020Summit2013   john domingue - horizon2020
Summit2013 john domingue - horizon2020
 
Summit2013 georg gottlob and tim furche - diadem
Summit2013   georg gottlob and tim furche - diademSummit2013   georg gottlob and tim furche - diadem
Summit2013 georg gottlob and tim furche - diadem
 
Summit2013 eventos onto quad
Summit2013   eventos onto quadSummit2013   eventos onto quad
Summit2013 eventos onto quad
 
Summit2013 choi - wise kb-introd
Summit2013   choi - wise kb-introdSummit2013   choi - wise kb-introd
Summit2013 choi - wise kb-introd
 
STI Summit 2011 - Mlr-sm
STI Summit 2011 - Mlr-smSTI Summit 2011 - Mlr-sm
STI Summit 2011 - Mlr-sm
 
STI Summit 2011 - Linked data-services-streams
STI Summit 2011 - Linked data-services-streamsSTI Summit 2011 - Linked data-services-streams
STI Summit 2011 - Linked data-services-streams
 
STI Summit 2011 - di@scale
STI Summit 2011 - di@scaleSTI Summit 2011 - di@scale
STI Summit 2011 - di@scale
 
STI Summit 2011 - A personal look at the future of Semantic Technologies
STI Summit 2011 - A personal look at the future of Semantic TechnologiesSTI Summit 2011 - A personal look at the future of Semantic Technologies
STI Summit 2011 - A personal look at the future of Semantic Technologies
 
STI Summit 2011 - Visual analytics and linked data
STI Summit 2011 - Visual analytics and linked dataSTI Summit 2011 - Visual analytics and linked data
STI Summit 2011 - Visual analytics and linked data
 
STI Summit 2011 - LS4 LS Khaos
STI Summit 2011 - LS4 LS KhaosSTI Summit 2011 - LS4 LS Khaos
STI Summit 2011 - LS4 LS Khaos
 
STI Summit 2011 - Shortipedia
STI Summit 2011 - ShortipediaSTI Summit 2011 - Shortipedia
STI Summit 2011 - Shortipedia
 
STI Summit 2011 - Diversity
STI Summit 2011 - DiversitySTI Summit 2011 - Diversity
STI Summit 2011 - Diversity
 
STI Summit 2011 - Monetizing the Semantic Web
STI Summit 2011 - Monetizing the Semantic WebSTI Summit 2011 - Monetizing the Semantic Web
STI Summit 2011 - Monetizing the Semantic Web
 
STI Summit 2011 - Limits of LOD
STI Summit 2011 - Limits of LODSTI Summit 2011 - Limits of LOD
STI Summit 2011 - Limits of LOD
 
STI Summit 2011 - Linked Data & Ontologies
STI Summit 2011 - Linked Data & OntologiesSTI Summit 2011 - Linked Data & Ontologies
STI Summit 2011 - Linked Data & Ontologies
 
STI Summit 2011 - DB vs RDF
STI Summit 2011 - DB vs RDFSTI Summit 2011 - DB vs RDF
STI Summit 2011 - DB vs RDF
 
STI Summit 2011 - Global data integration and global data mining
STI Summit 2011 - Global data integration and global data miningSTI Summit 2011 - Global data integration and global data mining
STI Summit 2011 - Global data integration and global data mining
 

Último

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 

Último (20)

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 

STI Summit 2011 - Social semantics

  • 1. Session: Social Semantics (9:30 – 11:10) n  Mark Greaves: Crowdsourcing Semantics n  Denny Vrandečić, Elena Simperl, Rudi Studer: Diversity and the Semantic Web n  Andreas Harth: Beyond Privacy n  Denny Vrandečić: Shortipedia 1
  • 2. Crowdsourcing Semantic Information Mark Greaves Vulcan Inc. markg@vulcan.com © 2011 Vulcan Inc.
  • 3. Crowdsourcing Knowledge with the Semantic Web n  Scalable knowledge acquisition (KA) is a grand challenge in AI n  There are several known factors which impact KA success –  The skill of the KA team and the amount of training/coordination required –  The expressive power and ease of learning of the KR formalism –  The usability and power of the KA tools –  The formal complexity of an adequate domain model n  Semantic web implies a KA strategy using crowdsourcing –  Encoder = DBA or webmaster SME, with minimal training and interaction –  Huge numbers of authors and the web as a global publication fabric –  KR with low expressive power = RDF or (sometimes) OWL –  A set of tools and syntaxes –  Modeling a domain of straightforward facts Is crowdsourced KA via semantic web likely to succeed? Is pay-as-you-go integration likely to work? 3
  • 4. Is embedded semantic markup a promising KA strategy? n  Can crowdsourced KA via page-embedded semantic markup succeed? –  The original Semantic Web use case –  Combined structured and unstructured knowledge in the same place, with (hopefully) synchronized update –  Machines could “read the web” without NLP –  Best incentive ended up being to support SEO and publishing social data n  The situation for powerful page-based semantic markup is not hopeful –  Despite 430M web pages with RDFa, Google said at SemTech that webmaster authoring in RDFa was too difficult, and this is probably right –  Facebook said that >10% of OG markup is syntactically incorrect or incoherent n  The KA answer so far: Schema.org and Facebook OG –  Data publishers are required to use a single common global ontology and vocabulary –  Formal KR complexity is almost the lowest possible –  Lowering the bar to achieve success Clay Shirkey will be shown wrong… … but it is difficult to take much satisfaction in this 4
  • 5. Is Linked Data a Promising KA Crowdsourcing Strategy? n  Can crowdsourced KA via Linked Data succeed? –  “Evolved” Semantic Web use case –  Best incentive ended up being sharing and (government) data distribution n  The situation for Linked Data markup is more hopeful –  Much more productive, yielding 10s of billions of semantic assertions –  Many organizations are successfully publishing Linked Data –  Overall semantic cohesion will increase as more data is mapped together n  The KA answer so far: faith in Pay-As-You-Go (PAYGO) –  Building from a core set of known vocabularies and ontologies –  PAYGO should yield a set of evolving, partial agreements on semantic models and terminology –  What is the incentive to create and maintain high-quality PAYGO models? –  There should be a business here… but no one has found it yet. The community is better at publishing data than integrating it 5
  • 6. How Can We Make PAYGO Succeed? n  Is the PAYGO authorship just the familiar KA problem? –  PAYGO implies distributed authoring and managing a set of useful integration mappings –  Data integration is much harder than just asserting links n  PAYGO experiences –  How do companies succeed at “traditional” database PAYGO integration? –  Powerful commercial reasons to differentiate products (brands, trademarks, etc.), so there are very high costs in creating product mappings –  Achieving PAYGO in neuroscience has been exceptionally difficult n  Mobile-social is next big tranche of Semantic Web data –  How can PAYGO work here given the fuzziness of the data collection? –  Can we use machine learning for PAYGO in this area? What PAYGO lessons can Semantic Web researchers learn from the database integration community? 6
  • 7. Can Semantic Web Crowdsourced KA Yield Useful KBs? n  The broad KA war is over and we won (sort of) –  RDF/OWL is the most important AI KR system on the planet –  Semantic Web can create communities of engaged data publishers who take ownership of the integrated data –  PAYGO and Linked Data is the vision for cost-effective KA and maintenance of specialized content n  Is the semantic web KB technically useful? –  Data is often impossible to cache –  Data at this scale always includes significant percentage of mistakes –  Schema.org and Facebook OG markup has a very weak semantics n  Responding to Watson: Most human questions are not precise –  Replacement of deduction by evidence gathering in large data sets –  “Popular shopping areas in London” Mark’s Challenge to the EC: Continue on the LarKC Vision! 7
  • 8. Thank You Disclaimer: The preceding slides represent the views of the author only. Any brands, logos and products are trademarks or registered trademarks of their respective companies. 8