SlideShare uma empresa Scribd logo
1 de 14
Baixar para ler offline
The OpenCalais Web Service & Open API

                          Krista Thomas
Introducing OpenCalais

• A Thomson Reuters initiative to connect all the world’s
  business-relevant content.
• A free service that brings new efficiencies and
  productivity to publishers and content curators.
• The fastest, easiest way to categorize your content, and
  tag the entities, facts and events therein.
• Progress since Feb., 2008:
      • 18,000 developers
      • 20+ publishers using OpenCalais
      • 50+ cool new apps and services created
      • 4+ million documents per day processed
Free Metadata Generation
                           1.   You feed your content into our
                                extraction engine
                           2.   It categorizes the stories; finds
                                the people, places, companies,
                                facts and events, and then
                                returns that metadata to you
                           3.   Along with the metadata, it
                                returns links to free data on the
                                open Web (i.e. Wikipedia, CIA
                                World Fact book, IMDB, etc.)
                           4.   You use the metadata to
                                streamline content ops, enhance
                                your content, create topic hubs
                                on the fly, improve search, etc.
Live Demo:
 http://viewer.opencalais.com
1. Cut and paste a business news story into the viewer,
   and hit submit.
2. View the semantic markup (hover over underlined
   items to see relevance, for instance).
3. Expand the extracted entities, facts and events on
   the left hand rail.
4. Click on one of the companies in the list on the left,
   to view the OpenCalais / Thomson Reuters asset on
   that company in the Linked Data cloud.
5. Click the ‘SameAs’ links at the bottom to find more
   data on the Linked Data cloud.
How Metadata Connects You to the Open Web




                           NEW!


                                      NEW!




  The Linked Data Cloud – December, 2008
Linked Data Cloud as of July, 2009
Your Content & The OpenCalais Process
                                                                                          5
                                      Metadata     3                    Which provides
                                                                        information and
                1                    returned to
                                       the user                           other Linked
   Unstructur                         with keys                           Data pointers
    ed Text




                                                          Keys
                                                        provide
                                                                    4
                                                       access to
                                                       the Calais
                      Calais     2                       Linked
                                                       Data cloud
                     extracts
                     entities,                                                        To a range of open
                                                                                                           6
                    facts and                                                         and partner Linked
                      events                                                             data assets,
                                                                                           including
                                                                                      Thomson Reuters
OpenCalais mainstream adoption




                                 8
OpenCalais mainstream adoption




                                 9
Early Adopters


•   Aggregate & organize content in new ways.
•   Automatically produce topic-based sites.
•   Improve search functionality.
•   Generate better content recommendations.
•   Publish reviews, articles & blog posts for programmatic use on the open Web



                                                     • Content Triage
                                                     • Hyper-local news


                                                     • Contextual Ad Placement
New Publishers to tap OpenCalais include
• The New Republic: The new TNR.com uses OpenPublish, an
   OpenCalais-enabled Drupal-powered CMS to increase editorial productivity
   & drive reader engagement.

• Al Jazeera English’s new blogging network: uses
   OpenCalais for content operations & tagging; features Al Jazeera
   correspondents from around the world.

• Slate Magazine’s News Dots Network: visualizes the
   most recent topics in the news as a concise network of related topics.

• I *heart* Sea: a hyper-local news aggregation site that collects some
   of the best blogs in Seattle, especially those serving the Capitol Hill area.
Media Monitoring and Intelligence Tools
• Meltwater: a rapidly growing SaaS-based provider in the Corporate IR
   & PR Services

• Tattler (app): an open source topic monitoring tool for today's Web.
   Tattler finds and aggregates content from the Web on topics users ask it to
   monitor.

• Interceder: a social media monitoring tool that makes it easy to track
   trending topics and search the latest content from major news Web sites,
   blogs, Twitter and YouTube.

• AskJot: a tool for analyzing web pages for keywords, and displaying
   them as links to search results from services around the Web.
New Content Experiences / Open Research
• Feedly: a Firefox plug-in that brings user-selected inputs from Google
   Reader, Twitter, RSS feeds, etc. in an easy-to-read magazine-style format.

• OpenPublish: a new CMS based on Drupal that integrates
   OpenCalais from the ground up, OpenPublish is tailored to the needs of
   today's online publishers & media providers.

• DocumentCloud: founded by reporters from The NYT and
   ProPublica, and funded by the Knight Foundation, DocumentCloud will offer
   public access to news reporters’ original source materials.

• MediaCloud:an open research tool from Harvard’s Berkman Center
   that aggregates mainstream media and blogs to enable researchers to
   identify how and where news coverage starts, what we’re missing, etc.
Why Thomson Reuters Cares

• Its mission is to connect all the world’s business-
  relevant content to provide professionals with ‘intelligent
  information.’
• The days of surviving
  as a ‘walled garden’ of
  content are over.
• ‘Crowdsourcing’ Q&A
  creates faster, better,
  stronger software.

Mais conteúdo relacionado

Mais procurados

Óscar Méndez - Big data: de la investigación científica a la gestión empresarial
Óscar Méndez - Big data: de la investigación científica a la gestión empresarialÓscar Méndez - Big data: de la investigación científica a la gestión empresarial
Óscar Méndez - Big data: de la investigación científica a la gestión empresarialFundación Ramón Areces
 
Pm shandilya-s-wcodew-web-methodology
Pm shandilya-s-wcodew-web-methodologyPm shandilya-s-wcodew-web-methodology
Pm shandilya-s-wcodew-web-methodologyprashant mishra
 
Oxford Seo.Com Presentation
Oxford Seo.Com PresentationOxford Seo.Com Presentation
Oxford Seo.Com PresentationIgorgold
 

Mais procurados (6)

Affinity micro data-infograph
Affinity micro data-infographAffinity micro data-infograph
Affinity micro data-infograph
 
A Visual Tour of Quintly for Social Media Analytics
A Visual Tour of Quintly for Social Media AnalyticsA Visual Tour of Quintly for Social Media Analytics
A Visual Tour of Quintly for Social Media Analytics
 
Óscar Méndez - Big data: de la investigación científica a la gestión empresarial
Óscar Méndez - Big data: de la investigación científica a la gestión empresarialÓscar Méndez - Big data: de la investigación científica a la gestión empresarial
Óscar Méndez - Big data: de la investigación científica a la gestión empresarial
 
Pm shandilya-s-wcodew-web-methodology
Pm shandilya-s-wcodew-web-methodologyPm shandilya-s-wcodew-web-methodology
Pm shandilya-s-wcodew-web-methodology
 
Search engine
Search engineSearch engine
Search engine
 
Oxford Seo.Com Presentation
Oxford Seo.Com PresentationOxford Seo.Com Presentation
Oxford Seo.Com Presentation
 

Semelhante a San diego

Semantically enriching content using OpenCalais
Semantically enriching content using OpenCalaisSemantically enriching content using OpenCalais
Semantically enriching content using OpenCalaisMarius Butuc
 
(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijek(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijekRatko Mutavdzic
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13Kristi Holmes
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?Robert Grossman
 
Cloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overviewCloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overviewNikesh Narayanan
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutionsOpen Data Support
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Big Data Technologies.pdf
Big Data Technologies.pdfBig Data Technologies.pdf
Big Data Technologies.pdfRAHULRAHU8
 
Linked open data project
Linked open data projectLinked open data project
Linked open data projectFaathima Fayaza
 
OpenNASA v2.0 Slideshare Large File
OpenNASA v2.0 Slideshare   Large FileOpenNASA v2.0 Slideshare   Large File
OpenNASA v2.0 Slideshare Large FileMegan Eskey
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesJeff Kelly
 
Simple OpenCalais Whitepaper
Simple OpenCalais WhitepaperSimple OpenCalais Whitepaper
Simple OpenCalais WhitepaperKrista Thomas
 
The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...DataWorks Summit
 
Open Calais Release 4.0
Open Calais Release 4.0Open Calais Release 4.0
Open Calais Release 4.0Krista Thomas
 
First they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedFirst they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedRensselaer Polytechnic Institute
 
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Steven De Costa
 

Semelhante a San diego (20)

Semantically enriching content using OpenCalais
Semantically enriching content using OpenCalaisSemantically enriching content using OpenCalais
Semantically enriching content using OpenCalais
 
(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijek(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijek
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 
PlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web ScalePlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web Scale
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
Cloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overviewCloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overview
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Big Data Technologies.pdf
Big Data Technologies.pdfBig Data Technologies.pdf
Big Data Technologies.pdf
 
Linked open data project
Linked open data projectLinked open data project
Linked open data project
 
OpenNASA v2.0 Slideshare Large File
OpenNASA v2.0 Slideshare   Large FileOpenNASA v2.0 Slideshare   Large File
OpenNASA v2.0 Slideshare Large File
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use cases
 
Publisher whitepaper
Publisher whitepaperPublisher whitepaper
Publisher whitepaper
 
Simple OpenCalais Whitepaper
Simple OpenCalais WhitepaperSimple OpenCalais Whitepaper
Simple OpenCalais Whitepaper
 
The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...
 
Open Calais Release 4.0
Open Calais Release 4.0Open Calais Release 4.0
Open Calais Release 4.0
 
First they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedFirst they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and Used
 
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
 

Mais de Krista Thomas

The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010Krista Thomas
 
OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilKrista Thomas
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09Krista Thomas
 
Open Calais @ Transparent Text
Open Calais @ Transparent TextOpen Calais @ Transparent Text
Open Calais @ Transparent TextKrista Thomas
 
Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Krista Thomas
 
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Krista Thomas
 
Open Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsOpen Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsKrista Thomas
 
Calais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web MeetupCalais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web MeetupKrista Thomas
 
Final Calais For ONA
Final Calais For ONAFinal Calais For ONA
Final Calais For ONAKrista Thomas
 

Mais de Krista Thomas (11)

Ad.ly Introduction
Ad.ly IntroductionAd.ly Introduction
Ad.ly Introduction
 
San diego
San diegoSan diego
San diego
 
The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010
 
OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry Council
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
 
Open Calais @ Transparent Text
Open Calais @ Transparent TextOpen Calais @ Transparent Text
Open Calais @ Transparent Text
 
Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Tague Semtech Keynote 2009
Tague Semtech Keynote 2009
 
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
 
Open Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsOpen Calais For SF And LA Meetups
Open Calais For SF And LA Meetups
 
Calais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web MeetupCalais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web Meetup
 
Final Calais For ONA
Final Calais For ONAFinal Calais For ONA
Final Calais For ONA
 

San diego

  • 1. The OpenCalais Web Service & Open API Krista Thomas
  • 2. Introducing OpenCalais • A Thomson Reuters initiative to connect all the world’s business-relevant content. • A free service that brings new efficiencies and productivity to publishers and content curators. • The fastest, easiest way to categorize your content, and tag the entities, facts and events therein. • Progress since Feb., 2008: • 18,000 developers • 20+ publishers using OpenCalais • 50+ cool new apps and services created • 4+ million documents per day processed
  • 3. Free Metadata Generation 1. You feed your content into our extraction engine 2. It categorizes the stories; finds the people, places, companies, facts and events, and then returns that metadata to you 3. Along with the metadata, it returns links to free data on the open Web (i.e. Wikipedia, CIA World Fact book, IMDB, etc.) 4. You use the metadata to streamline content ops, enhance your content, create topic hubs on the fly, improve search, etc.
  • 4. Live Demo: http://viewer.opencalais.com 1. Cut and paste a business news story into the viewer, and hit submit. 2. View the semantic markup (hover over underlined items to see relevance, for instance). 3. Expand the extracted entities, facts and events on the left hand rail. 4. Click on one of the companies in the list on the left, to view the OpenCalais / Thomson Reuters asset on that company in the Linked Data cloud. 5. Click the ‘SameAs’ links at the bottom to find more data on the Linked Data cloud.
  • 5. How Metadata Connects You to the Open Web NEW! NEW! The Linked Data Cloud – December, 2008
  • 6. Linked Data Cloud as of July, 2009
  • 7. Your Content & The OpenCalais Process 5 Metadata 3 Which provides information and 1 returned to the user other Linked Unstructur with keys Data pointers ed Text Keys provide 4 access to the Calais Calais 2 Linked Data cloud extracts entities, To a range of open 6 facts and and partner Linked events data assets, including Thomson Reuters
  • 10. Early Adopters • Aggregate & organize content in new ways. • Automatically produce topic-based sites. • Improve search functionality. • Generate better content recommendations. • Publish reviews, articles & blog posts for programmatic use on the open Web • Content Triage • Hyper-local news • Contextual Ad Placement
  • 11. New Publishers to tap OpenCalais include • The New Republic: The new TNR.com uses OpenPublish, an OpenCalais-enabled Drupal-powered CMS to increase editorial productivity & drive reader engagement. • Al Jazeera English’s new blogging network: uses OpenCalais for content operations & tagging; features Al Jazeera correspondents from around the world. • Slate Magazine’s News Dots Network: visualizes the most recent topics in the news as a concise network of related topics. • I *heart* Sea: a hyper-local news aggregation site that collects some of the best blogs in Seattle, especially those serving the Capitol Hill area.
  • 12. Media Monitoring and Intelligence Tools • Meltwater: a rapidly growing SaaS-based provider in the Corporate IR & PR Services • Tattler (app): an open source topic monitoring tool for today's Web. Tattler finds and aggregates content from the Web on topics users ask it to monitor. • Interceder: a social media monitoring tool that makes it easy to track trending topics and search the latest content from major news Web sites, blogs, Twitter and YouTube. • AskJot: a tool for analyzing web pages for keywords, and displaying them as links to search results from services around the Web.
  • 13. New Content Experiences / Open Research • Feedly: a Firefox plug-in that brings user-selected inputs from Google Reader, Twitter, RSS feeds, etc. in an easy-to-read magazine-style format. • OpenPublish: a new CMS based on Drupal that integrates OpenCalais from the ground up, OpenPublish is tailored to the needs of today's online publishers & media providers. • DocumentCloud: founded by reporters from The NYT and ProPublica, and funded by the Knight Foundation, DocumentCloud will offer public access to news reporters’ original source materials. • MediaCloud:an open research tool from Harvard’s Berkman Center that aggregates mainstream media and blogs to enable researchers to identify how and where news coverage starts, what we’re missing, etc.
  • 14. Why Thomson Reuters Cares • Its mission is to connect all the world’s business- relevant content to provide professionals with ‘intelligent information.’ • The days of surviving as a ‘walled garden’ of content are over. • ‘Crowdsourcing’ Q&A creates faster, better, stronger software.