SlideShare uma empresa Scribd logo
1 de 14
Baixar para ler offline
You	
  rang,	
  M’LOD?	
                                                        ì	
  
Google	
  Refine	
  in	
  the	
  world	
  of	
  LOD	
  




Mateja	
  Verlic	
  




                       Seman/c	
  Tech	
  &	
  Business	
  Conference	
  
                          June	
  3-­‐7,	
  2012	
  |	
  San	
  Francisco	
  
2	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
     June	
  7,	
  2012	
  
3	
  




                                                                                     Google	
  Refine	
  

                                   ì  What	
  we’ve	
  seen	
  so	
  far	
  
                                       ì  Messy	
  data	
  gone	
  clean	
  
                                              ì  Filtering,	
  faceted	
  browsing	
  
                                              ì  Edi/ng	
  cells	
  and	
  columns,	
  clustering,	
  expor/ng	
  
                                              ì  Bulk	
  transforma/ons	
  
                                              ì  History	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                          June	
  7,	
  2012	
  
4	
  




                                                …	
  and	
  the	
  powerful	
  dark	
  side	
  

                                   ì  Reconcilia/on	
  

                                   ì  Extending	
  data	
  	
  

                                   ì  Regular	
  expressions	
  

                                   ì  Integrated	
  GREL	
  commands	
  

                                   ì  Jython	
  

                                   ì  Extensions	
  (actually	
  not	
  so	
  many	
  of	
  them)	
  


©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                 June	
  7,	
  2012	
  
5	
  




                                                                                                                      LOD2	
  

                                   ì  Crea/ng	
  knowledge	
  out	
  of	
  Interlinked	
  Data	
  

                                   ì  EU	
  FP7	
  project	
  

                                   ì  15	
  partners	
  

                                   ì  LOD2	
  in	
  a	
  	
  	
  	
  	
  	
  	
  	
  	
  :	
  	
  
                                              LOD2	
  is	
  like	
  Batman	
  and	
  Robin:	
  business	
  and	
  academics	
  
                                              -­‐	
  mighty	
  tools	
  and	
  a	
  bunch	
  of	
  real	
  &	
  good	
  use	
  cases	
  
                                              supported	
  by	
  science	
  for	
  success.	
  

                                                                                                      	
  

©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                                       June	
  7,	
  2012	
  
6	
  




                                                                             Linked	
  Open	
  Data	
  

                                   ì  Distributed	
  data,	
  different	
  sources,	
  formats	
  

                                   ì  Open	
  Government	
  data	
  

                                   ì  Open	
  Data	
  Business	
  &	
  Business	
  of	
  Open	
  Data	
  

                                   ì  CKAN	
  



                                   ì  LOD2:	
  hap://www.lod2.eu	
  

                                   	
  
©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                     June	
  7,	
  2012	
  
7	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
     June	
  7,	
  2012	
  
8	
  




                                                                                               LODGrefine	
  

                                   ì  LOD-­‐friendly	
  GoogleRefine	
  extensions	
  


                                              ì  RDF	
  extension	
  
                                              	
         hap://lab.linkeddata.deri.ie/2010/grefine-­‐rdf-­‐extension/	
  
                                              	
  
                                              ì  DBpedia	
  extension	
  	
  


                                   ì  LODGrefine:	
  Google	
  Refine	
  +	
  integrated	
  extensions	
  


©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                               June	
  7,	
  2012	
  
9	
  




                                                                             LODGrefine	
  toolbox	
  

                                   ì  Google	
  Refine	
  func/onali/es	
  +	
  	
  
                                       ì  Registering	
  reconcilia/on	
  service	
  based	
  on	
  a	
  SPARQL	
  
                                           endpoint,	
  RDF	
  dump	
  or	
  Sindice	
  search	
  
                                       ì  RDF	
  Export	
  
                                              ì  Extending	
  reconciled	
  column	
  with	
  data	
  from	
  
                                                  DBpedia	
  	
  
                                              ì  Extrac/ng	
  en//es	
  from	
  full	
  text	
  using	
  Zemanta	
  API	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                             June	
  7,	
  2012	
  
10	
  




                                                                             Mechanical	
  Future	
  

                                   ì  Integra/on	
  with	
  Amazon	
  Mechanical	
  Turk?	
  

                                   ì  Leveraging	
  crowd	
  intelligence	
  

                                   ì  Do	
  Workers	
  dream	
  reconciled	
  data?	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                         June	
  7,	
  2012	
  
11	
  




                                                                                 Short	
  Summary	
  

                                   ì  Google	
  Refine	
  -­‐	
  what	
  we	
  had	
  

                                   ì  LOD	
  1st	
  class	
  ci/zen	
  in	
  GR	
  -­‐	
  what	
  we	
  wanted	
  

                                   ì  Google	
  Refine	
  extension(s)	
  -­‐	
  what	
  we	
  did	
  

                                   ì  LODGrefine	
  -­‐	
  what	
  we	
  have	
  

                                   ì  Mechanical	
  future	
  –	
  what	
  (we	
  think)	
  we	
  want	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                              June	
  7,	
  2012	
  
12	
  




                                                                                                      Demo	
  #1	
  

                                   ì  Pimp	
  my	
  data	
  under	
  10	
  minutes:	
  A	
  showcase	
  how	
  
                                              to	
  convert	
  data	
  from	
  a	
  website	
  into	
  a	
  linked	
  dataset	
  
                                              under	
  10	
  minutes.	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                                June	
  7,	
  2012	
  
13	
  




                                                                                                    Demo	
  #2	
  

                                   ì  Yes,	
  we	
  C(K)AN:	
  Conver/ng	
  one	
  of	
  the	
  CKAN	
  Open	
  
                                              Data	
  datasets	
  into	
  a	
  LOD	
  dataset	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                     June	
  7,	
  2012	
  
14	
  




                                                                                Thank	
  you	
  

                                   Mateja	
  Verlic	
  

                                   E-­‐mail:	
  mateja.verlic@zemanta.com	
  

                                   Web:	
  hap://www-­‐zemanta.com	
  

                                   LODGrefine:	
  hap://code.zemanta.com/sparkica	
  

                                   Twiaer:	
  @sparkica	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                  June	
  7,	
  2012	
  

Mais conteúdo relacionado

Semelhante a You rang, M’LOD? Google Refine in the world of LOD

Productivity Future Vision
Productivity Future VisionProductivity Future Vision
Productivity Future VisionMicro Focus SRL
 
Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012Advocate Consulting
 
Building Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph HughesBuilding Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph HughesKalido
 
Benchmark METRICS THAT MATTER October 4 2012
Benchmark METRICS THAT MATTER October 4 2012Benchmark METRICS THAT MATTER October 4 2012
Benchmark METRICS THAT MATTER October 4 2012BenchmarkQA
 
ORCID Outreach Meeting dev breakout session
ORCID Outreach Meeting dev breakout sessionORCID Outreach Meeting dev breakout session
ORCID Outreach Meeting dev breakout sessionGudmundur Thorisson
 
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
EDF2012   Chris Taggart - How the biggest Open Database of Companies was builtEDF2012   Chris Taggart - How the biggest Open Database of Companies was built
EDF2012 Chris Taggart - How the biggest Open Database of Companies was builtEuropean Data Forum
 
Virtual Worlds: A Future History
Virtual Worlds: A Future HistoryVirtual Worlds: A Future History
Virtual Worlds: A Future HistoryRobin Teigland
 
Ashnik corporate presentation Dec 2012
Ashnik corporate presentation Dec 2012Ashnik corporate presentation Dec 2012
Ashnik corporate presentation Dec 2012Sachin Dabir
 
SMX Landing Page Optimization
SMX Landing Page OptimizationSMX Landing Page Optimization
SMX Landing Page OptimizationDatalicious
 
True Drivers of MDM webinar
True Drivers of MDM webinarTrue Drivers of MDM webinar
True Drivers of MDM webinarKalido
 
Who’s using my apps
Who’s using my appsWho’s using my apps
Who’s using my appsbartlannoeye
 
2012 - 2013 bulk ieee projects for sale
2012 - 2013 bulk ieee projects for sale2012 - 2013 bulk ieee projects for sale
2012 - 2013 bulk ieee projects for saleJPINFOTECH JAYAPRAKASH
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project GuidanceVarad Meru
 
Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3Yamika Mehra
 
Pal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integrationPal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integrationMustafa Jarrar
 
3 Jahre OGD In Österreich - eine Billanz
3 Jahre OGD In Österreich - eine Billanz3 Jahre OGD In Österreich - eine Billanz
3 Jahre OGD In Österreich - eine BillanzOpen Knowledge Austria
 

Semelhante a You rang, M’LOD? Google Refine in the world of LOD (20)

Productivity Future Vision
Productivity Future VisionProductivity Future Vision
Productivity Future Vision
 
Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012
 
JIST 2012
JIST 2012JIST 2012
JIST 2012
 
Building Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph HughesBuilding Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph Hughes
 
Benchmark METRICS THAT MATTER October 4 2012
Benchmark METRICS THAT MATTER October 4 2012Benchmark METRICS THAT MATTER October 4 2012
Benchmark METRICS THAT MATTER October 4 2012
 
ORCID Outreach Meeting dev breakout session
ORCID Outreach Meeting dev breakout sessionORCID Outreach Meeting dev breakout session
ORCID Outreach Meeting dev breakout session
 
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
EDF2012   Chris Taggart - How the biggest Open Database of Companies was builtEDF2012   Chris Taggart - How the biggest Open Database of Companies was built
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
 
Virtual Worlds: A Future History
Virtual Worlds: A Future HistoryVirtual Worlds: A Future History
Virtual Worlds: A Future History
 
Ashnik corporate presentation Dec 2012
Ashnik corporate presentation Dec 2012Ashnik corporate presentation Dec 2012
Ashnik corporate presentation Dec 2012
 
SMX Landing Page Optimization
SMX Landing Page OptimizationSMX Landing Page Optimization
SMX Landing Page Optimization
 
True Drivers of MDM webinar
True Drivers of MDM webinarTrue Drivers of MDM webinar
True Drivers of MDM webinar
 
Who’s using my apps
Who’s using my appsWho’s using my apps
Who’s using my apps
 
EDF2012 Nuria de Lama - BIG
EDF2012   Nuria de Lama - BIGEDF2012   Nuria de Lama - BIG
EDF2012 Nuria de Lama - BIG
 
2012 - 2013 bulk ieee projects for sale
2012 - 2013 bulk ieee projects for sale2012 - 2013 bulk ieee projects for sale
2012 - 2013 bulk ieee projects for sale
 
2012-2013 IEEE PROJECT TITLES
2012-2013 IEEE PROJECT TITLES2012-2013 IEEE PROJECT TITLES
2012-2013 IEEE PROJECT TITLES
 
Lod2
Lod2Lod2
Lod2
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
 
Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3
 
Pal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integrationPal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integration
 
3 Jahre OGD In Österreich - eine Billanz
3 Jahre OGD In Österreich - eine Billanz3 Jahre OGD In Österreich - eine Billanz
3 Jahre OGD In Österreich - eine Billanz
 

Último

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Último (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

You rang, M’LOD? Google Refine in the world of LOD

  • 1. You  rang,  M’LOD?   ì   Google  Refine  in  the  world  of  LOD   Mateja  Verlic   Seman/c  Tech  &  Business  Conference   June  3-­‐7,  2012  |  San  Francisco  
  • 2. 2   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 3. 3   Google  Refine   ì  What  we’ve  seen  so  far   ì  Messy  data  gone  clean   ì  Filtering,  faceted  browsing   ì  Edi/ng  cells  and  columns,  clustering,  expor/ng   ì  Bulk  transforma/ons   ì  History   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 4. 4   …  and  the  powerful  dark  side   ì  Reconcilia/on   ì  Extending  data     ì  Regular  expressions   ì  Integrated  GREL  commands   ì  Jython   ì  Extensions  (actually  not  so  many  of  them)   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 5. 5   LOD2   ì  Crea/ng  knowledge  out  of  Interlinked  Data   ì  EU  FP7  project   ì  15  partners   ì  LOD2  in  a                  :     LOD2  is  like  Batman  and  Robin:  business  and  academics   -­‐  mighty  tools  and  a  bunch  of  real  &  good  use  cases   supported  by  science  for  success.     ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 6. 6   Linked  Open  Data   ì  Distributed  data,  different  sources,  formats   ì  Open  Government  data   ì  Open  Data  Business  &  Business  of  Open  Data   ì  CKAN   ì  LOD2:  hap://www.lod2.eu     ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 7. 7   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 8. 8   LODGrefine   ì  LOD-­‐friendly  GoogleRefine  extensions   ì  RDF  extension     hap://lab.linkeddata.deri.ie/2010/grefine-­‐rdf-­‐extension/     ì  DBpedia  extension     ì  LODGrefine:  Google  Refine  +  integrated  extensions   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 9. 9   LODGrefine  toolbox   ì  Google  Refine  func/onali/es  +     ì  Registering  reconcilia/on  service  based  on  a  SPARQL   endpoint,  RDF  dump  or  Sindice  search   ì  RDF  Export   ì  Extending  reconciled  column  with  data  from   DBpedia     ì  Extrac/ng  en//es  from  full  text  using  Zemanta  API   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 10. 10   Mechanical  Future   ì  Integra/on  with  Amazon  Mechanical  Turk?   ì  Leveraging  crowd  intelligence   ì  Do  Workers  dream  reconciled  data?   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 11. 11   Short  Summary   ì  Google  Refine  -­‐  what  we  had   ì  LOD  1st  class  ci/zen  in  GR  -­‐  what  we  wanted   ì  Google  Refine  extension(s)  -­‐  what  we  did   ì  LODGrefine  -­‐  what  we  have   ì  Mechanical  future  –  what  (we  think)  we  want   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 12. 12   Demo  #1   ì  Pimp  my  data  under  10  minutes:  A  showcase  how   to  convert  data  from  a  website  into  a  linked  dataset   under  10  minutes.   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 13. 13   Demo  #2   ì  Yes,  we  C(K)AN:  Conver/ng  one  of  the  CKAN  Open   Data  datasets  into  a  LOD  dataset   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 14. 14   Thank  you   Mateja  Verlic   E-­‐mail:  mateja.verlic@zemanta.com   Web:  hap://www-­‐zemanta.com   LODGrefine:  hap://code.zemanta.com/sparkica   Twiaer:  @sparkica   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012