SlideShare uma empresa Scribd logo
1 de 7
Multilingual Scraping fromOpen Dutch Government Data Open Data Day Hackathon Ireland DERI & 091 labs Galway, 4 Dec 2010 Tobias Wunner
Dutch open government data 3 websites same data but multilingual
Dutch Spending Data  Javascript Website Pixel Graphic in PDF
Dutch Spending Data  Website Pixel Graphic in PDF DIFFICULT!
Scrape multilingual concepts ,[object Object]
 concept hierarchy“International items”@en “Internationale conjunctur”@nl super concept “Long-term interest rate”@en “Lange Rente”@nl
Scrape multilingual concepts ,[object Object]

Mais conteúdo relacionado

Mais de Tobias Wunner

Deri in three ontology-lexicons for fact extraction
Deri in three   ontology-lexicons for fact extractionDeri in three   ontology-lexicons for fact extraction
Deri in three ontology-lexicons for fact extractionTobias Wunner
 
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...Tobias Wunner
 
Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Tobias Wunner
 
Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...Tobias Wunner
 
Ontology-based information extraction in the DERI Reading Group
Ontology-based information extraction in the DERI Reading GroupOntology-based information extraction in the DERI Reading Group
Ontology-based information extraction in the DERI Reading GroupTobias Wunner
 
Semantic, terminological and linguistic analysis of xbrl
Semantic, terminological and linguistic analysis of xbrlSemantic, terminological and linguistic analysis of xbrl
Semantic, terminological and linguistic analysis of xbrlTobias Wunner
 

Mais de Tobias Wunner (6)

Deri in three ontology-lexicons for fact extraction
Deri in three   ontology-lexicons for fact extractionDeri in three   ontology-lexicons for fact extraction
Deri in three ontology-lexicons for fact extraction
 
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
 
Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1
 
Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...
 
Ontology-based information extraction in the DERI Reading Group
Ontology-based information extraction in the DERI Reading GroupOntology-based information extraction in the DERI Reading Group
Ontology-based information extraction in the DERI Reading Group
 
Semantic, terminological and linguistic analysis of xbrl
Semantic, terminological and linguistic analysis of xbrlSemantic, terminological and linguistic analysis of xbrl
Semantic, terminological and linguistic analysis of xbrl
 

Multilingual scraping from dutch government data