Ontotext Cultural Heritage and Digital Humanities Projects
1. Ontotext CH/DH Projects
(A lot more details in , 2016-09, 130 slides)
Vladimir Alexiev, PhD, PMP
2018-06-13, CLADA-BG Meeting, So a
LOD for CH webinar
2. About Ontotext
Founded 2000, part of (400 people, , part of SOFIX), venture funding 2008Sirma Group BSE:SKK
65 people: 7 PhD, 30 MS, 20 BS, 6 university lecturers. O ces in So a, Varna, London
Core part of with focus on cognitive computingSirma Strategy 2022
Working on: semantic technologies, semantic repositories, semantic text analysis, machine learning
Semantic Graph Database: Ontotext GraphDB
Semantic data integration and building of Knowledge Graphs
Semantic text analysis: entity, concept, relation extraction, document classi cation
Recommendations, sentiment analysis
Machine learning: entity disambiguation, deep learning in graphs, etc
3. Current Projects:
Research Projects
EHRI2: European Holocaust Research Infrastructure (H2020 RI): CH
Evala: Congnitive and Semantic Links Analysis and Media Evaluation
Platform (EuroStars)
euBusinessGraph: Innovative Data Products and Services for Company
Data (H2020 BigData Experimentation)
COMPACT: From Research Through Policy on Social Media and
Convergence (H2020 CSA)
BigDataGrapes: BigData to Enable Global Disruption of the Grapevine-
Powered Industries (H2020 BigData Research)
CIMA: Company Intelligent Matching and Linking (BG OPC ISIS)
4. Research and Innovation Awards
Arguably, Ontotext is the most innovative Bulgarian software company.
We have more EU research projects than some universities combined
Innovative Enterprise of the Year 2017
EU Innovation Radar Prize 2016 nomination
BAIT Business Innovation Award 2014
Innovative Enterprise of the Year 2014
Washington Post “Destination Innovation” Competition 2014 Award
for most successful company in EU FP6 projectsPythagoras Award 2010
5. Industries and Clients
80% of our sales are in the UK and US
Media: BBC, UK Press Association, NL Press Association (NDP)…
Financial Info: S&P Global Platts, Euromoney, Financial Times, Nikkei…
STEM Publishing: IET, Oxford University Press, Wiley, Elsevier, Springer Nature…
Life Science: AstraZeneca, Novartis…
Government: UK Parliament, The National Archives, Natural Resources Canada…
Cultural Heritage and Digital Humanities (see next)
6. CH/DH Projects
ResearchSpace: British Museum, Yale Center for British Art. Largest museum collection, CIDOC CRM, semantic search…
(with Sirma Enterprise) ,ConservationSpace Sirma MuseumSpace
(VCMS) COST actionMedieval Cultures and Technological Resources
Europeana: , ( ), , , , 5 work groups,Creative Food and Drink sem app OAI PMH SPARQL members council Data Quality Committee
national aggegator: initiatorBulgariana
and helping on Getty Museum LODGetty Research Institute: vocabularies LOD
Carnegie Hall LOD
consulting: 14 US museums integrating data using CIDOC CRMAmerican Art Collaborative
: semantic archive integration. 4+4 years, heading towards ERICEuropean Holocaust Research Infrastructure
consulting (national aggregator moving to LOD)Canadian Heritage Information Network
Wikidata: frequent contributions (authority control)
DBpedia: contributions, association member, data quality and ontology committee
CLADA BG: key participant in both CLARIN (NLP) and DARIAH (CH/DH)
7. Knowledge Graphs
Knowledge Graph Year M obj B triples
British Museum 2013 2 0.92
Polish Digital Library 2013 3.1 0.53
Europeana 2014 20.3 3.8
FactForge 2006-now ~14 3.2
LinkedLifeData 2008-now ~12 10.2
Company Graph 2017-now 6 3
Dun & Bradstreet 2017 210 30
Details about the rst 5 are in V.Alexiev et al, ,
Workshop Practical Experiences with CIDOC CRM and its Extensions (CRMEX), TPDL 2013, slide 17
Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM)
9. ConservationSpace
Line-of-business application for conservation specialists. International consortium (US NGA, DK SMK, UK Courtauld etc).
Based on the and , Ontotext helped with the ontologies.Sirma Enterprise Platform Ontotext GraphDB
10. MuseumSpace
Based on the Sirma Enterprise Platform and ConservationSpace experience. Collections, exhibitions, curation…
11. Virtual Center for Medieval Studies
(VCMS) COST action. FET proposals for medieval lexicography, historic
research, Virtual Research Environments
Medieval Cultures and Technological Resources
12. , and servers for Europeana (part of Europeana Labs).
Europeana Creative
OAI PMH SPARQL
23. EHRI Camps and Ghettos
Integrating Camps and Ghettos info between EHRI and Wikidata
24. Canadian Heritage Information Network
CA national aggregator is transitioning to LOD. 4 Consulting projects: environment scan, strategy, Artefacts Canada data
analysis, national authorities
26. Wikidata/DBpedia vs VIAF/GND; and Europeana
(Europeana Creative D2.4).
(GlamWiki 2015)
Name Data Sources for Semantic Enrichment Wikidata, a Target for Europeana’s Semantic Strategy