presentation from the 5th "EC Framework Programmes - funding opportunities" seminar organised by the Applied Research and Communications Fund
http://www.arcfund.net/arcartShow.php?id=16150
slides from our talk "Low-Cost Open Data as-a-service" from the Semantic Web Developers workshop of ESWC'2015 (full paper: http://ceur-ws.org/Vol-1361/paper7.pdf)
OWLIM@AWS - On-demand RDF Data Management in the CloudMarin Dimitrov
The document discusses OWLIM@AWS, which provides on-demand RDF data management in the Amazon Web Services cloud. It offers pay-as-you-go access to OWLIM semantic graph database software running on EC2 instances, without upfront hardware costs. Users can launch OWLIM AMIs on various EC2 instance types, attach EBS storage, and pay hourly rates. Future plans include additional regions, pricing options, and hosted datasets.
On-Demand RDF Graph Databases in the CloudMarin Dimitrov
slides from the S4 webinar "On-Demand RDF Graph Databases in the Cloud"
RDF database-as-a-service running on the Self-Service Semantic Suite (S4) platform: http://s4.ontotext.com
video recording of the talk is available at http://info.ontotext.com/on-demand-rdf-graph-database
The document summarizes the DataGraft Platform, an open data platform that provides an RDF database-as-a-service (DBaaS). The platform transforms tabular data into RDF and publishes linked data services instead of static datasets. It uses Amazon Web Services for its cloud architecture with Ontotext GraphDB as the RDF database engine running in Docker containers. The platform is designed to be elastic, highly available, cost efficient, and securely isolate multi-tenant databases. It provides a standards-compliant SPARQL endpoint and linked data interface that can be used with various third-party querying and visualization tools.
Enabling Low-cost Open Data Publishing and ReuseMarin Dimitrov
In the space of just a few years we’ve seen the transformational power of open data; both for transparency and accountability in public data, and efficiency and innovation with businesses in private data. In its first year, institutions and individuals throughout Europe have supported public sector bodies in releasing data and numerous start-ups, developers and SMEs in reusing this data for economic benefit.
However, we are still at the beginning of the open data movement, and there is still more that can be done to make open data simpler to use and to make it available to a wider audience.
The core goal of the DaPaaS project is to provide a Data- and Platform-as-a-Service environment, where 3rd parties (such as governmental organisations, SMEs, developers and larger companies) can publish and host both data sets and data-intensive applications, which can then be accessed by end-user applications in a cross-platform manner. You can find out more about DaPaaS on the detailed about page.
Essentially, DaPaaS aims to make publishing, consumption, and reuse of open data, as well as deploying open data applications, easier and cheaper for SMEs and small public bodies which otherwise may not have sufficient technical expertise, infrastructure and resources required to do so.
see also http://www.slideshare.net/eswcsummerschool/wed-roman-tutopendatapub-38742186
The document discusses Ontotext's Self-Service Semantic Suite (S4), which aims to address challenges customers face around unlocking insights from text and data, creating dynamic content, and integrating data sources. S4 provides semantic technology as a self-service set of pay-per-use services for text analytics, content enrichment, and metadata management using RDF graphs and ontologies. This approach aims to make semantic technology easier to adopt with lower costs and risks than traditional options.
overview of the RDF graph database-as-a-service (GraphDB based) on the Self-Service Semantic Suite (S4)
http://s4.ontotext.com
presentation for the AKSW Group of the University of Leipzig
Text Analytics & Linked Data Management As-a-ServiceMarin Dimitrov
slides from the talk on "Text Analytics & Linked Data Management As-a-Service with S4" from the ESWC'2015 workshop on Semantic Web Enterprise Adoption & Best Practices
full paper available at http://2015.wasabi-ws.org/papers/wasabi15_1.pdf
slides from our talk "Low-Cost Open Data as-a-service" from the Semantic Web Developers workshop of ESWC'2015 (full paper: http://ceur-ws.org/Vol-1361/paper7.pdf)
OWLIM@AWS - On-demand RDF Data Management in the CloudMarin Dimitrov
The document discusses OWLIM@AWS, which provides on-demand RDF data management in the Amazon Web Services cloud. It offers pay-as-you-go access to OWLIM semantic graph database software running on EC2 instances, without upfront hardware costs. Users can launch OWLIM AMIs on various EC2 instance types, attach EBS storage, and pay hourly rates. Future plans include additional regions, pricing options, and hosted datasets.
On-Demand RDF Graph Databases in the CloudMarin Dimitrov
slides from the S4 webinar "On-Demand RDF Graph Databases in the Cloud"
RDF database-as-a-service running on the Self-Service Semantic Suite (S4) platform: http://s4.ontotext.com
video recording of the talk is available at http://info.ontotext.com/on-demand-rdf-graph-database
The document summarizes the DataGraft Platform, an open data platform that provides an RDF database-as-a-service (DBaaS). The platform transforms tabular data into RDF and publishes linked data services instead of static datasets. It uses Amazon Web Services for its cloud architecture with Ontotext GraphDB as the RDF database engine running in Docker containers. The platform is designed to be elastic, highly available, cost efficient, and securely isolate multi-tenant databases. It provides a standards-compliant SPARQL endpoint and linked data interface that can be used with various third-party querying and visualization tools.
Enabling Low-cost Open Data Publishing and ReuseMarin Dimitrov
In the space of just a few years we’ve seen the transformational power of open data; both for transparency and accountability in public data, and efficiency and innovation with businesses in private data. In its first year, institutions and individuals throughout Europe have supported public sector bodies in releasing data and numerous start-ups, developers and SMEs in reusing this data for economic benefit.
However, we are still at the beginning of the open data movement, and there is still more that can be done to make open data simpler to use and to make it available to a wider audience.
The core goal of the DaPaaS project is to provide a Data- and Platform-as-a-Service environment, where 3rd parties (such as governmental organisations, SMEs, developers and larger companies) can publish and host both data sets and data-intensive applications, which can then be accessed by end-user applications in a cross-platform manner. You can find out more about DaPaaS on the detailed about page.
Essentially, DaPaaS aims to make publishing, consumption, and reuse of open data, as well as deploying open data applications, easier and cheaper for SMEs and small public bodies which otherwise may not have sufficient technical expertise, infrastructure and resources required to do so.
see also http://www.slideshare.net/eswcsummerschool/wed-roman-tutopendatapub-38742186
The document discusses Ontotext's Self-Service Semantic Suite (S4), which aims to address challenges customers face around unlocking insights from text and data, creating dynamic content, and integrating data sources. S4 provides semantic technology as a self-service set of pay-per-use services for text analytics, content enrichment, and metadata management using RDF graphs and ontologies. This approach aims to make semantic technology easier to adopt with lower costs and risks than traditional options.
overview of the RDF graph database-as-a-service (GraphDB based) on the Self-Service Semantic Suite (S4)
http://s4.ontotext.com
presentation for the AKSW Group of the University of Leipzig
Text Analytics & Linked Data Management As-a-ServiceMarin Dimitrov
slides from the talk on "Text Analytics & Linked Data Management As-a-Service with S4" from the ESWC'2015 workshop on Semantic Web Enterprise Adoption & Best Practices
full paper available at http://2015.wasabi-ws.org/papers/wasabi15_1.pdf
This document discusses Uber's growth and engineering challenges over time. It covers topics like Uber reaching 1 billion and 2 billion trips, microservices, tradeoffs between different programming languages, and tools used for building, deploying, and monitoring Uber's systems and services. The document also highlights advantages of various languages and technologies as well as Uber's open source projects that address common problems.
Delivering Linked Data Training to Data Science PractitionersMarin Dimitrov
Ontotext has provided Linked Data trainings to practitioners from various organizations to educate them on Linked Data and Semantic Web topics. They have learned that trainings need to (1) accommodate mixed audiences with different backgrounds and expertise, (2) use language tailored to each audience, and (3) strike a balance between theoretical foundations and practical applications. Ontotext also developed the EUCLID social media monitoring platform to identify trending topics in Linked Data for extending their training curriculum. The platform integrates and analyzes data from various social media sources to extract topics and visualize analytics.
Scaling to Millions of Concurrent SPARQL Queries on the CloudMarin Dimitrov
The document describes testing the scalability of OWLIM, a semantic database, on Amazon EC2 using a replication cluster approach. It found that:
- A 20 node cluster handled over 1 million SPARQL queries per hour, and a 100 node cluster handled 5 million queries per hour, demonstrating near-linear scalability.
- Cluster nodes maintained high performance, handling 2000-2300 queries per hour each even as the cluster size increased.
- The replication cluster approach distributed load well with low overhead, keeping CPU usage below 30% and network traffic below 0.1 MB/s for slave nodes.
Много често, когато искаме да станем по-добри backend програмисти се опитваме да научим различни езици за програмиране и съответните библиотеки. Проблема е че в Rails, Express.js, Django или Zend Framework има горе долу едни и същи концепции. Ако искаме да се научим как да пишем код за големи системи, които скалират добре и се справят сами с различни грешки и неочаквани ситуации, трябва да овладеем един друг дял от човешкото познание, който се нарича разпределени системи. В моята презентация ще видим защо трябва да задълбаем в тях и какви са основните принципи като консистентност(consistency), достъпност(availability) и издръжливост на разделения(partition tolerance). Също, ще разгледаме стъпки, които всеки може да направи за да научи повече по темата и да получава нови и актуални знания.
This document discusses Ontotext GraphDB connectors which allow users to perform complex SPARQL queries over RDF data by leveraging external engines like Elasticsearch, Solr, and Lucene. The connectors provide fast full-text search, faceted search, aggregations, and range queries through selective replication of RDF data to the external engines while synchronizing data and managing the connectors through SPARQL queries and updates. This enables users to get the benefits of SPARQL for graph pattern matching along with the advanced querying capabilities of systems like Elasticsearch without having to use a different query language.
Dec'2013 webinar from the EUCLID project on managing large volumes of Linked Data
webinar recording at https://vimeo.com/84126769 and https://vimeo.com/84126770
more info on EUCLID: http://euclid-project.eu/
This document discusses moving from big data to smart data. It summarizes three key points:
1) Big data focuses too much on volume and speed without ensuring useful insights. Smart data prioritizes understanding data quality and relationships to provide more value.
2) Organizations should first enrich data by adding metadata, interlinking related pieces, and providing a common layer before pursuing large volumes of raw data.
3) The document describes two success stories where Ontotext utilized semantic technologies and interlinked data sources to provide insightful analytics and answers to complex questions for clients in job market intelligence and asset recovery.
Crossing the Chasm with Semantic TechnologyMarin Dimitrov
After more than a decade of active efforts towards establishing Semantic Web, Linked Data and related standards, the verdict of whether the technology has delivered its promise and has proven itself in the enterprise is still unclear, despite the numerous existing success stories.
Every emerging technology and disruptive innovation has to overcome the challenge of “crossing the chasm” between the early adopters, who are just eager to experiment with the technology potential, and the majority of the companies, who need a proven technology that can be reliably used in mission critical scenarios and deliver quantifiable cost savings.
Succeeding with a Semantic Technology product in the enterprise is a challenging task involving both top quality research and software development practices, but most often the technology adoption challenges are not about the quality of the R&D but about successful business model generation and understanding the complexities and challenges of the technology adoption lifecycle by the enterprise.
This talk will discuss topics related to the challenge of “crossing the chasm” for a Semantic Technology product and provide examples from Ontotext’s experience of successfully delivering Semantic Technology solutions to enterprises.
This document summarizes a presentation about semantic technologies for big data. It discusses how semantic technologies can help address challenges related to the volume, velocity, and variety of big data. Specific examples are provided of large semantic datasets containing billions of triples and semantic applications that have integrated and analyzed disparate data sources. Semantic technologies are presented as a good fit for addressing big data's variety, and research is making progress in applying them to velocity and volume as well.
The course introduces students to current trends in information technology (IT). Cloud computing is a new paradigm on the needs of IT, providing integrated plan for a homogeneous environment offered by the cloud services - Software as a Service (SaaS), Platform as a service (PaaS) and Infrastructure as a service (IaaS).
http://elearn.uni-sofia.bg/course/info.php?id=928
Моби2 ЕООД е компания, специализирана в областите web базирани решения, eLearning, ePublishing, обучения и консултации. Нашите продукти и услуги са насочени към компаниите и институциите, търсещи работещи иновативни решения за своята дейност. През годините се утвърдихме като стабилен партньор на нашите клиенти характерен с коректност, етика, високо качество и високи професионални стандарти.
This presentation gives insight to the overall Horizon 2020 Program and more specifically for the period 2018-2020 with emphasis to ICT. Mariana Damova is the National Contact Point for Horizon 2020 ICT in Bulgaria
Студио проектите са нов елемент от образованието на студентите от инженерните специалности в Нов Български Университет. Основната им идея е студентите да се научат да работят в екип по реални проекти, поставени от външни заинтересовани лица от водещи компании в България и под менторството на преподавателите от университета.
This document discusses Uber's growth and engineering challenges over time. It covers topics like Uber reaching 1 billion and 2 billion trips, microservices, tradeoffs between different programming languages, and tools used for building, deploying, and monitoring Uber's systems and services. The document also highlights advantages of various languages and technologies as well as Uber's open source projects that address common problems.
Delivering Linked Data Training to Data Science PractitionersMarin Dimitrov
Ontotext has provided Linked Data trainings to practitioners from various organizations to educate them on Linked Data and Semantic Web topics. They have learned that trainings need to (1) accommodate mixed audiences with different backgrounds and expertise, (2) use language tailored to each audience, and (3) strike a balance between theoretical foundations and practical applications. Ontotext also developed the EUCLID social media monitoring platform to identify trending topics in Linked Data for extending their training curriculum. The platform integrates and analyzes data from various social media sources to extract topics and visualize analytics.
Scaling to Millions of Concurrent SPARQL Queries on the CloudMarin Dimitrov
The document describes testing the scalability of OWLIM, a semantic database, on Amazon EC2 using a replication cluster approach. It found that:
- A 20 node cluster handled over 1 million SPARQL queries per hour, and a 100 node cluster handled 5 million queries per hour, demonstrating near-linear scalability.
- Cluster nodes maintained high performance, handling 2000-2300 queries per hour each even as the cluster size increased.
- The replication cluster approach distributed load well with low overhead, keeping CPU usage below 30% and network traffic below 0.1 MB/s for slave nodes.
Много често, когато искаме да станем по-добри backend програмисти се опитваме да научим различни езици за програмиране и съответните библиотеки. Проблема е че в Rails, Express.js, Django или Zend Framework има горе долу едни и същи концепции. Ако искаме да се научим как да пишем код за големи системи, които скалират добре и се справят сами с различни грешки и неочаквани ситуации, трябва да овладеем един друг дял от човешкото познание, който се нарича разпределени системи. В моята презентация ще видим защо трябва да задълбаем в тях и какви са основните принципи като консистентност(consistency), достъпност(availability) и издръжливост на разделения(partition tolerance). Също, ще разгледаме стъпки, които всеки може да направи за да научи повече по темата и да получава нови и актуални знания.
This document discusses Ontotext GraphDB connectors which allow users to perform complex SPARQL queries over RDF data by leveraging external engines like Elasticsearch, Solr, and Lucene. The connectors provide fast full-text search, faceted search, aggregations, and range queries through selective replication of RDF data to the external engines while synchronizing data and managing the connectors through SPARQL queries and updates. This enables users to get the benefits of SPARQL for graph pattern matching along with the advanced querying capabilities of systems like Elasticsearch without having to use a different query language.
Dec'2013 webinar from the EUCLID project on managing large volumes of Linked Data
webinar recording at https://vimeo.com/84126769 and https://vimeo.com/84126770
more info on EUCLID: http://euclid-project.eu/
This document discusses moving from big data to smart data. It summarizes three key points:
1) Big data focuses too much on volume and speed without ensuring useful insights. Smart data prioritizes understanding data quality and relationships to provide more value.
2) Organizations should first enrich data by adding metadata, interlinking related pieces, and providing a common layer before pursuing large volumes of raw data.
3) The document describes two success stories where Ontotext utilized semantic technologies and interlinked data sources to provide insightful analytics and answers to complex questions for clients in job market intelligence and asset recovery.
Crossing the Chasm with Semantic TechnologyMarin Dimitrov
After more than a decade of active efforts towards establishing Semantic Web, Linked Data and related standards, the verdict of whether the technology has delivered its promise and has proven itself in the enterprise is still unclear, despite the numerous existing success stories.
Every emerging technology and disruptive innovation has to overcome the challenge of “crossing the chasm” between the early adopters, who are just eager to experiment with the technology potential, and the majority of the companies, who need a proven technology that can be reliably used in mission critical scenarios and deliver quantifiable cost savings.
Succeeding with a Semantic Technology product in the enterprise is a challenging task involving both top quality research and software development practices, but most often the technology adoption challenges are not about the quality of the R&D but about successful business model generation and understanding the complexities and challenges of the technology adoption lifecycle by the enterprise.
This talk will discuss topics related to the challenge of “crossing the chasm” for a Semantic Technology product and provide examples from Ontotext’s experience of successfully delivering Semantic Technology solutions to enterprises.
This document summarizes a presentation about semantic technologies for big data. It discusses how semantic technologies can help address challenges related to the volume, velocity, and variety of big data. Specific examples are provided of large semantic datasets containing billions of triples and semantic applications that have integrated and analyzed disparate data sources. Semantic technologies are presented as a good fit for addressing big data's variety, and research is making progress in applying them to velocity and volume as well.
The course introduces students to current trends in information technology (IT). Cloud computing is a new paradigm on the needs of IT, providing integrated plan for a homogeneous environment offered by the cloud services - Software as a Service (SaaS), Platform as a service (PaaS) and Infrastructure as a service (IaaS).
http://elearn.uni-sofia.bg/course/info.php?id=928
Моби2 ЕООД е компания, специализирана в областите web базирани решения, eLearning, ePublishing, обучения и консултации. Нашите продукти и услуги са насочени към компаниите и институциите, търсещи работещи иновативни решения за своята дейност. През годините се утвърдихме като стабилен партньор на нашите клиенти характерен с коректност, етика, високо качество и високи професионални стандарти.
This presentation gives insight to the overall Horizon 2020 Program and more specifically for the period 2018-2020 with emphasis to ICT. Mariana Damova is the National Contact Point for Horizon 2020 ICT in Bulgaria
Студио проектите са нов елемент от образованието на студентите от инженерните специалности в Нов Български Университет. Основната им идея е студентите да се научат да работят в екип по реални проекти, поставени от външни заинтересовани лица от водещи компании в България и под менторството на преподавателите от университета.
*Мобилният пазар в света и Европа - данни от IHS
*Мобилната реклама в България - данни от AdEx 2012
*MediaScope Europe 2012 - проучване на медия потреблението в Европа | България
Measuring the Productivity of Your Engineering Organisation - the Good, the B...Marin Dimitrov
High-performing engineering teams regularly dedicate time on measuring the performance & quality of the systems and applications they’re building or on measuring & improving the various aspects of the development lifecycle. High-performing product companies are also data-driven when it comes to measuring the impact of new features & products in terms of business KPIs and Northstar metrics.
Can a data-driven approach be applied to measuring the performance, maturity and continuous improvement of an engineering team or the whole engineering organisation? In this discussion we’ll cover various important topics related to quantifying the performance of an engineering organisation
The career development of our teammates is among the key responsibilities of a leader - and оur personal career development vision & plan plays a critical role for our long term growth and success. Despite their importance, our career vision is often not getting enough attention and level of detail, or is hampered by easily avoidable mistakes. In this discussion, we’ll address typical mistakes related to long-term career planning, some best practices, and practical steps for building our own long-term career development vision (or the ones of the teammates we are leading), so that career planning becomes a long term journey with clear why/how/what, rather than just a list of SMART goals
Uber began its open source journey in 2015 when three passionate engineers decided to contribute Uber’s work back to the community. In only four years, Uber’s open source program has fostered 350+ outstanding open source projects with 2,000+ contributors worldwide delivering over 70,000 commits. Since 2017, four of Uber’s open source projects have won InfoWorld’s Best of Open Source Software Awards. In this talk, Brian Hsieh & Marin Dimitrov will share more details on Uber’s open source journey, program and best practices, and how Uber enables open innovation by fostering a healthy and collaborative open source culture
Trust - the Key Success Factor for Teams & OrganisationsMarin Dimitrov
>>> Most leaders agree that trust is a key factor for the success o the team and the organisation and that they are actively working to build trust. And yet, various studies imply that almost half of the teams and organisations worldwide experience lower trust levels with their managers, teammates and the rest of the organisation, which leads to decreased engagement, productivity and success.
>>> In this talk we will discuss why trust is a key success factor for every team and every organisation, some good practices for building, sustaining and rebuilding trust, as well as the most common mistakes related to trust building
Marin Dimitrov and Evelina Prodanova from Uber Engineering in Sofia gave a presentation about Uber. They discussed how Uber operates in over 600 cities across 80 countries, providing over 5 billion trips. They also provided information about Uber Engineering events in Sofia and career opportunities at Uber Engineering in Sofia.
talk @ the Computer Science department of Sofia University - practical advice for career growth for students
DEV.BG event http://dev.bg/%D1%81%D1%8A%D0%B1%D0%B8%D1%82%D0%B8%D0%B5/fmi-club-%D0%BF%D1%80%D0%B0%D0%BA%D1%82%D0%B8%D1%87%D0%BD%D0%B8-%D1%81%D1%8A%D0%B2%D0%B5%D1%82%D0%B8-%D0%B7%D0%B0-%D0%BA%D0%B0%D1%80%D0%B8%D0%B5%D1%80%D0%BD%D0%BE-%D1%80%D0%B0%D0%B7%D0%B2%D0%B8%D1%82/
Building, Scaling and Leading High-Performance TeamsMarin Dimitrov
The document discusses building, scaling, and leading high-performance teams. It covers cultural values, attracting top talent through transparent hiring processes and a magical interview experience, coaching and growth through onboarding, knowledge sharing, mentoring, and feedback, and leadership through execution, vision, emotional intelligence, and effective team design. The speaker is an engineering manager sharing experiences from Uber on developing teams and talent.
Uber @ Career Days 2017 (Sofia University)Marin Dimitrov
Uber's engineering team aims to build highly scalable, available, and flexible platforms to achieve Uber's mission of providing transportation that is as reliable as running water everywhere for everyone. Uber currently operates in over 600 cities across 80 countries. The platforms need to handle data from tens of millions of daily trips while ensuring riders and drivers can access documents and data 24/7. Uber also aims to build flexibility into its platforms to meet various compliance requirements in the over 80 countries it operates in worldwide.
Linked Data for the Enterprise: Opportunities and ChallengesMarin Dimitrov
1) Semantic technologies and linked data can help address challenges of integrating disparate data sources and providing unified access to enterprise information.
2) Case studies demonstrate successes in areas like semantic search, knowledge discovery, and dynamic publishing by linking and enriching content.
3) Adoption challenges include developing domain ontologies, query performance, data quality, and getting enterprise IT teams familiar with semantic technologies.
Semantic Technologies and Triplestores for Business IntelligenceMarin Dimitrov
This document provides an introduction to semantic technologies and triplestores. It discusses the Semantic Web vision of making data on the web more accessible and linked. Key concepts covered include RDF, ontologies, OWL, SPARQL and Linked Data. It also introduces triplestores as RDF databases for storing and querying semantic data and compares their features to traditional databases.
This document discusses data marketplaces and the potential benefits of linked data for data marketplaces. It provides an overview of several existing data marketplaces including Factual, InfoChimps, Azure DataMarket, Freebase, Socrata, and Kasabi. These marketplaces vary in their data domains, models, sizes, monetization approaches, and tools for data access. The document also outlines benefits of the semantic web and linked data for data marketplaces, such as unified data representation, global identifiers, interlinked datasets, and easy integration of existing linked open data. However, challenges include ensuring data quality and performing large-scale data integration across different schemas.
This document summarizes Marin Dimitrov's presentation on linked data management at the 3rd GATE training course in Montreal in August 2010. The presentation covered linked data principles, key vocabularies and datasets, open government data initiatives, and tools for working with linked data. Some open issues discussed were the diversity of linked data schemas, data quality issues, reliability of endpoints, licensing concerns, and challenges of querying distributed data.
1. Онтотекст в Европейски проекти
2002-2012
Марин Димитров
5ти семинар «Финансиране чрез европейските рамкови програми»
2. Съдържание
• За Онтотекст
• Участие на Онтотекст в Европейски проекти 2002-
2012
• Продуктизиране на изследователски проекти в
Онтотекст
• Препоръки за участие в проекти
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #2
3. За Онтотекст
• Основана през 2000г. като част от Сирма Груп
– Независима компания от 2009
– Офиси в София и Варна, USA и UK
• Решения за интелигентно управление на данни
• Основни клиенти
– Медии (BBC, Press Association, NDP Nieuwsmedia)
– Фармацевтични компании (AstraZeneca, UCB)
– Музеи (The British Museum, Polish Digital National
Museum, Dutch Public Library)
– Правителствени агенции (The National Archives, DoD)
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #3
5. Основни изледователски области
• Извличане на информация от текст (text mining)
• Интелигентни бази от данни (semantic databases)
• Семантично търсене на иформация (semantic search)
• Семантични технологии за уеб услуги и бизнес
процеси
• Семантични технологии за интелигентно
интегриране и управление на информация
• Свързани данни (Linked Data)
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #5
6. Онтотекст в Европейски проекти 2002-2012
OntoWeb
SWWS
DIP
SEKT
InfraWebs
PrestoSpace
SemanticGov
MediaCampaign
SUPER
TAO
TripCom
SOA4All
LarKC
Insemtives
NoTube
MOLTO
CUBIST
Khresmoi
RENDER
TrendMiner
EUCLID
AnnoMarket
2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #6
7. Партньори по Европейски проекти
• UK: The Open University, University of Sheffield, Sheffield Hallam University, University of
Southampton, Heriot-Watt University, University of Surrey, University of Manchester
• Germany: Karlsruhe Institute of Technology, DFKI, University of Karlsruhe, University of
Stuttgart, Free University of Berlin, University of Bochum, University of Duisburg-Essen,
University of Siegen
• Netherlands: Free University of Amsterdam, Technical University of Eindhoven, University of
Twente
• Ireland: Dublin City University, National University of Ireland at Galway, DERI
• Austria: University of Innsbruck, Medical University of Wien, Technical University of Wien
• България: БАН, НБУ
• Spain: University of Barcelona, Technical University of Cataluña, University of Seville
• France: INRIA, Ecole Centrale Paris
• Switzerland: Swiss Federal Institute of Technology, University of Zurich
• China, Slovenia, Hungary, Greece, Poland, Italy, Romania, Sweden, Czech Republic
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #7
8. Партньори по Европейски проекти (2)
• SAP Research, HP Research, IBM Research, Siemens,
Atos Origin, ILOG (IBM), Unicorn (IBM)
• British Telecom, Telefonica, Telekom Austria, Korea
Telecom, Tiscali, Telekomunikacja Polska
• BBC, RAI, Nielsen Media, The Press Association
• Google, Wikimedia
• AstraZeneca
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #8
10. OWLIM
• http://www.ontotext.com/owlim
• Семантична СУБД (RDF)
– Съвместима с W3C стандартите за RDF, OWL и SPARQL
• Разширена функционалност за пространствено
(geo-spatial) и пълно-текстово (full-text) търсене
• Работа в клъстер
• Основни предимства
– Производителност при добавяне/премахване на факти
– Мащабируемост (scalability)
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #10
11. KIM и Semantic Biomedical Tagger
• http://www.ontotext.com/kim
• Платформи за обработка на текст (text mining) и
семантично анотиране (semantic annotation)
– Автоматично генериране на метаданни и свързани
данни (Linked Data)
• Базирани на платформата с отворен код GATE
• Извличането на информация и обработката на
текст е базирана на онтологии и бази знания
• Адаптирана за различни домейни
– HCLS, Publishing & Media, Cultural Heritage
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #11
12. KIM и Semantic Biomedical Tagger (2)
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #12
13. Семантично интегриране на информация
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #13
14. ПРЕПОРЪКИ ЗА УЧАСТИЕ В
ПРОЕКТИ
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #14
15. Препоръки за успешно участие в Европейски
проекти
• Опит с инновативни технологии
– Финансиране през 2013-2016 в следните области: Cloud
Computing, Exascale Computing, Digital / Sensing
Enterprise, Smart Cities, Internet of Things, Connected &
Social Media, Future Internet, Robotics, Machine
Translation, Cross-media content analytics, Big Data,
scalable hardware architectures (GPU/FPGA), Open Data
• Идея за продуктизиране на резултатите от
проекта
– Бизнес план
– Особено важно за SME проектите
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #15
16. Препоръки за успешно участие в Европейски
проекти (2)
• Връзки и сътрудничество с големи организации
– университети или компании
– Networking events, конференции, съвместни проекти
– ICT Proposers Days (26-27.09, Варшава)
• Продуктова визия за следващите 3-5 години
– Поне 6-12 месеца до започване на проекта
– Конкретни резултати (софтуер) в проекта едва 12-18
месеца след започването
– Няколко проекта могат да допринесат за конкретен
продукт
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #16
17. Препоръки за успешно участие в Европейски
проекти (3)
• Правилният тип проект и консорциум
– IP, STREP, CA, SME
– Възможност за бъдещо сътрудничество с партньори от
проекта
• за Онтотекст: BBC, AstraZeneca, Korea Telecom
– Възможност за включване в проекта на текущи бизнес
партньори
• за Онтотекст: Press Association, Innovantage
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #17
18. Критерии за успешно участие и продуктизиране на
Европейски проекти
• Цели на проекта
• Опит с технологиите използвани в проекта
• Налични ресурси за участие
• Възможности за бързо продуктизиране на
резултатите от проекта
• Съществуващи бизнес партньорства с участници в
проекта
• Възможности за бъдещи бизнес партньорства
5ти семинар «Финансиране чрез европейските рамкови програми» Сеп 2012 #18