The main goal of the DataBio project is to show the benefits of Big Data technologies in the raw material production from agriculture, forestry and fishery/aquaculture for the bio-economy industry to produce food, energy and bio-materials responsibly and sustainably. DataBio proposes to deploy a state of the art, big data platform on top of the existing partners’ infrastructure and solutions – the Big DATABIO Platform. Achieved impacts are measured against anticipations. In this webinar, we present the impact of the DataBio project and of its big data platform after three years of implementation, and we illustrate some of the novel breakthroughs on practical cases of artificial intelligence applications that are meaningfully boosting crop monitoring businesses with a global potential.
BDV Webinar Series - Caj - Big Data Breakthroughs for Global Bio-economy Business
1. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
1
This project has received funding from
the European Union’s Horizon 2020
research and innovation programme
under grant agreement No 732064
This project is part
of BDV PPP
DATABIO PLATFORM
Prof. Dr. Caj Södergård,
Technical Coordinator,
VTT Technical Research Centre of Finland
Big Data Breakthroughs for Global Bio-
economy Business
Webinar, 17.12.2019
2. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
2
What do we mean with the DataBio platform ?
• A technical environment
• where software is developed
• to be deployed , e.g. as Docker modules, in hardware, operating system or a
cloud
• Handles Big Data - high Volume, Velocity, Variety
• Provides a big data toolset for digital services in agriculture, forestry and fishery
• Enable new software components to be combined with
• open source
• proprietary components
• Supports the forming of reusable pipelines of software components
3. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
3
Platform offers resources for iterative Sand box development
Sand box
Design Build
Learn Test
4. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
4
Platform offers resources for iterative Sand box development
DataBio Hub
www.databiohub.eu
Deployment
images on
Clouds
DataBio web site
www.databio.eu
Sand box
Design Build
Learn Test
5. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
5
Platform offers resources for iterative Sandbox development
DataBio Hub
www.databiohub.eu
Deployment
images on
Clouds
DataBio web site
www.databio.eu
Sandbox
Design Build
Learn Test
• Components
• Pipelines
• Datasets
• Pilots • Deliverables
• Models
• Presentations
• Close to data
• Docker repositories
• Exploitation platforms
• Forestry TEP
• Proba-V MEP
6. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
6
DataBio platform serves the 27 pilots
Big Data Pipeline
8
7. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
7
DataBio platform serves the 27 pilots
Big Data Pipeline
8
8. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
8
DataBio platform serves the 27 pilots
Big Data Pipeline
8
9. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
9
Pipelines vs. Services
Pipeline
• A chain of processing components: the output data
of each element is the input data of the next
• Clear interfaces between components and to outside
• A ”white box” showing internal wiring for developer
Service
• Provides usability to end users
• No display of internal wirings of components
• Accessed through API:s (web services, remote calls)
• Activated remotely through database queries (”end
points”) and executed in the cloud.
• Represents a ”black box”.
Real-time Data
Collection
ComplexEvent
Processing
Real-time Data
Preprocessing
Decision Making
10. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
10
We use the BDVA Reference Model
11. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
11
Platform development in DataBio in numbers
• 62components in two trial rounds
• 1-6Components per pilots (average 2)
• 14 new user interfaces
• 59 new APIs
• +2,7 in Technology Readiness Level (1-10)
12. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
12
Managing project assets: components, pipelines,
datasets and reports
• DataBio Hub is a central in the development platform
• Provides a catalogue of public and private digital assets of a project (DataBio)
• Links resources together and providesownership information
• Describes currently 95 components, 39 datasets, 12 pipelines, and 25 pilots
and provides links to project reports and models
13. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
13
DataBio Hub structure
14. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
14
Example of a pipeline in Agriculture
Big Data Pipeline
8
15. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
15
Pilot: Precision Agriculture in Olives, Fruits, Grapes
• Smart farming pilot focusing on the exploitation of
heterogeneous data, facts and scientific knowledge
to facilitate decisions and their application in the field
• The pilot promotes sustainable farming practices
through the provision of irrigation, fertilization and
pest/disease management advices
• The farmer benefits from the provided big-data
technologies and advisory services by better
managing the natural resources, optimizing the use
of agricultural inputs and increasing farm yields
Stimagka
Chalkidiki
Veroia
3 pilot sites, crop types and advisory services
4 data sources
fieldremote eye farm
16. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
16
Prediction and real-time alerts of diseases and pests breakouts
• Process
• Collect, validate and store farm IoT data
• Combine with EO and historical farm data
• Perform initial processing, monitoring and
cross-checking on the raw data
• Push the validated values to CEP for further analysis
(temporal reasoning) for triggering early alerts in real-time
• Early experiments with olives (left)
17. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
17
Internet of Things Pipeline
Real-time Data
Collection
ComplexEvent
Processing
Real-time Data
Preprocessing
Decision Making
18. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
18
PILOT INITIAL RESULTS
http://www.ypaithros.gr/en/yannis-
olive-grove-
reduction-by-30-in-production-costs-
and-parallel-increase-of-sales/
SUCCESS STORIES
Chalkidiki Pilot
Avg cost of spraying
(euros/ha)
810
250
790
232
782
71
VEROIA CHALKIDIKI
Base Value Target Value (1st year)
Current Value
Avg cost of irrigation
(euros/ha)
870
330
740
280
490
220
VEROIA CHALKIDIKI
Precision Agriculture in Olives, Fruits, Grapes
-40
-20
-20
Nitrogen under-
fertilization(%)
USER INTERFACES
19. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
19
Conclusions
• DataBio platform is an environment for
developing and deploying software in
bioeconomy
• It uses primarly components from partners
collected into pipelines
• The DataBio platform has deployed 62
components in 27 pilots -> examples for others
• It helps actors outside DataBio to develop services
for agriculture, forestry and fishery
20. This document is part of a project that has received funding
from the European Union’s Horizon 2020 research and innovation programme
under agreement No 732064. It is the property of the DataBio consortium and shall not be distributed or
reproduced without the formal approval of the DataBio Management Committee. Find us at www.databio.eu.
20
12.12.2019VTT – beyond the obvious20
1942 20181984
77 years of innovations Read more:
www.vttresearch.com
12.12.2019 VTT – beyond the obvious 20