SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment
Notes by Marco Brandizi
Key Notes
Medha Devare, Spinning a Semantic Web
for Agriculture
The CGIAR challenges
Their domain is very varied, ranging from fighting poverty to helping to access markets
Technologies to integrate data exists, need to be put together
AI and SW are different, but one need the other, and one can provide results for the other
See also: https://bigdata.cgiar.org/
The GARDIAN Platform
Where should I plant my rice? How should I manage my crop? How to mitigate risks and define
insurance plans?
CGIAR diverse data collected and harmonised, using LD/ontologies
Data made available via SPARQL
AgroFIMS
Platform for field trial data collection
Support both electronic and paper-based operations (need for flexibility)
UI and functionality built over ontology modelling
Export metadata
R Scripting functionality for analytics
Christian Lovis , AI and Big data: the
dilemna of Truth
Limits of bioinformatics (eg, in genetics)
There are unpredictable things
Limits of AI, eg,
overfitting (Wheels are faces)
Unreliable data (chocolate consumption vs Nobel laureates)
Biases (Google prefers white skin women)
Our conceptions
Should unreproducible papers being retracted? Shamed?
Anonymisation is impossible, privacy should be post-action
too, not just preventive
Philippe Bourne, How does Data Science
impact the Semantic Web
Science isn’t made with formal definitions
Data Science is unexpected reuse of
information
SW has opportunity to contribute, but
schema.org is becoming the norm, not
the exception
FAIR is broader than SW
Model
Transportability
Horizontal
Integration
Multi-scale
Integration
human
mouse
zebrafish
DNA
Gene/Protein
Network
Cell
Tissue
Organ
Body
Population
CNV SNP methylation
3D structure Gene
expression Proteomics
Metabolomics
MetabolicSignaling
transduction
Gene
regulation
Hepatic Myoepithelial Erythrocyte
Epithelial Muscle Nervous
Liver Kidney Pancreas Heart
Physiologically based
pharmacokinetics
GWASPopulation
dynamics
Microbiota
Open, complex, diverse digital data
Systems Pharmacology
Xie et al. Annu Rev Pharmacol Toxicol. 2017 57:245-262
12/04/18
18
Dean Allemang, Semantic Web and the
New Industrial Revolution 
Comparing academia and business
eg, publishing/Sharing as goal, vs
absolutely forbidden
The evolution of FIBO, from OWL to auto-
generated multiple views/formats
The increasing role of vocabularies and
shared data models
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment
Notes by Marco Brandizi
Agriculture
More Agriculture-related talks
Design of a Framework to Support Reuse of Open Data about Agriculture
Data files are harvested and enriched (at metadata level) with text mining, stored
as linked data
A recommender ranks data annotations according to vectors of user preferences
Web services to access annotated data are auto-created with SADI
Lightly specified ontologies for access to agricultural information across languages and
domains (https://goo.gl/z5qwvh)
The case for taxonomies, SKOS vocabularies, thesauri, etc
The Global Agricultural Concept Scheme (GACS)
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment
Notes by Marco Brandizi
Data Integration &

Exploration
Integration and Data Access Platforms
Data2Services: enabling automated conversion of data to services
Several data formats converted to generic RDF model,
Which can then be translated to something more significant, via SPARQL
Garlic service to convert from spraql to API, with calls like class/all, class/$class/instances,
resource/$uri
Architecture for the harmonization of clinical cohort data in the IMI EMIF project
Harmonised model, configurable mappings to data sources
Sparklis over PEGASE Knowledge Graph: A New Tool for Pharmacovigilance
SPARKLIS, help formulating SPARQL in natural language, and also get NL results
They developed a vigilance ontology and extended SPARKLIS: OntoADR, which leverages SNOMED
They use MeDRA as vigilance source
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment
Notes by Marco Brandizi
Data Publishing &

Interoperability
The bioschemas and WikiBase Tutorials
Bioschemas
Common lightweight ontology to publish data on the web,
mainly to support search engines (derived from schema.org)
Tutorial gave an overview and examples of annotation using their tool: https://goo.gl/GFDhPF
During the hackathon, I’ve got info about proposing new types
WikiData
The Wikipedia of data
Multiple formats supported (JSON, RDF, SPARQL)
Increasingly being used to share open data, make resolvable URIs
Batch imports or Wikibase editor (http://wikiba.se)
Can be used with local installations, Docker support
See also: https://stuff.coffeecode.net/2018/wikibase-workshop-swib18.html
Common properties to describe data (promotes interoperability)
Using Wikidata for semantic data modeling in education and research
Data integration workshops, using Wikidata and Wikibase
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment
Notes by Marco Brandizi
Metadata
CEDAR
Tool to annotate datasets with metadata
Similar to COPO
Dataset-level metadata description, ontology autocompletion,
ontology recommender
http://tinyurl.com/cedar-swat4ls2018
See also: https://www.go-fair.org
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment
Notes by Marco Brandizi
Artificial Intelligence
Data Annotation &
Enrichment
Enrichment, Text Mining, AI and alike
Evaluation of Knowledge Graph Embedding Approaches for Drug-Drug Interaction Prediction using Linked Open Data
Uses the same methods are used in:
Vec2SPARQL: integrating SPARQL queries and knowledge graph embeddings
APIs to extract feature vectors are integrated with Linked Data in two ways:
LD used to compute features via ML (random walks)
SPARQL extended with similarity metric functions (eg, mostSimilar (?uri, top-n)
Ontology-Driven Metadata Enrichment for Genomic Datasets
Semi-structured sequencing data annotated with ZOOMA/BioPortal,
then scored according to similarity between original text and matched term label
Cooperation of bio-ontologies for the classification of genetic intellectual disabilities : a diseasome approach
Data are classified into multiple disease classes
Results assessed with a comparison metric that rewards pairs in the same disease class
Notes by Marco Brandizi
Artificial Intelligence
Data Annotation &
Enrichment
The KNIME Tutorial
It’s a platform similar to Galaxy, but dedicated mostly to Machine
Learning
Workflows can be executed in batch mode, results can be exported
as structured data
Notes by Marco Brandizi
Key Notes
Agriculture
Data Publishing &

Interoperability
MetadataArtificial Intelligence
Data Integration &

Exploration
Data Annotation &
Enrichment

Mais conteúdo relacionado

Mais procurados

International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionLinking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionRonald Ashri
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataJenn Riley
 
International Journal of Education (IJE)
International Journal of Education (IJE) International Journal of Education (IJE)
International Journal of Education (IJE) ijfcst journal
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...Connected Data World
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesSrinath Srinivasa
 
PID Services for FAIR data
PID Services for FAIR dataPID Services for FAIR data
PID Services for FAIR dataOpenAIRE
 
PID services - understandability and findability of data
PID services - understandability and findability of dataPID services - understandability and findability of data
PID services - understandability and findability of dataEOSC-hub project
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphIoan Toma
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)ijfcst journal
 
It Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got SemanticsIt Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got SemanticsOntotext
 

Mais procurados (20)

International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionLinking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
International Journal of Education (IJE)
International Journal of Education (IJE) International Journal of Education (IJE)
International Journal of Education (IJE)
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
Sebastian Hellmann
Sebastian HellmannSebastian Hellmann
Sebastian Hellmann
 
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
 
PID Services for FAIR data
PID Services for FAIR dataPID Services for FAIR data
PID Services for FAIR data
 
PID services - understandability and findability of data
PID services - understandability and findability of dataPID services - understandability and findability of data
PID services - understandability and findability of data
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)International Journal of Data mining Management Systems (IJDMS)
International Journal of Data mining Management Systems (IJDMS)
 
It Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got SemanticsIt Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got Semantics
 

Semelhante a Notes about SWAT4LS 2018

Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-PillarBuilding Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-PillarEOSC-Pillar European Project
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
Hadoop India Summit, Feb 2011 - Informatica
Hadoop India Summit, Feb 2011 - InformaticaHadoop India Summit, Feb 2011 - Informatica
Hadoop India Summit, Feb 2011 - InformaticaSanjeev Kumar
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Data Without Borders
Data Without BordersData Without Borders
Data Without BordersAeolai
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessAjay Ohri
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1Parviz Vakili
 
Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Dag Endresen
 
Putting the L in front: from Open Data to Linked Open Data
Putting the L in front: from Open Data to Linked Open DataPutting the L in front: from Open Data to Linked Open Data
Putting the L in front: from Open Data to Linked Open DataMartin Kaltenböck
 
Moving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and PerspectivesMoving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and PerspectivesIJRESJOURNAL
 
GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...
GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...
GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...Dag Endresen
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Joachim Neubert
 
EIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationEIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationVishwas Chavan
 
IRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and ToolsIRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and ToolsIRJET Journal
 

Semelhante a Notes about SWAT4LS 2018 (20)

Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-PillarBuilding Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Gbrds Tech Issues Op
Gbrds Tech Issues OpGbrds Tech Issues Op
Gbrds Tech Issues Op
 
Overview of CGIAR’s Big Data Platform
Overview of CGIAR’s Big Data PlatformOverview of CGIAR’s Big Data Platform
Overview of CGIAR’s Big Data Platform
 
Hadoop India Summit, Feb 2011 - Informatica
Hadoop India Summit, Feb 2011 - InformaticaHadoop India Summit, Feb 2011 - Informatica
Hadoop India Summit, Feb 2011 - Informatica
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Data Without Borders
Data Without BordersData Without Borders
Data Without Borders
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)
 
20230525_mmc_seminar.pdf
20230525_mmc_seminar.pdf20230525_mmc_seminar.pdf
20230525_mmc_seminar.pdf
 
Putting the L in front: from Open Data to Linked Open Data
Putting the L in front: from Open Data to Linked Open DataPutting the L in front: from Open Data to Linked Open Data
Putting the L in front: from Open Data to Linked Open Data
 
Moving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and PerspectivesMoving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and Perspectives
 
GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...
GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...
GBIF web services for biodiversity data, for USDA GRIN, Washington DC, USA (2...
 
2009 11 icudl
2009 11 icudl2009 11 icudl
2009 11 icudl
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 
EIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationEIA Biodiversity Data Mobilisation
EIA Biodiversity Data Mobilisation
 
Big data mining
Big data miningBig data mining
Big data mining
 
Data coordination and the role of RDA
Data coordination and the role of RDAData coordination and the role of RDA
Data coordination and the role of RDA
 
IRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and ToolsIRJET- A Comparative Study on Big Data Analytics Approaches and Tools
IRJET- A Comparative Study on Big Data Analytics Approaches and Tools
 

Mais de Rothamsted Research, UK

FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseRothamsted Research, UK
 
Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesRothamsted Research, UK
 
AgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with BioschemasAgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with BioschemasRothamsted Research, UK
 
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Publishing and Consuming FAIR DataA Case in the Agri-Food DomainPublishing and Consuming FAIR DataA Case in the Agri-Food Domain
Publishing and Consuming FAIR Data A Case in the Agri-Food DomainRothamsted Research, UK
 
AgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesAgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesRothamsted Research, UK
 
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Rothamsted Research, UK
 
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerA Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerRothamsted Research, UK
 
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...Rothamsted Research, UK
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Rothamsted Research, UK
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...Rothamsted Research, UK
 
myEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference servicemyEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference serviceRothamsted Research, UK
 

Mais de Rothamsted Research, UK (20)

FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
 
Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use Cases
 
AgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with BioschemasAgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with Bioschemas
 
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Publishing and Consuming FAIR DataA Case in the Agri-Food DomainPublishing and Consuming FAIR DataA Case in the Agri-Food Domain
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
 
Continuos Integration @Knetminer
Continuos Integration @KnetminerContinuos Integration @Knetminer
Continuos Integration @Knetminer
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
AgriSchemas Progress Report
AgriSchemas Progress ReportAgriSchemas Progress Report
AgriSchemas Progress Report
 
AgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesAgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use Cases
 
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
 
Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018
 
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerA Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
 
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...
 
Interoperable Open Data: Which Recipes?
Interoperable Open Data: Which Recipes?Interoperable Open Data: Which Recipes?
Interoperable Open Data: Which Recipes?
 
Linked Data with the EBI RDF Platform
Linked Data with the EBI RDF PlatformLinked Data with the EBI RDF Platform
Linked Data with the EBI RDF Platform
 
BioSD Linked Data: Lessons Learned
BioSD Linked Data: Lessons LearnedBioSD Linked Data: Lessons Learned
BioSD Linked Data: Lessons Learned
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
myEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference servicemyEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference service
 
Dev 2014 LOD tutorial
Dev 2014 LOD tutorialDev 2014 LOD tutorial
Dev 2014 LOD tutorial
 

Último

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 

Último (20)

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 

Notes about SWAT4LS 2018

  • 1. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment
  • 2. Notes by Marco Brandizi Key Notes
  • 3. Medha Devare, Spinning a Semantic Web for Agriculture The CGIAR challenges Their domain is very varied, ranging from fighting poverty to helping to access markets Technologies to integrate data exists, need to be put together AI and SW are different, but one need the other, and one can provide results for the other See also: https://bigdata.cgiar.org/ The GARDIAN Platform Where should I plant my rice? How should I manage my crop? How to mitigate risks and define insurance plans? CGIAR diverse data collected and harmonised, using LD/ontologies Data made available via SPARQL AgroFIMS Platform for field trial data collection Support both electronic and paper-based operations (need for flexibility) UI and functionality built over ontology modelling Export metadata R Scripting functionality for analytics
  • 4. Christian Lovis , AI and Big data: the dilemna of Truth Limits of bioinformatics (eg, in genetics) There are unpredictable things Limits of AI, eg, overfitting (Wheels are faces) Unreliable data (chocolate consumption vs Nobel laureates) Biases (Google prefers white skin women) Our conceptions Should unreproducible papers being retracted? Shamed? Anonymisation is impossible, privacy should be post-action too, not just preventive
  • 5. Philippe Bourne, How does Data Science impact the Semantic Web Science isn’t made with formal definitions Data Science is unexpected reuse of information SW has opportunity to contribute, but schema.org is becoming the norm, not the exception FAIR is broader than SW Model Transportability Horizontal Integration Multi-scale Integration human mouse zebrafish DNA Gene/Protein Network Cell Tissue Organ Body Population CNV SNP methylation 3D structure Gene expression Proteomics Metabolomics MetabolicSignaling transduction Gene regulation Hepatic Myoepithelial Erythrocyte Epithelial Muscle Nervous Liver Kidney Pancreas Heart Physiologically based pharmacokinetics GWASPopulation dynamics Microbiota Open, complex, diverse digital data Systems Pharmacology Xie et al. Annu Rev Pharmacol Toxicol. 2017 57:245-262 12/04/18 18
  • 6. Dean Allemang, Semantic Web and the New Industrial Revolution  Comparing academia and business eg, publishing/Sharing as goal, vs absolutely forbidden The evolution of FIBO, from OWL to auto- generated multiple views/formats The increasing role of vocabularies and shared data models
  • 7. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment
  • 8. Notes by Marco Brandizi Agriculture
  • 9. More Agriculture-related talks Design of a Framework to Support Reuse of Open Data about Agriculture Data files are harvested and enriched (at metadata level) with text mining, stored as linked data A recommender ranks data annotations according to vectors of user preferences Web services to access annotated data are auto-created with SADI Lightly specified ontologies for access to agricultural information across languages and domains (https://goo.gl/z5qwvh) The case for taxonomies, SKOS vocabularies, thesauri, etc The Global Agricultural Concept Scheme (GACS)
  • 10. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment
  • 11. Notes by Marco Brandizi Data Integration &
 Exploration
  • 12. Integration and Data Access Platforms Data2Services: enabling automated conversion of data to services Several data formats converted to generic RDF model, Which can then be translated to something more significant, via SPARQL Garlic service to convert from spraql to API, with calls like class/all, class/$class/instances, resource/$uri Architecture for the harmonization of clinical cohort data in the IMI EMIF project Harmonised model, configurable mappings to data sources Sparklis over PEGASE Knowledge Graph: A New Tool for Pharmacovigilance SPARKLIS, help formulating SPARQL in natural language, and also get NL results They developed a vigilance ontology and extended SPARKLIS: OntoADR, which leverages SNOMED They use MeDRA as vigilance source
  • 13. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment
  • 14. Notes by Marco Brandizi Data Publishing &
 Interoperability
  • 15. The bioschemas and WikiBase Tutorials Bioschemas Common lightweight ontology to publish data on the web, mainly to support search engines (derived from schema.org) Tutorial gave an overview and examples of annotation using their tool: https://goo.gl/GFDhPF During the hackathon, I’ve got info about proposing new types WikiData The Wikipedia of data Multiple formats supported (JSON, RDF, SPARQL) Increasingly being used to share open data, make resolvable URIs Batch imports or Wikibase editor (http://wikiba.se) Can be used with local installations, Docker support See also: https://stuff.coffeecode.net/2018/wikibase-workshop-swib18.html Common properties to describe data (promotes interoperability) Using Wikidata for semantic data modeling in education and research Data integration workshops, using Wikidata and Wikibase
  • 16. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment
  • 17. Notes by Marco Brandizi Metadata
  • 18. CEDAR Tool to annotate datasets with metadata Similar to COPO Dataset-level metadata description, ontology autocompletion, ontology recommender http://tinyurl.com/cedar-swat4ls2018 See also: https://www.go-fair.org
  • 19. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment
  • 20. Notes by Marco Brandizi Artificial Intelligence Data Annotation & Enrichment
  • 21. Enrichment, Text Mining, AI and alike Evaluation of Knowledge Graph Embedding Approaches for Drug-Drug Interaction Prediction using Linked Open Data Uses the same methods are used in: Vec2SPARQL: integrating SPARQL queries and knowledge graph embeddings APIs to extract feature vectors are integrated with Linked Data in two ways: LD used to compute features via ML (random walks) SPARQL extended with similarity metric functions (eg, mostSimilar (?uri, top-n) Ontology-Driven Metadata Enrichment for Genomic Datasets Semi-structured sequencing data annotated with ZOOMA/BioPortal, then scored according to similarity between original text and matched term label Cooperation of bio-ontologies for the classification of genetic intellectual disabilities : a diseasome approach Data are classified into multiple disease classes Results assessed with a comparison metric that rewards pairs in the same disease class
  • 22. Notes by Marco Brandizi Artificial Intelligence Data Annotation & Enrichment
  • 23. The KNIME Tutorial It’s a platform similar to Galaxy, but dedicated mostly to Machine Learning Workflows can be executed in batch mode, results can be exported as structured data
  • 24. Notes by Marco Brandizi Key Notes Agriculture Data Publishing &
 Interoperability MetadataArtificial Intelligence Data Integration &
 Exploration Data Annotation & Enrichment