CEDAR OnDemand
An intelligent browser extension to generate ontology-based metadata.
Syed Ahmad Chan Bukhari, PhD
ahmad.chan@yale.edu
Importance of Scientific Metadata
● Scientific data are generated by experiments or observations.
● Datasets must be accompanied by auxiliary information in order to be
interpreted and accessed.
Metadata helps
● Make datasets more understandable for humans and processable by machines.
● Integrate data: scientific data analysis often requires multiple datasets to be
integrated across multiple repositories.
● Support discovery across the large variety of scientific datasets, and support
reproducibility.
What is high-quality metadata?
● Datasets and their metadata should be globally identifiable, described using
standardized terminologies, and available in a standardized machine-readable
format.
Challenges with the generation of high-quality
metadata
● The diversity of metadata representation formats and the poor support for
semantic markup typically result in metadata that are of poor quality.
Metadata Diversity in NCBI repositories
Current practices towards data standardization
● Scientific communities have developed templates incorporating detailed
checklists of the metadata needed to describe particular types of
experimental data sources.
● Minimum information standards include:
○ MIAME: Minimum Information About a Microarray Experiment
○ MIAPE: Minimum Information About a Proteomics Experiment
● These standards address one question: what is the minimum amount of
information (metadata) needed to report results in a reproducible and
reusable fashion?
Metadata Standardization and availability
● A large number of public repositories use these community-derived templates
to collect metadata from users.
● FAIRsharing provides a central catalog of existing standards
and data formats.
CEDAR helps to generate FAIR metadata
CEDAR Advantages over conventional
approaches
● Decrease authoring time
○ Suggest values
○ Pre-fill some of the fields
○ Extract metadata from unstructured sources
● Increase metadata quality (accurate, complete, standardized data)
○ Fewer mistakes and inconsistencies
○ Validation (required values, format, data types)
○ Standardized metadata (ontologies)
○ Accurate, complete, standardized
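The validation bullet above (required values, format, data types) can be illustrated with a minimal Python sketch. This is not CEDAR's template model, which these slides do not show: the FieldRule structure, the validate_record helper, and the field names are all hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class FieldRule:
    """Hypothetical validation rule for one metadata field."""
    required: bool = False
    dtype: type = str
    pattern: Optional[str] = None  # regular expression a string value must fully match


def validate_record(record: dict, rules: dict) -> list:
    """Return a list of problems; an empty list means the record passed."""
    problems = []
    for name, rule in rules.items():
        value = record.get(name)
        # Required-value check.
        if value is None or value == "":
            if rule.required:
                problems.append(f"{name}: required value is missing")
            continue
        # Data-type check.
        if not isinstance(value, rule.dtype):
            problems.append(f"{name}: expected {rule.dtype.__name__}")
            continue
        # Format check (only meaningful for string values).
        if rule.pattern and isinstance(value, str):
            if not re.fullmatch(rule.pattern, value):
                problems.append(f"{name}: value does not match the expected format")
    return problems


# Illustrative rules, not taken from any real template.
RULES = {
    "organism": FieldRule(required=True),
    "collection_date": FieldRule(pattern=r"\d{4}-\d{2}-\d{2}"),
    "sample_count": FieldRule(dtype=int),
}
```

Calling validate_record(record, RULES) on a submitted record returns one message per failed check, so a form could show all problems at once instead of stopping at the first.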
CEDAR provides run-time recommendations
CEDAR can help with editing metadata within its own
environment
● CEDAR template design and metadata authoring are centralized in the CEDAR workbench.
● Outside of the CEDAR workbench, there are a number of existing portals
that provide conventional metadata submission environments.
● CEDAR OnDemand is a browser extension
○ An extension is essentially a small software program that can access and
modify the contents of a web page and enhance the functionality of a web
browser.
Most public data repositories provide web
interfaces
● The lack of standardization in the collected metadata limits how broadly the
source datasets can be discovered and reused.
● The creation of standardized metadata can be facilitated using standard
vocabularies/ontologies.
● CEDAR has developed technologies to facilitate high-quality metadata authoring.
● While CEDAR has been working closely with several data providers to implement
such pipelines, there is a communication and implementation overhead.
● The goal is to reach as many public biomedical data repositories as possible and
enable users to generate ontology-linked, standardized metadata within the
repository-specific environment.
● This approach enables the user to seamlessly enter ontologically controlled
metadata through existing web forms native to individual repositories.
● CEDAR OnDemand helps lower the barrier to incorporating ontologies into
standardized metadata entry for public data repositories.
The key advantage of this approach is that it facilitates the creation of
ontology-annotated metadata in existing web forms without requiring
the individual repositories to change any code.
A manifest file is the entry point for the Chrome extension script to take
action
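As a rough illustration of what such a manifest declares, the Python sketch below builds a manifest as a dictionary and serializes it to JSON. The slides do not show the actual CEDAR OnDemand manifest, so everything here is an assumption: it uses the Manifest V3 layout, and the extension name, the background.js file name, and the host permission are hypothetical. The activeTab permission is chosen because it grants page access only after the user clicks the extension, which fits a manual-activation design.

```python
import json


def build_manifest(name: str, version: str) -> dict:
    """Build a hypothetical Manifest V3 description for a form-helper extension."""
    return {
        "manifest_version": 3,
        "name": name,
        "version": version,
        "description": "Suggests ontology terms for fields in web forms.",
        # activeTab: access to the current page only after the user clicks the extension.
        # scripting: lets the background script inject code into that page.
        "permissions": ["activeTab", "scripting"],
        # Hosts the extension is allowed to call (assumed here: the BioPortal REST API).
        "host_permissions": ["https://data.bioontology.org/*"],
        # Toolbar button that the user clicks to activate the extension.
        "action": {"default_title": "Activate on this page"},
        # Hypothetical background script that reacts to the click.
        "background": {"service_worker": "background.js"},
    }


manifest_json = json.dumps(build_manifest("Example ontology helper", "0.1.0"), indent=2)
```

The resulting JSON text is what would be saved as manifest.json at the root of the extension folder.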
● CEDAR OnDemand helps users create standardized, machine-readable
metadata on web forms accessible through the web.
● It can operate through its own interface or work seamlessly without providing
any graphical interface.
● CEDAR OnDemand uses the CEDAR terminology API server and the NCBO
web services to access ontologies available on BioPortal and to predict relevant
metadata.
● Upon activation, the CEDAR OnDemand script analyzes a web page's contents
through the browser's document object model (DOM), which defines the
content, structure and style of an HTML document.
● To predict the field-specific ontology pool, the CEDAR OnDemand script takes
the text associated with the input fields of a web page as input and invokes the
CEDAR ontology server API through RESTful web services.
● To access the biomedical ontologies available on BioPortal through the CEDAR
ontology server API, we use AJAX (asynchronous JavaScript and XML).
AJAX communicates with the CEDAR server asynchronously (in the background)
through the XMLHttpRequest object to send and retrieve data.
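The two steps above (reading the text associated with input fields, then calling an ontology service) can be sketched as follows. The extension itself does this in JavaScript through the DOM and XMLHttpRequest; the Python below is only a stand-in for the logic, using the standard-library HTML parser instead of a browser. The helper names field_labels and search_url are hypothetical, and the request is built against the public BioPortal search endpoint rather than the CEDAR terminology server, whose API is not described in these slides. No request is sent; the sketch only constructs the URL.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode


class FieldLabelParser(HTMLParser):
    """Collect the label text associated with each text input on a page."""

    def __init__(self):
        super().__init__()
        self.labels = {}      # input id -> label text
        self.input_ids = []   # ids of text inputs, in document order
        self._current_for = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "label":
            self._current_for = attrs.get("for")
            if self._current_for is not None:
                self.labels.setdefault(self._current_for, "")
        elif tag == "input" and attrs.get("type", "text") == "text" and "id" in attrs:
            # Only plain text inputs are collected; password fields are skipped.
            self.input_ids.append(attrs["id"])

    def handle_endtag(self, tag):
        if tag == "label":
            self._current_for = None

    def handle_data(self, data):
        if self._current_for is not None:
            self.labels[self._current_for] += data


def field_labels(html: str) -> dict:
    """Map each text input id to its whitespace-normalized label text."""
    parser = FieldLabelParser()
    parser.feed(html)
    parser.close()
    return {i: " ".join(parser.labels.get(i, "").split()) for i in parser.input_ids}


def search_url(text, ontologies, api_key, base="http://data.bioontology.org/search"):
    """Build a BioPortal term-search URL restricted to the given ontology acronyms."""
    query = {"q": text, "ontologies": ",".join(ontologies), "apikey": api_key}
    return base + "?" + urlencode(query)
```

For each entry returned by field_labels, the label text would be passed to search_url together with the candidate ontologies, and the response would be used to populate suggestions for that field.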
NCBO tools and services in summary
[Figure: NCBO tools and services, available through http://bioportal.bioontology.org and the REST API at http://data.bioontology.org]
● Ontology search: download, traverse, search, comment
● Widgets: tree-view, auto-complete, graph-view
● Mapping services: create, download, upload
● Annotator and Recommender: term recognition, ontology association, class recommendation
● Our algorithm syntactically matches the keywords in the text associated with
a field against the ontology descriptions and fetches the relevant ontology URI
(Uniform Resource Identifier).
● To find the relevant ontology terms, our algorithm searches the domain ontologies
first [NCBITAXON, DOID, GO, OBI, PR, CL].
● Our approach narrows down the scope of the ontology class search, which helps to
provide relevant semantic vocabulary at run time.
● While running, CEDAR OnDemand displays the most relevant classes at run time
as the user authors scientific metadata.
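A minimal sketch of this kind of syntactic matching is shown below. It is not the algorithm used by CEDAR OnDemand, whose details the slides do not give; it assumes simple token overlap between the field text and each ontology description, and it ranks the domain ontologies listed above ahead of any others. The rank_ontologies helper is hypothetical.

```python
import re

# Domain ontologies named on the slide; these are searched first.
DOMAIN_ONTOLOGIES = ["NCBITAXON", "DOID", "GO", "OBI", "PR", "CL"]


def tokens(text: str) -> set:
    """Lowercase a string and split it into a set of alphanumeric keywords."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def rank_ontologies(field_text: str, descriptions: dict, preferred=DOMAIN_ONTOLOGIES) -> list:
    """Rank ontology acronyms by keyword overlap with the field text.

    descriptions maps an ontology acronym to its description text.
    Ontologies with no shared keyword are dropped. Preferred (domain)
    ontologies come first, then higher overlap, then alphabetical order.
    """
    query = tokens(field_text)
    scored = []
    for acronym, description in descriptions.items():
        overlap = len(query & tokens(description))
        if overlap == 0:
            continue
        scored.append((acronym not in preferred, -overlap, acronym))
    return [acronym for _, _, acronym in sorted(scored)]
```

Exact token overlap is deliberately crude: "name" does not match "names", so a real implementation would likely need stemming or fuzzy string matching on top of this.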
CEDAR OnDemand In action
Other potential usage of CEDAR OnDemand
What could be other application areas?
Challenges and limitations
● Auto-reading web page contents is a vulnerability: it could be used for
browser-based eavesdropping attacks, e.g. on passwords or credit card numbers.
■ Control is given to users through manual activation
● Diversity of input fields, e.g. <input type=text>, <div>, <inputfield>, <text>
■ Supports <input type=text>, <div>, HTML5
■ Limited support for Twitter Bootstrap
● Selecting the right ontology: most of the ontologies in BioPortal do not have
definitions and descriptions.
■ A string-mapping algorithm is currently used to fetch the right ontology ID
● Run-time delay
■ Limited to a set of ontologies
Future Work
● Topic-to-ontology prediction is the area I plan to focus on in the future, to
increase precision.
● More metadata (e.g. definitions) needs to be displayed at run time; in the
current setup this takes several minutes.
○ Downloading ontologies to a local server could be a possible solution
● An auto-fill feature based on the pre-filled fields would be a great addition
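The local-server idea above amounts to answering repeated lookups from a local store instead of the remote service. The Python sketch below shows the simplest form of that, an in-memory cache around a lookup function. It is an assumption about how such a speed-up could work, not part of the current tool, and the DefinitionCache class and its fetch callback are hypothetical.

```python
from typing import Callable, Dict


class DefinitionCache:
    """Keep term definitions locally so each one is fetched at most once."""

    def __init__(self, fetch: Callable[[str], str]):
        self._fetch = fetch              # slow lookup, e.g. a remote web service call
        self._store: Dict[str, str] = {}
        self.misses = 0                  # number of lookups that needed the slow path

    def get(self, term_uri: str) -> str:
        """Return the definition for a term URI, fetching it only on first use."""
        if term_uri not in self._store:
            self.misses += 1
            self._store[term_uri] = self._fetch(term_uri)
        return self._store[term_uri]
```

A locally downloaded ontology would go one step further and pre-populate the store, so that even the first lookup avoids the network.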
Summary
● CEDAR OnDemand is a Chrome browser extension that helps create
standardized, high-quality metadata on web forms available on the web.
● It uses the functionality of cutting-edge ontology web services and tools
available at the NCBO and the CEDAR workbench and makes them available outside
their native working environments.
● CEDAR OnDemand is an application-independent browser extension that can
work on mobile platforms as well.
Availability
● CEDAR OnDemand is freely available on the Chrome Web Store. The source code can be
accessed on GitHub at http://github.com/ahmadchan/cedarondemand
Acknowledgement
Kei-Hoi Cheung, Yale University, Dept. of Medical Informatics
Kleinstein Lab, Yale University, Dept. of Pathology
