SlideShare uma empresa Scribd logo
1 de 30
Baixar para ler offline
PROTOFILWW
A computational platform for the analysis of the relationships between
microorganisms and environmental parameters in activated sludge plants
José Fernandes
Bioinformatics Master Thesis
Prof. Anália Lourenço
Prof. Ana Nicolau
System requirements
• Insertion and retrieval of data has to be done quickly and easily
• Should be possible to export the data so it can be analyzed with other informatics
systems
• Should support statistical assessments
• Have user-friendly visualization capabilities
• Controlled access to data, based on user roles, accounting for data privacy issues
• Easy dissemination of related studies and results
• Always online (web-based)
• Help finding additional information about the microorganisms present in the biological
samples
Overview of the workflow of field and lab work
PROTOFILWWPROTOFILWW
1.635 lines x 137 columns
ProtoFilWW system major components
1. Content Management component: supports the
researchers managing and analyzing the data obtained
from the WWTP’s samples
2. Text Mining component: finding additional information
about the microorganisms present in the biological
samples
High-level integration perspective of ProtoFilWW
Drupal core
PLUGINS
Import data
Reports Access control
Other
services...
PROTOFILWW
SQL
XLS, TXT, CSV
Export dataXLS, TXT, CSV Solr/LuceneViews Solr
Backend
Views
XML
Relational
Database
UIMA
Contend Management component
• Open source Content Management System (CMS) and
Framework (CMF)
• Highly modular and with high extensibility
• Built in the PHP scripting language
WWTP Sample
1. Filamentous bacteria
2. Protozoa
3. Metazoa
4. Physical-chemical
5. Sample characterization
User roles
use case visitors collaborators WWTP researchers administrators
Find studies and results x x x x
Contact researchers x x x
Analysis of available data x
Data insertion x x
Creation of reports x
Export data x
Managing users x
Backup data x
Text Mining x x x x
Dynamic reporting and charting
Reports creation Reports display
Geolocation of the WWTPs
Address geocoding Map display
Text Mining
component
Listing the species
mentioned in a
document
Major Text Mining technologies used
• Lucene is a high-performance text search engine
library.
• Solr is a standalone enterprise search server with a
REST-like API
• UIMA is a powerful infrastructure for the storage,
transport, and retrieval of document and annotation
knowledge accumulated in NLP pipeline systems
• LINNAEUS is a popular organism name identification
system for biomedical literature that is capable of
normalizing to unambiguous NCBI taxonomy identifiers
Text Mining process in ProtoFilWW
Solr/Lucene
LINNAEUS
Solr UIMA
PMC Open Access SubsetPMC Open Access Subset Solr XML documentsSolr XML documents
XPath convertion
Solr LINNAEUS Annotator
UIMA Component Descriptor
Editor plugin UIMA type system for LINNAEUS
LINNAEUS UIMA wrapper running on CVD
Drupal Views Solr Backend
Major contributions
1. The Web-based computational system
www.protofilww.org
2. The Drupal module Views Solr Backend
3. The Solr UIMA plug-in for LINNAEUS Annotator
Em que trabalho após o mestrado…
Preventive Medicine
 Alert the user to the risk of Type 2 Diabetes.
 How?
1. We know the user has a gene mutation associated with Type 2
Diabetes, because he gave us is genome!
2. We know what he has eaten, because he told us!
3. We know what exercise he’s been doing, because he told us!
4. Genehome connects the dots!

Mais conteúdo relacionado

Mais procurados

The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook OntologyStuart Chalk
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIMartin Scharm
 
eXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic ExperimentseXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic ExperimentsTim Clark
 
exFrame: a Semantic Web Platform for Genomics Experiments
exFrame: a Semantic Web Platform for Genomics ExperimentsexFrame: a Semantic Web Platform for Genomics Experiments
exFrame: a Semantic Web Platform for Genomics ExperimentsTim Clark
 
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsFairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsTim Clark
 
Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Trish Whetzel
 
ClearTK: A Framework for Statistical Biomedical Natural Language ...
ClearTK: A Framework for Statistical Biomedical Natural Language ...ClearTK: A Framework for Statistical Biomedical Natural Language ...
ClearTK: A Framework for Statistical Biomedical Natural Language ...butest
 
Annotopia open annotation services platform
Annotopia open annotation services platformAnnotopia open annotation services platform
Annotopia open annotation services platformTim Clark
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIRDOM
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...Catherine Canevet
 
Development of an statistical package for genetic evaluation of trees
Development of an statistical package for genetic evaluation of treesDevelopment of an statistical package for genetic evaluation of trees
Development of an statistical package for genetic evaluation of treesFacundo Muñoz
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
bio data
bio databio data
bio data007dcp
 
44rd CEN WS/LT meeting PT social data
44rd CEN WS/LT meeting PT social data44rd CEN WS/LT meeting PT social data
44rd CEN WS/LT meeting PT social dataJoris Klerkx
 
DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...
DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...
DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...Felipe Albrecht
 
Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...FAIRDOM
 

Mais procurados (20)

The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBI
 
eXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic ExperimentseXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic Experiments
 
exFrame: a Semantic Web Platform for Genomics Experiments
exFrame: a Semantic Web Platform for Genomics ExperimentsexFrame: a Semantic Web Platform for Genomics Experiments
exFrame: a Semantic Web Platform for Genomics Experiments
 
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsFairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology views
 
Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications
 
ClearTK: A Framework for Statistical Biomedical Natural Language ...
ClearTK: A Framework for Statistical Biomedical Natural Language ...ClearTK: A Framework for Statistical Biomedical Natural Language ...
ClearTK: A Framework for Statistical Biomedical Natural Language ...
 
Ngsp
NgspNgsp
Ngsp
 
The Chemtools LaBLog
The Chemtools LaBLogThe Chemtools LaBLog
The Chemtools LaBLog
 
Annotopia open annotation services platform
Annotopia open annotation services platformAnnotopia open annotation services platform
Annotopia open annotation services platform
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
 
Development of an statistical package for genetic evaluation of trees
Development of an statistical package for genetic evaluation of treesDevelopment of an statistical package for genetic evaluation of trees
Development of an statistical package for genetic evaluation of trees
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
bio data
bio databio data
bio data
 
44rd CEN WS/LT meeting PT social data
44rd CEN WS/LT meeting PT social data44rd CEN WS/LT meeting PT social data
44rd CEN WS/LT meeting PT social data
 
BASE : a powerful search engine for Open Access documents
BASE : a powerful search engine for Open Access documents BASE : a powerful search engine for Open Access documents
BASE : a powerful search engine for Open Access documents
 
DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...
DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...
DeepBlue epigenomic data server: programmatic data retrieval and analysis of ...
 
Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...
 

Semelhante a Bioinformatics presentation to students University of Minho

"Data Provenance: Principles and Why it matters for BioMedical Applications"
"Data Provenance: Principles and Why it matters for BioMedical Applications""Data Provenance: Principles and Why it matters for BioMedical Applications"
"Data Provenance: Principles and Why it matters for BioMedical Applications"Pinar Alper
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
The Taverna Software Suite
The Taverna Software SuiteThe Taverna Software Suite
The Taverna Software SuitemyGrid team
 
Application of Library Management Software: NewGenLib
Application of Library Management Software: NewGenLibApplication of Library Management Software: NewGenLib
Application of Library Management Software: NewGenLibDavid Nzoputa Ofili
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...Juan Antonio Vizcaino
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Rafael C. Jimenez
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDavid Wallom
 
Open Science for the Neuroinformatics community - presentation at DI4R
 Open Science for the Neuroinformatics community - presentation at DI4R Open Science for the Neuroinformatics community - presentation at DI4R
Open Science for the Neuroinformatics community - presentation at DI4ROpenAIRE
 
Frances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of ChicagoFrances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of ChicagoKuali Days UK
 
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...OpenAIRE
 
2012 Software Freedom Day Presentation about Koha ILMS
2012 Software Freedom Day Presentation about Koha ILMS2012 Software Freedom Day Presentation about Koha ILMS
2012 Software Freedom Day Presentation about Koha ILMSRYAN T.
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
2014-06-03-Taverna-IS-ENES2
2014-06-03-Taverna-IS-ENES22014-06-03-Taverna-IS-ENES2
2014-06-03-Taverna-IS-ENES2myGrid team
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
 

Semelhante a Bioinformatics presentation to students University of Minho (20)

"Data Provenance: Principles and Why it matters for BioMedical Applications"
"Data Provenance: Principles and Why it matters for BioMedical Applications""Data Provenance: Principles and Why it matters for BioMedical Applications"
"Data Provenance: Principles and Why it matters for BioMedical Applications"
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
The Taverna Software Suite
The Taverna Software SuiteThe Taverna Software Suite
The Taverna Software Suite
 
Application of Library Management Software: NewGenLib
Application of Library Management Software: NewGenLibApplication of Library Management Software: NewGenLib
Application of Library Management Software: NewGenLib
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
ELIXIR Implementation Study: “Mining the Proteome: Enabling Automated Process...
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omics
 
Open Science for the Neuroinformatics community - presentation at DI4R
 Open Science for the Neuroinformatics community - presentation at DI4R Open Science for the Neuroinformatics community - presentation at DI4R
Open Science for the Neuroinformatics community - presentation at DI4R
 
Frances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of ChicagoFrances McNamara - Kuali OLE Implementation at University of Chicago
Frances McNamara - Kuali OLE Implementation at University of Chicago
 
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...
 
2012 Software Freedom Day Presentation about Koha ILMS
2012 Software Freedom Day Presentation about Koha ILMS2012 Software Freedom Day Presentation about Koha ILMS
2012 Software Freedom Day Presentation about Koha ILMS
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
2014-06-03-Taverna-IS-ENES2
2014-06-03-Taverna-IS-ENES22014-06-03-Taverna-IS-ENES2
2014-06-03-Taverna-IS-ENES2
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
Research Objects in Wf4Ever
Research Objects in Wf4EverResearch Objects in Wf4Ever
Research Objects in Wf4Ever
 

Mais de introfini

SEO para E-Commerce: lojas online com sucesso!
SEO para E-Commerce: lojas online com sucesso!SEO para E-Commerce: lojas online com sucesso!
SEO para E-Commerce: lojas online com sucesso!introfini
 
O que todos os developers devem saber sobre seo
O que todos os developers devem saber sobre seoO que todos os developers devem saber sobre seo
O que todos os developers devem saber sobre seointrofini
 
Workshop: Drupal para Bioinformática
Workshop: Drupal para BioinformáticaWorkshop: Drupal para Bioinformática
Workshop: Drupal para Bioinformáticaintrofini
 
E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011
E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011
E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011introfini
 
Seo com Drupal - drupal camp lisboa 2011
Seo com Drupal -  drupal camp lisboa 2011Seo com Drupal -  drupal camp lisboa 2011
Seo com Drupal - drupal camp lisboa 2011introfini
 
Colóquio Drupal na Universidade do Minho
Colóquio Drupal na Universidade do MinhoColóquio Drupal na Universidade do Minho
Colóquio Drupal na Universidade do Minhointrofini
 

Mais de introfini (6)

SEO para E-Commerce: lojas online com sucesso!
SEO para E-Commerce: lojas online com sucesso!SEO para E-Commerce: lojas online com sucesso!
SEO para E-Commerce: lojas online com sucesso!
 
O que todos os developers devem saber sobre seo
O que todos os developers devem saber sobre seoO que todos os developers devem saber sobre seo
O que todos os developers devem saber sobre seo
 
Workshop: Drupal para Bioinformática
Workshop: Drupal para BioinformáticaWorkshop: Drupal para Bioinformática
Workshop: Drupal para Bioinformática
 
E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011
E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011
E-Commerce e Marketing Digital com Drupal no eCommarketing Show Portugal 2011
 
Seo com Drupal - drupal camp lisboa 2011
Seo com Drupal -  drupal camp lisboa 2011Seo com Drupal -  drupal camp lisboa 2011
Seo com Drupal - drupal camp lisboa 2011
 
Colóquio Drupal na Universidade do Minho
Colóquio Drupal na Universidade do MinhoColóquio Drupal na Universidade do Minho
Colóquio Drupal na Universidade do Minho
 

Último

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 

Último (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 

Bioinformatics presentation to students University of Minho

  • 1. PROTOFILWW A computational platform for the analysis of the relationships between microorganisms and environmental parameters in activated sludge plants José Fernandes Bioinformatics Master Thesis Prof. Anália Lourenço Prof. Ana Nicolau
  • 2. System requirements • Insertion and retrieval of data has to be done quickly and easily • Should be possible to export the data so it can be analyzed with other informatics systems • Should support statistical assessments • Have user-friendly visualization capabilities • Controlled access to data, based on user roles, accounting for data privacy issues • Easy dissemination of related studies and results • Always online (web-based) • Help finding additional information about the microorganisms present in the biological samples
  • 3. Overview of the workflow of field and lab work PROTOFILWWPROTOFILWW
  • 4. 1.635 lines x 137 columns
  • 5. ProtoFilWW system major components 1. Content Management component: supports the researchers managing and analyzing the data obtained from the WWTP’s samples 2. Text Mining component: finding additional information about the microorganisms present in the biological samples
  • 6. High-level integration perspective of ProtoFilWW Drupal core PLUGINS Import data Reports Access control Other services... PROTOFILWW SQL XLS, TXT, CSV Export dataXLS, TXT, CSV Solr/LuceneViews Solr Backend Views XML Relational Database UIMA
  • 7. Contend Management component • Open source Content Management System (CMS) and Framework (CMF) • Highly modular and with high extensibility • Built in the PHP scripting language
  • 8. WWTP Sample 1. Filamentous bacteria 2. Protozoa 3. Metazoa 4. Physical-chemical 5. Sample characterization
  • 9. User roles use case visitors collaborators WWTP researchers administrators Find studies and results x x x x Contact researchers x x x Analysis of available data x Data insertion x x Creation of reports x Export data x Managing users x Backup data x Text Mining x x x x
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17. Dynamic reporting and charting Reports creation Reports display
  • 18. Geolocation of the WWTPs Address geocoding Map display
  • 19. Text Mining component Listing the species mentioned in a document
  • 20. Major Text Mining technologies used • Lucene is a high-performance text search engine library. • Solr is a standalone enterprise search server with a REST-like API • UIMA is a powerful infrastructure for the storage, transport, and retrieval of document and annotation knowledge accumulated in NLP pipeline systems • LINNAEUS is a popular organism name identification system for biomedical literature that is capable of normalizing to unambiguous NCBI taxonomy identifiers
  • 21. Text Mining process in ProtoFilWW Solr/Lucene LINNAEUS Solr UIMA PMC Open Access SubsetPMC Open Access Subset Solr XML documentsSolr XML documents XPath convertion
  • 22. Solr LINNAEUS Annotator UIMA Component Descriptor Editor plugin UIMA type system for LINNAEUS
  • 23. LINNAEUS UIMA wrapper running on CVD
  • 24. Drupal Views Solr Backend
  • 25. Major contributions 1. The Web-based computational system www.protofilww.org 2. The Drupal module Views Solr Backend 3. The Solr UIMA plug-in for LINNAEUS Annotator
  • 26. Em que trabalho após o mestrado…
  • 27.
  • 28.
  • 29.
  • 30. Preventive Medicine  Alert the user to the risk of Type 2 Diabetes.  How? 1. We know the user has a gene mutation associated with Type 2 Diabetes, because he gave us is genome! 2. We know what he has eaten, because he told us! 3. We know what exercise he’s been doing, because he told us! 4. Genehome connects the dots!