SlideShare uma empresa Scribd logo
1 de 15
Baixar para ler offline
An HPSSB (History,
    Philosophy and Social
Studies of Biology) Approach
  to Biomedical Ontologies

            Sabina Leonelli
  ESRC Centre for Genomics in Society
     Department of Sociology and
              Philosophy
          University of Exeter
        s.leonelli@exeter.ac.uk
An HPSSB Perspective on
       the epistemic role of e-Science
Characterisation of experimental science as encompassing a variety
   of ways of knowing and communicating, beyond what can be
   formalised
e.g. modelling, experimental practices, tacit familiarity with instruments
   and materials
This awareness needs to carry to e-Science: not attempt to replace
  laboratory activities, but to complement them (attention: pointing to
  new directions does not mean guiding research, exploratory quality
  of experimentation)

• History of biology ‘big science’ infrastructure since WWII; history of
  model organism research in biology, and of relations between
  biological and medical research
• Philosophy of biology The role of data, theories, different types of
  models, instruments and materials in experimental practices;
  Epistemic functions of classification
• Social Studies of biology: Social organisation of science; Forms of
  and conditions for cooperation and communication; Power relations
  among actors; Institutional and economic context
Case Study: The Gene Ontology
• Arguably most successful bio-ontology to date
• Developed for use by community databases as
  a standard for the annotation of gene products
   history steeped in model organism research
• Good tool for data sharing:
  – Choice of terms is based on research interests of
    users
  – Dynamic system: can be updated to reflect scientific
    developments

• Flexibility comes from appropriate curation:
  – Manual and labour-intensive (impossible to
    automate)
  – Research interests vary across epistemic cultures:
     • How to choose relevant and intelligible labels?
The Classification Problem

       stability of classificatory categories

                    versus

      dynamism and diversity of research
                  practices


 Can classification through standard
 categories enable collaborative research
 without at the same time stifling its
 development and pluralism?
GO as a Classification System
Making data travel across different epistemic
 communities, to facilitate cross-species, integrative
 research: classification of both biological phenomena
 and data
  • Data are associated to biological phenomena via machine-
    readable labels
  • Users can automatically assess the relevance of data as
    evidence for claims about those phenomena
  • To re-use data towards new discoveries, users need to assess
    their reliability within their own research context: meta-data
    enable users to ‘situate’ information through their own
    expertise and tacit knowledge

=
data are de-contextualised for travel and re-
  contextualised for appropriation by a new context
= access is differential: users can choose parameters
  
  for their queries depending on their interests and
  expertise
Classification
  of ‘mined’
     data
Classification of
data provenance
EVIDENCE CODES

Experimental evidence codes
          - Inferred from Mutant Phenotype
          - Inferred from Direct Assay
          - Inferred from Genetic Interaction
          - Inferred from Physical Interaction
          - Inferred from Expression Pattern

Computational analysis
          IEA         - Inferred from Electronic Annotation
          RCA         - Reviewed Computational Analysis
          ISS         - Inferred from Sequence Similarity

Author statement
            TAS        - Traceable Author Statement
            NAS        - Non-traceable Author Statement

Curatorial statement
            IC         - Inferred by Curator
            ND         - No biological Data available
GO as an Expert Community
The threat of imperialism Vs. GO as ‘service to biology’:
whoever chooses labels and what counts as meta-data
  determines nomenclature and protocols used as standard
  across biology (and thus interpretation of data as well as
  experimental set-ups)
1.   De-contextualisation: separating data from information about ‘local’ features of data production
2.   Abstraction: simplifying, eliminating or modifying characteristics of data to be standardised
3.   Knowledge-stabilisation: define terms and relations to mirror (what they see as) the consensus
4.   Situating: associate each dataset with a specific term (and thus a specific phenomenon)

Solution: Curator as mediator between requirements of e-
  Science (consistency, computability, ease of use and wide
  intelligibility) and the diverse practices characterising
  experimental biology
• GO curators develop specific expertise to tackle the threat
      – Cross-disciplinary training> awareness of diverse epistemic cultures
      – Experience ‘at the bench’ > awareness of what users need and look for
• Community involvement (content meetings, feedback,
  crowdsourcing, user training workshop and online
GO as a Scientific Institution
However: emergence of separate expertise is itself an
  obstacle to dialogue with users. Curators face two
  severe problems:
• Impossible to serve users without consultation, yet
  users do not provide feedback: lack of interest, time,
  expertise
• Need to minimise duplication/proliferation of labels,
  yet each curator/ontology has a different perception/
  function of/in the field

Solution: Consortia as regulatory centres --
  standardisation as a tool to serve diversity in
  epistemic practices and interests of users:
• Centralising expertise
• Centralising procedures
The Gene Ontology Consortium
• Michael Ashburner 1998: the terms used for data classification
    should be the ones used to describe research interests
•   July 1998: First meeting of the consortium, members from
    Saccharomyces Genome Database, Mouse Genome Informatics,
    FlyBase, Berkeley Drosophila Genome Project
• October 1999: funding application NIH, AstraZeneca
• 2000-1: Rapid expansion, including the Zebrafish Information
    Network, the Rat Genome Database, The Arabidopsis Information
    Resource, Gramene.
• 2002: Central office in Cambridge
• Grants from National Human Genome Research Institute (NHGRI),
  NIH, EU, AstraZeneca, InciteGenomics, United States Department of
  Agriculture, Research and Education Service and the UK Medical
  Research Council.
• De facto standard for classification, annotation and dissemination of
  genomic data in model organism biology
• In parallel: birth of the Open Biomedical Ontologies
  Consortium
The Institutional Role of
              Consortia: Enforcing
                  Collaboration
• Encourage feedback loops among curators:
    – Rules for bio-ontology development
    – Organisation of curator meetings and communication
    – Enhancing accountability and clear division of labour

• Encourage dialogue with users:
    – ‘Content meetings’
    – Experiment on peer review procedures (e.g. Reactome)
    – Liase with industry to align their data sharing practices

• Co-operate with journals (linking data disclosure with
    publication)

   E.g. Plant Physiology and TAIR: enforcing feedback on GO

• Train users and curators
    – Workshops at conferences and elsewhere
    – Enforce institutionalisation within universities (e.g. Stanford
      Biomedical Informatics; graduate training in UK system
      biology)
The multiple identities of GO
• GO needs to be playing several epistemic roles in biology
     • Classification system
     • Expert community
     • Regulatory institution
• Exemplifies and regulates epistemic and social relations
  between virtual (in silico) and material (wet) practices in
  biology
• Despite institutionalisation within biology, still far from
  having resolved tensions between curator’s vision of
  what technology can do for science, and user needs and
  practices
     •   Handling dissent on terms or definitions
     •   Providing sufficient meta-data to assess data provenance
     •   Non-overlapping datasets and checking data quality
     •   Long-term maintenance, strategies for revision and updating
         (how has GO actually been revised?)
Thanks to ESRC for funding and several bio-ontology
curators (including the GO team at EBI) for their patience
              and availability for interviews
• (in preparation) On the Role of Theory in Data-Driven Research:
  The Case of Bio-Ontologies.
• (2010) Documenting the Emergence of Bio-Ontologies: Or, Why
  Researching Bioinformatics Requires HPSSB. History and
  Philosophy of the Life Sciences.
• (2010) Packaging Data for Re-Use: Databases in Model Organism
  Biology. In Howlett, P and Morgan, MS (eds) How Well Do ‘Facts’
  Travel. CUP. 
• (2009) On the Locality of Data and Claims About Phenomena.
  Philosophy of Science 76, 5.
• (2009) Centralising Labels to Distribute Data: The Regulatory Role
  of Genomic Consortia. In Atkinson et al (eds.) Handbook for
  Genetics and Society: Mapping the New Genomic Era. Routledge,
  pp. 469-485.
• (2008) Bio-Ontologies as Tools for Integration in Biology.
  Biological Theory 3, 1: 8-11.
Abstract

   This paper reflects on the analytic challenges emerging from the
    study of bioinformatic tools recently created to store and
    disseminate biological data, such as databases, repositories and
    bio-ontologies. I focus my discussion on the Gene Ontology, a
    term that defines three entities at once: a classification system
    facilitating the distribution and use of genomic data as evidence
    towards new insights; an expert community specialised in the
    curation of those data; and a scientific institution promoting the
    use of this tool among experimental biologists. These three
    dimensions of the Gene Ontology can be clearly distinguished
    analytically, but are tightly intertwined in practice. I suggest that
    this is true of all bioinformatic tools: they need to be understood
    simultaneously as epistemic, social and institutional entities,
    since they shape the knowledge extracted from data and at the
    same time regulate the organisation, development and
    communication of research. This viewpoint has one important
    implication for the methodologies used to study these tools, that
    is the need to integrate historical, philosophical and sociological
    approaches. I illustrate this claim through examples of
    misunderstandings that may result from a narrowly disciplinary
    study of the Gene Ontology, as I experienced them in my own
    research.

Mais conteúdo relacionado

Mais procurados

Ontologies for Semantic Normalization of Immunological Data
Ontologies for Semantic Normalization of Immunological DataOntologies for Semantic Normalization of Immunological Data
Ontologies for Semantic Normalization of Immunological DataYannick Pouliot
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksAlexander Pico
 
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...mlaij
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyMrinal Vashisth
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Seattle DAML meetup
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryResearch Information Network
 
Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...
Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...
Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...eventi-ITBbari
 
Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Akash Arora
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the futurePistoia Alliance
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance
 
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Jie Bao
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarAlexander Pico
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.docbutest
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisCatherine Canevet
 

Mais procurados (20)

Bio ontology drtc-seminar_anwesha
Bio ontology drtc-seminar_anweshaBio ontology drtc-seminar_anwesha
Bio ontology drtc-seminar_anwesha
 
Ontologies for Semantic Normalization of Immunological Data
Ontologies for Semantic Normalization of Immunological DataOntologies for Semantic Normalization of Immunological Data
Ontologies for Semantic Normalization of Immunological Data
 
Mrr iti phar_mu
Mrr iti phar_muMrr iti phar_mu
Mrr iti phar_mu
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive Networks
 
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems Biology
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK Story
 
Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...
Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...
Michelangelo Ceci – Tecniche di data-mining per la caratterizzazione di entit...
 
Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.doc
 
Bioinformatics Projects And Applications
Bioinformatics Projects And ApplicationsBioinformatics Projects And Applications
Bioinformatics Projects And Applications
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 

Destaque

Introduction: The Past - Future of Research Communications
Introduction: The Past - Future of Research CommunicationsIntroduction: The Past - Future of Research Communications
Introduction: The Past - Future of Research CommunicationsAnita de Waard
 
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...ICZN
 
Scientific Sensemaking
Scientific SensemakingScientific Sensemaking
Scientific SensemakingAnita de Waard
 
C-SHALS 2010: representing scientific discourse, or: why triples are not enough
C-SHALS 2010: representing scientific discourse, or:  why triples are not enoughC-SHALS 2010: representing scientific discourse, or:  why triples are not enough
C-SHALS 2010: representing scientific discourse, or: why triples are not enoughAnita de Waard
 
What’s wrong with research papers - and (how) can we fix it?
What’s wrong with research papers -  and (how) can we fix it?What’s wrong with research papers -  and (how) can we fix it?
What’s wrong with research papers - and (how) can we fix it?Anita de Waard
 
'These Results Suggest That...', Knowledge Attribution in Scientific Discourse
'These Results Suggest That...', Knowledge Attribution in Scientific Discourse'These Results Suggest That...', Knowledge Attribution in Scientific Discourse
'These Results Suggest That...', Knowledge Attribution in Scientific DiscourseAnita de Waard
 
Workflows and challenges
Workflows and challengesWorkflows and challenges
Workflows and challengesAnita de Waard
 

Destaque (7)

Introduction: The Past - Future of Research Communications
Introduction: The Past - Future of Research CommunicationsIntroduction: The Past - Future of Research Communications
Introduction: The Past - Future of Research Communications
 
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
 
Scientific Sensemaking
Scientific SensemakingScientific Sensemaking
Scientific Sensemaking
 
C-SHALS 2010: representing scientific discourse, or: why triples are not enough
C-SHALS 2010: representing scientific discourse, or:  why triples are not enoughC-SHALS 2010: representing scientific discourse, or:  why triples are not enough
C-SHALS 2010: representing scientific discourse, or: why triples are not enough
 
What’s wrong with research papers - and (how) can we fix it?
What’s wrong with research papers -  and (how) can we fix it?What’s wrong with research papers -  and (how) can we fix it?
What’s wrong with research papers - and (how) can we fix it?
 
'These Results Suggest That...', Knowledge Attribution in Scientific Discourse
'These Results Suggest That...', Knowledge Attribution in Scientific Discourse'These Results Suggest That...', Knowledge Attribution in Scientific Discourse
'These Results Suggest That...', Knowledge Attribution in Scientific Discourse
 
Workflows and challenges
Workflows and challengesWorkflows and challenges
Workflows and challenges
 

Semelhante a Sabina Leonelli

Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOAlejandra Gonzalez-Beltran
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introductionMIBBI Checklists
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataSusanna-Assunta Sansone
 
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013Susanna-Assunta Sansone
 
Virtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing ScholarshipVirtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing ScholarshipJohn Butler
 
Virtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing ScholarshipVirtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing ScholarshipJohn Butler
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Susanna-Assunta Sansone
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceSusanna-Assunta Sansone
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Susanna-Assunta Sansone
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
CINECA webinar slides: Making cohort data FAIR
CINECA webinar slides: Making cohort data FAIRCINECA webinar slides: Making cohort data FAIR
CINECA webinar slides: Making cohort data FAIRCINECAProject
 
Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Susanna-Assunta Sansone
 
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...Globus
 

Semelhante a Sabina Leonelli (20)

Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introduction
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
B4OS-2012
B4OS-2012B4OS-2012
B4OS-2012
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
 
Virtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing ScholarshipVirtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing Scholarship
 
Virtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing ScholarshipVirtual Communities: Catalysts for Advancing Scholarship
Virtual Communities: Catalysts for Advancing Scholarship
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Edulearn2014 itd
Edulearn2014 itdEdulearn2014 itd
Edulearn2014 itd
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and Neuroscience
 
Bloomsbury Conference
Bloomsbury ConferenceBloomsbury Conference
Bloomsbury Conference
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
21st Century Research Landscape
21st Century Research Landscape21st Century Research Landscape
21st Century Research Landscape
 
CINECA webinar slides: Making cohort data FAIR
CINECA webinar slides: Making cohort data FAIRCINECA webinar slides: Making cohort data FAIR
CINECA webinar slides: Making cohort data FAIR
 
Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015
 
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
Virtual Organizations 2.0: Social Constructs for Data-centered Collaborative ...
 

Mais de Anita de Waard

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesAnita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Anita de Waard
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?Anita de Waard
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data SharingAnita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingAnita de Waard
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataAnita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016Anita de Waard
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 

Mais de Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 

Último

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 

Último (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Sabina Leonelli

  • 1. An HPSSB (History, Philosophy and Social Studies of Biology) Approach to Biomedical Ontologies Sabina Leonelli ESRC Centre for Genomics in Society Department of Sociology and Philosophy University of Exeter s.leonelli@exeter.ac.uk
  • 2. An HPSSB Perspective on the epistemic role of e-Science Characterisation of experimental science as encompassing a variety of ways of knowing and communicating, beyond what can be formalised e.g. modelling, experimental practices, tacit familiarity with instruments and materials This awareness needs to carry to e-Science: not attempt to replace laboratory activities, but to complement them (attention: pointing to new directions does not mean guiding research, exploratory quality of experimentation) • History of biology ‘big science’ infrastructure since WWII; history of model organism research in biology, and of relations between biological and medical research • Philosophy of biology The role of data, theories, different types of models, instruments and materials in experimental practices; Epistemic functions of classification • Social Studies of biology: Social organisation of science; Forms of and conditions for cooperation and communication; Power relations among actors; Institutional and economic context
  • 3. Case Study: The Gene Ontology • Arguably most successful bio-ontology to date • Developed for use by community databases as a standard for the annotation of gene products  history steeped in model organism research • Good tool for data sharing: – Choice of terms is based on research interests of users – Dynamic system: can be updated to reflect scientific developments • Flexibility comes from appropriate curation: – Manual and labour-intensive (impossible to automate) – Research interests vary across epistemic cultures: • How to choose relevant and intelligible labels?
  • 4.
  • 5. The Classification Problem stability of classificatory categories versus dynamism and diversity of research practices  Can classification through standard categories enable collaborative research without at the same time stifling its development and pluralism?
  • 6. GO as a Classification System Making data travel across different epistemic communities, to facilitate cross-species, integrative research: classification of both biological phenomena and data • Data are associated to biological phenomena via machine- readable labels • Users can automatically assess the relevance of data as evidence for claims about those phenomena • To re-use data towards new discoveries, users need to assess their reliability within their own research context: meta-data enable users to ‘situate’ information through their own expertise and tacit knowledge = data are de-contextualised for travel and re- contextualised for appropriation by a new context = access is differential: users can choose parameters for their queries depending on their interests and expertise
  • 7. Classification of ‘mined’ data
  • 8. Classification of data provenance EVIDENCE CODES Experimental evidence codes - Inferred from Mutant Phenotype - Inferred from Direct Assay - Inferred from Genetic Interaction - Inferred from Physical Interaction - Inferred from Expression Pattern Computational analysis IEA - Inferred from Electronic Annotation RCA - Reviewed Computational Analysis ISS - Inferred from Sequence Similarity Author statement TAS - Traceable Author Statement NAS - Non-traceable Author Statement Curatorial statement IC - Inferred by Curator ND - No biological Data available
  • 9. GO as an Expert Community The threat of imperialism Vs. GO as ‘service to biology’: whoever chooses labels and what counts as meta-data determines nomenclature and protocols used as standard across biology (and thus interpretation of data as well as experimental set-ups) 1. De-contextualisation: separating data from information about ‘local’ features of data production 2. Abstraction: simplifying, eliminating or modifying characteristics of data to be standardised 3. Knowledge-stabilisation: define terms and relations to mirror (what they see as) the consensus 4. Situating: associate each dataset with a specific term (and thus a specific phenomenon) Solution: Curator as mediator between requirements of e- Science (consistency, computability, ease of use and wide intelligibility) and the diverse practices characterising experimental biology • GO curators develop specific expertise to tackle the threat – Cross-disciplinary training> awareness of diverse epistemic cultures – Experience ‘at the bench’ > awareness of what users need and look for • Community involvement (content meetings, feedback, crowdsourcing, user training workshop and online
  • 10. GO as a Scientific Institution However: emergence of separate expertise is itself an obstacle to dialogue with users. Curators face two severe problems: • Impossible to serve users without consultation, yet users do not provide feedback: lack of interest, time, expertise • Need to minimise duplication/proliferation of labels, yet each curator/ontology has a different perception/ function of/in the field Solution: Consortia as regulatory centres -- standardisation as a tool to serve diversity in epistemic practices and interests of users: • Centralising expertise • Centralising procedures
  • 11. The Gene Ontology Consortium • Michael Ashburner 1998: the terms used for data classification should be the ones used to describe research interests • July 1998: First meeting of the consortium, members from Saccharomyces Genome Database, Mouse Genome Informatics, FlyBase, Berkeley Drosophila Genome Project • October 1999: funding application NIH, AstraZeneca • 2000-1: Rapid expansion, including the Zebrafish Information Network, the Rat Genome Database, The Arabidopsis Information Resource, Gramene. • 2002: Central office in Cambridge • Grants from National Human Genome Research Institute (NHGRI), NIH, EU, AstraZeneca, InciteGenomics, United States Department of Agriculture, Research and Education Service and the UK Medical Research Council. • De facto standard for classification, annotation and dissemination of genomic data in model organism biology • In parallel: birth of the Open Biomedical Ontologies Consortium
  • 12. The Institutional Role of Consortia: Enforcing Collaboration • Encourage feedback loops among curators: – Rules for bio-ontology development – Organisation of curator meetings and communication – Enhancing accountability and clear division of labour • Encourage dialogue with users: – ‘Content meetings’ – Experiment on peer review procedures (e.g. Reactome) – Liase with industry to align their data sharing practices • Co-operate with journals (linking data disclosure with publication) E.g. Plant Physiology and TAIR: enforcing feedback on GO • Train users and curators – Workshops at conferences and elsewhere – Enforce institutionalisation within universities (e.g. Stanford Biomedical Informatics; graduate training in UK system biology)
  • 13. The multiple identities of GO • GO needs to be playing several epistemic roles in biology • Classification system • Expert community • Regulatory institution • Exemplifies and regulates epistemic and social relations between virtual (in silico) and material (wet) practices in biology • Despite institutionalisation within biology, still far from having resolved tensions between curator’s vision of what technology can do for science, and user needs and practices • Handling dissent on terms or definitions • Providing sufficient meta-data to assess data provenance • Non-overlapping datasets and checking data quality • Long-term maintenance, strategies for revision and updating (how has GO actually been revised?)
  • 14. Thanks to ESRC for funding and several bio-ontology curators (including the GO team at EBI) for their patience and availability for interviews • (in preparation) On the Role of Theory in Data-Driven Research: The Case of Bio-Ontologies. • (2010) Documenting the Emergence of Bio-Ontologies: Or, Why Researching Bioinformatics Requires HPSSB. History and Philosophy of the Life Sciences. • (2010) Packaging Data for Re-Use: Databases in Model Organism Biology. In Howlett, P and Morgan, MS (eds) How Well Do ‘Facts’ Travel. CUP.  • (2009) On the Locality of Data and Claims About Phenomena. Philosophy of Science 76, 5. • (2009) Centralising Labels to Distribute Data: The Regulatory Role of Genomic Consortia. In Atkinson et al (eds.) Handbook for Genetics and Society: Mapping the New Genomic Era. Routledge, pp. 469-485. • (2008) Bio-Ontologies as Tools for Integration in Biology. Biological Theory 3, 1: 8-11.
  • 15. Abstract This paper reflects on the analytic challenges emerging from the study of bioinformatic tools recently created to store and disseminate biological data, such as databases, repositories and bio-ontologies. I focus my discussion on the Gene Ontology, a term that defines three entities at once: a classification system facilitating the distribution and use of genomic data as evidence towards new insights; an expert community specialised in the curation of those data; and a scientific institution promoting the use of this tool among experimental biologists. These three dimensions of the Gene Ontology can be clearly distinguished analytically, but are tightly intertwined in practice. I suggest that this is true of all bioinformatic tools: they need to be understood simultaneously as epistemic, social and institutional entities, since they shape the knowledge extracted from data and at the same time regulate the organisation, development and communication of research. This viewpoint has one important implication for the methodologies used to study these tools, that is the need to integrate historical, philosophical and sociological approaches. I illustrate this claim through examples of misunderstandings that may result from a narrowly disciplinary study of the Gene Ontology, as I experienced them in my own research.