SlideShare uma empresa Scribd logo
1 de 12
Baixar para ler offline
Debian Med in 2011




            Steffen Möller
                for the
       Debian Med Community



Bioinformatics Open Source Conference
              Vienna, 2011
The Challenge – a technical view
●   Increasing specialisation of tools and databases
    ●   Access to larger number of resources
    ●   Frequent external updates
●   Increase in local administrative work
    ●   Project-specific installations
    ●   Differential or Meta-analyses
●   Increasing platform diversity
    ●   Local vs cloud
    ●   Local vs mobile
The Challenge – a community
                    perspective
●   Work sharing
    ●   Somebody somewhere has installed what you are about to install
    ●   Packaging is tremendously helpful even when you do it just for yourself
    ●   The distribution of such packages allows other researchers profit from your effort
●   Skill sharing
    ●   Not everyone knows how to install everything correctly
    ●   Tutorials / Mailing list discussions shall refer to identically installed packages
●   Collaborative Outreach
    ●   Computational biologists get instant access to software packages
        –   Working out of the box
        –   Compiling out of the box
    ●   Debian Med has professional non-scientists sending patches to the researchers
Science and the role of a Linux
                 Distribution
●   Publication of findings
    ●   World-wide distribution of methods
    ●   Authorships and publications are forwarded in package
        descriptions
●   Collaboration
    ●   Software packages “meet” in the distribution
    ●   Availability of source code as invitation to contribute
●   Education
    ●   Reach out to students at every level
    ●   No black boxes
“What is special about Debian (Med)?”
 ●   We are not special
     ●   active support of downstream distributions, e.g. Ubuntu
     ●   the converse
 ●   Debian can be you!
     ●   open to everyone
     ●   training “on the job” to get your packages in
     ●   finding volunteers to maintain packages for you
 ●   Med Community support
     ●   shared package maintenance via subversion / git
     ●   portal to bug reports, biological packages
     ●   “ontology-like” tagging of packages
What happened since BOSC 2010
●   Many new packages
        NGS (qiime), Ensembl, Blast+, gbrowse
●   Many updates
        Autodocktools, BALLView, Bio*
●   Many new contributors
●   Many new users
●   Bio Cloud environments using Debian
    ●   GSoC project for “Cluster in Cloud” worked with Torque
    ●   Basic cloud images with Debian/Ubuntu became a commodity
●   Debian Med Sprint on Bioinformatics (January)
    ●   Closed loop for packaging with NERC Bio-Linux
    ●   Close ties with Taverna and Eagle Genomics
Steady increase in new users
●   Showing Debian graphs from popcon.debian.org, for
    Ubuntu multiply by 8
●   An average package is installed more frequently than
    there are participants at BOSC+ISMB together
●   Separate listing of recent installations / recent use

                                                  R/qtl ~700




                                                clustalw ~240


                                                       mafft ~240
Inter-institutional team building
●   Maintenance by active contributor to source code
    ●   true interest in bug reports
    ●   barrier free talking to providers of external libraries
    ●   immediate feedback on incompatibilities
●   Community
    ●   influx of skills with every package supported
    ●   volunteers address the details in packaging work
         –   Translations
         –   Format changes ...
    ●   fluent transition between power user and developer
    ●   appeal to volunteers
         –   Packages can be immediately modified and rebuilt
         –   Students at all levels and non-scientists may contribute
Ongoing development
●   Packaging of classical Java developments
    ●   Taverna's build dependencies still are not all in Debian – not in the right
        version, this means
    ●   similarly for Jalview – Java is difficult because of jar files shipping with
        source code
●   Establishment of complete workflows
    ●   closing gaps on tutorials
    ●   packages missing for
        –   In silico docking
        –   Automated genome assembly and annotation
●   Data management
    ●   BioMaj – very nice GUI application
    ●   getData – may be first to link data with Debian packages
“Not even source” packages
●   Some tools like VMD or Rosetta are surprisingly not in Debian, even
    though they allow the download of source code
●   But they don't allow to redistribute anything, which is sad since these
    tools are not straight-forward to compile
       –   Compiler version likely to be newer than expected
       –   Libraries like BOOST may have deprecated a function
       –   Source code may require patches that are only found in discussion forums
●   Debian Med hence hosts only the automated build instructions
●   Ironically, this “not even source” concept was first adopted by CERN,
    where they have Debian packages for their particle analysis tools - in
    reaction to the BOSC abstract
What's next
●   We need those problem solving workflows running
    smoothly
●   Hoping to bring the Medical and Biological parts of
    Debian Med closer together
    ●   Ontologies (nothing in Debian yet, again it's Java) and
        reasoning
    ●   Phenotyping of individuals and respective data
        management
●   Finding more bioinformatics groups already using
    Debian (or its derivatives) to help reducing
    redundancies and work with us all
Visit Debian Med on
http://debian-med.alioth.debian.org
http://wiki.debian.org/DebianMed

or just use Debian, Ubuntu or any of its derived
distributions
    at home or work
    directly, virtually or in the cloud

Mais conteúdo relacionado

Semelhante a G04-Misc-Debianmed

Moeller Debian Bosc2009
Moeller Debian Bosc2009Moeller Debian Bosc2009
Moeller Debian Bosc2009
bosc
 
Debian general presentation
Debian general presentationDebian general presentation
Debian general presentation
Ding Zhou
 
Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...
Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...
Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...
mfrancis
 

Semelhante a G04-Misc-Debianmed (20)

Moeller Debian Bosc2009
Moeller Debian Bosc2009Moeller Debian Bosc2009
Moeller Debian Bosc2009
 
Effectively using Open Source with conda
Effectively using Open Source with condaEffectively using Open Source with conda
Effectively using Open Source with conda
 
Using nu get the way you should svcc
Using nu get the way you should   svccUsing nu get the way you should   svcc
Using nu get the way you should svcc
 
2018 ABRF Tools for improving rigor and reproducibility in bioinformatics
2018 ABRF Tools for improving rigor and reproducibility in bioinformatics2018 ABRF Tools for improving rigor and reproducibility in bioinformatics
2018 ABRF Tools for improving rigor and reproducibility in bioinformatics
 
Organize your chickens: NuGet for the enterprise
Organize your chickens: NuGet for the enterpriseOrganize your chickens: NuGet for the enterprise
Organize your chickens: NuGet for the enterprise
 
O365Con19 - Sharing Code Efficiently in your Organisation - Elio Struyf
O365Con19 - Sharing Code Efficiently in your Organisation - Elio StruyfO365Con19 - Sharing Code Efficiently in your Organisation - Elio Struyf
O365Con19 - Sharing Code Efficiently in your Organisation - Elio Struyf
 
VIB - Very Important Bundles
VIB - Very Important BundlesVIB - Very Important Bundles
VIB - Very Important Bundles
 
Continuous Packaging is also Mandatory for DevOps
Continuous Packaging is also Mandatory for DevOpsContinuous Packaging is also Mandatory for DevOps
Continuous Packaging is also Mandatory for DevOps
 
Advanced Usage of the Debian Packaging System
Advanced Usage of the Debian Packaging SystemAdvanced Usage of the Debian Packaging System
Advanced Usage of the Debian Packaging System
 
Using NuGet the way you should - TechDays NL 2014
Using NuGet the way you should - TechDays NL 2014Using NuGet the way you should - TechDays NL 2014
Using NuGet the way you should - TechDays NL 2014
 
2022.03.23 Conda and Conda environments.pptx
2022.03.23 Conda and Conda environments.pptx2022.03.23 Conda and Conda environments.pptx
2022.03.23 Conda and Conda environments.pptx
 
Debian general presentation
Debian general presentationDebian general presentation
Debian general presentation
 
OpenMRS RegaDB Integration
OpenMRS RegaDB IntegrationOpenMRS RegaDB Integration
OpenMRS RegaDB Integration
 
Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote
 
Mono Repo
Mono RepoMono Repo
Mono Repo
 
Koha vs abcd
Koha vs abcdKoha vs abcd
Koha vs abcd
 
PURL and vers: The Mostly Universal Package URL and Version Ranges Identifier...
PURL and vers: The Mostly Universal Package URL and Version Ranges Identifier...PURL and vers: The Mostly Universal Package URL and Version Ranges Identifier...
PURL and vers: The Mostly Universal Package URL and Version Ranges Identifier...
 
Distributions and package management in the containers era
Distributions and package management in the containers eraDistributions and package management in the containers era
Distributions and package management in the containers era
 
Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...
Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...
Improved developer productivity thanks to Maven and OSGi - Lukasz Dywicki (Co...
 
Alfresco DevCon 2018: SDK 3 Multi Module project using Nexus 3 for releases a...
Alfresco DevCon 2018: SDK 3 Multi Module project using Nexus 3 for releases a...Alfresco DevCon 2018: SDK 3 Multi Module project using Nexus 3 for releases a...
Alfresco DevCon 2018: SDK 3 Multi Module project using Nexus 3 for releases a...
 

Mais de Bioinformatics Open Source Conference

Mais de Bioinformatics Open Source Conference (20)

Running workflows through galaxy bosc presentation
Running workflows through galaxy bosc presentationRunning workflows through galaxy bosc presentation
Running workflows through galaxy bosc presentation
 
Talk1 ben sadi for_gmod_bosc_2011
Talk1 ben sadi for_gmod_bosc_2011Talk1 ben sadi for_gmod_bosc_2011
Talk1 ben sadi for_gmod_bosc_2011
 
Bosc mercer
Bosc mercerBosc mercer
Bosc mercer
 
Mobyle 1 0_new_features_new_types_of_service
Mobyle 1 0_new_features_new_types_of_serviceMobyle 1 0_new_features_new_types_of_service
Mobyle 1 0_new_features_new_types_of_service
 
Bosc2011 arakawa
Bosc2011 arakawaBosc2011 arakawa
Bosc2011 arakawa
 
Bosc2011 isobar-fbp
Bosc2011 isobar-fbpBosc2011 isobar-fbp
Bosc2011 isobar-fbp
 
Talk6 biopython bosc2011
Talk6 biopython bosc2011Talk6 biopython bosc2011
Talk6 biopython bosc2011
 
Unipro ugene bosc 2011 update
Unipro ugene bosc 2011 updateUnipro ugene bosc 2011 update
Unipro ugene bosc 2011 update
 
Bosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-fullBosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-full
 
Bosc talk 7-15-2011x
Bosc talk 7-15-2011xBosc talk 7-15-2011x
Bosc talk 7-15-2011x
 
F02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinuxF02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinux
 
B07-GenomeContent-Biomart
B07-GenomeContent-BiomartB07-GenomeContent-Biomart
B07-GenomeContent-Biomart
 
G03-SemanticWeb-OntoCAT
G03-SemanticWeb-OntoCATG03-SemanticWeb-OntoCAT
G03-SemanticWeb-OntoCAT
 
F06-Cloud-Enabling NGS
F06-Cloud-Enabling NGSF06-Cloud-Enabling NGS
F06-Cloud-Enabling NGS
 
D03-NextGen-Bio-NGS
D03-NextGen-Bio-NGSD03-NextGen-Bio-NGS
D03-NextGen-Bio-NGS
 
F07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAMF07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAM
 
C03-Visualization-Webapollo
C03-Visualization-WebapolloC03-Visualization-Webapollo
C03-Visualization-Webapollo
 
F01-Cloud-Mygene.info
F01-Cloud-Mygene.infoF01-Cloud-Mygene.info
F01-Cloud-Mygene.info
 
A01-Openness in knowledge-based systems
A01-Openness in knowledge-based systemsA01-Openness in knowledge-based systems
A01-Openness in knowledge-based systems
 
F03-Cloud-Obiwee
F03-Cloud-ObiweeF03-Cloud-Obiwee
F03-Cloud-Obiwee
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 

G04-Misc-Debianmed

  • 1. Debian Med in 2011 Steffen Möller for the Debian Med Community Bioinformatics Open Source Conference Vienna, 2011
  • 2. The Challenge – a technical view ● Increasing specialisation of tools and databases ● Access to larger number of resources ● Frequent external updates ● Increase in local administrative work ● Project-specific installations ● Differential or Meta-analyses ● Increasing platform diversity ● Local vs cloud ● Local vs mobile
  • 3. The Challenge – a community perspective ● Work sharing ● Somebody somewhere has installed what you are about to install ● Packaging is tremendously helpful even when you do it just for yourself ● The distribution of such packages allows other researchers profit from your effort ● Skill sharing ● Not everyone knows how to install everything correctly ● Tutorials / Mailing list discussions shall refer to identically installed packages ● Collaborative Outreach ● Computational biologists get instant access to software packages – Working out of the box – Compiling out of the box ● Debian Med has professional non-scientists sending patches to the researchers
  • 4. Science and the role of a Linux Distribution ● Publication of findings ● World-wide distribution of methods ● Authorships and publications are forwarded in package descriptions ● Collaboration ● Software packages “meet” in the distribution ● Availability of source code as invitation to contribute ● Education ● Reach out to students at every level ● No black boxes
  • 5. “What is special about Debian (Med)?” ● We are not special ● active support of downstream distributions, e.g. Ubuntu ● the converse ● Debian can be you! ● open to everyone ● training “on the job” to get your packages in ● finding volunteers to maintain packages for you ● Med Community support ● shared package maintenance via subversion / git ● portal to bug reports, biological packages ● “ontology-like” tagging of packages
  • 6. What happened since BOSC 2010 ● Many new packages NGS (qiime), Ensembl, Blast+, gbrowse ● Many updates Autodocktools, BALLView, Bio* ● Many new contributors ● Many new users ● Bio Cloud environments using Debian ● GSoC project for “Cluster in Cloud” worked with Torque ● Basic cloud images with Debian/Ubuntu became a commodity ● Debian Med Sprint on Bioinformatics (January) ● Closed loop for packaging with NERC Bio-Linux ● Close ties with Taverna and Eagle Genomics
  • 7. Steady increase in new users ● Showing Debian graphs from popcon.debian.org, for Ubuntu multiply by 8 ● An average package is installed more frequently than there are participants at BOSC+ISMB together ● Separate listing of recent installations / recent use R/qtl ~700 clustalw ~240 mafft ~240
  • 8. Inter-institutional team building ● Maintenance by active contributor to source code ● true interest in bug reports ● barrier free talking to providers of external libraries ● immediate feedback on incompatibilities ● Community ● influx of skills with every package supported ● volunteers address the details in packaging work – Translations – Format changes ... ● fluent transition between power user and developer ● appeal to volunteers – Packages can be immediately modified and rebuilt – Students at all levels and non-scientists may contribute
  • 9. Ongoing development ● Packaging of classical Java developments ● Taverna's build dependencies still are not all in Debian – not in the right version, this means ● similarly for Jalview – Java is difficult because of jar files shipping with source code ● Establishment of complete workflows ● closing gaps on tutorials ● packages missing for – In silico docking – Automated genome assembly and annotation ● Data management ● BioMaj – very nice GUI application ● getData – may be first to link data with Debian packages
  • 10. “Not even source” packages ● Some tools like VMD or Rosetta are surprisingly not in Debian, even though they allow the download of source code ● But they don't allow to redistribute anything, which is sad since these tools are not straight-forward to compile – Compiler version likely to be newer than expected – Libraries like BOOST may have deprecated a function – Source code may require patches that are only found in discussion forums ● Debian Med hence hosts only the automated build instructions ● Ironically, this “not even source” concept was first adopted by CERN, where they have Debian packages for their particle analysis tools - in reaction to the BOSC abstract
  • 11. What's next ● We need those problem solving workflows running smoothly ● Hoping to bring the Medical and Biological parts of Debian Med closer together ● Ontologies (nothing in Debian yet, again it's Java) and reasoning ● Phenotyping of individuals and respective data management ● Finding more bioinformatics groups already using Debian (or its derivatives) to help reducing redundancies and work with us all
  • 12. Visit Debian Med on http://debian-med.alioth.debian.org http://wiki.debian.org/DebianMed or just use Debian, Ubuntu or any of its derived distributions at home or work directly, virtually or in the cloud