BioSmalltalk

E
ESUGESUG
BioSmalltalk
Hernán Morales Durand - ESUG 2023 @ Lyon, France
BioSmalltalk is...
... a library for Bioinformatics
...implemented in Pharo
...part of the Open Bioinformatics
Foundation (OBF)
...not intended to be a replacement of
+30000 awesome bioinformatics tools (but
it could save some time).
What is Bioinformatics?
https://www.genome.gov/sites/default/
fi
les/media/
fi
les/2021-11/Sequencing_Cost_Data_Table_Aug2021.xls
BioSmalltalk
GTGCATCTGACTCCTGAGGAGAAG
CACGTAGACTGAGGACTCCTCTTC
GUGCAUCUGACUCCUGAGGAGAAG
V H L T P E E K
DNA
(transcription)
RNA
(translation)
protein
BioSmalltalk: Basic operations
Basic operations with
biological sequences
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
BioSmalltalk: Basic operations
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
DNA
Alphabet
Sequence length
BioSmalltalk: Basic operations
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
DNA
Alphabet
Sequence length
BioSmalltalk: Basic operations
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
transcribe "a BioSequence(10) ([GAUC] IUPAC ->
RNA -> Unambiguous) [ACUGGUGAUA]"
BioSmalltalk: Basic operations
RNA
Alphabet
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
transcribe "a BioSequence(10) ([GAUC] IUPAC ->
RNA -> Unambiguous) [ACUGGUGAUA]"
backTranscribe
BioSmalltalk: Basic operations
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
complement "a BioSequence(10) ([GATCN] IUPAC
DNA) [TGACCACTAT]"
BioSmalltalk: Basic operations
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
reverseComplement "a BioSequence(10) ([GATCN] IUPAC
DNA) [TATCACCAGT]"
BioSmalltalk: Basic operations
'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC
DNA) [ACTGGTGATA]"
translate
"a BioSequence(3)
([ACDEFGHIKLMNPQRSTVWY] IUPAC ->
Protein) [TGD]"
BioSmalltalk: Basic operations
Sequence Utilities
'ACTGGTGATA' asSequence gcContent
BioSmalltalk: Sequence statistics
"40s0"
BioSmalltalk: Sequence statistics
GC Skew plot
'ACTGGTGATA' asSequence gcContent
BioSmalltalk: Sequence statistics
'ACTGGTGATA' asSequence molecularWeightNonDegen
"3146.0499999999997"
'ACTGGTGATA' asSequence gcContent
BioSmalltalk: Sequence statistics
'ACTGGTGATA' asSequence molecularWeightNonDegen
'ACTGGTGATA' asSequence lcc
"an OrderedCollection(-1.4948676426993133
-0.16609640474436815 -1.4948676426993133
-1.4948676426993133)"
'ACTGGTGATA' asSequence gcContent
BioSmalltalk: Sequence statistics
'ACTGGTGATA' asSequence molecularWeightNonDegen
'ACTGGTGATA' asSequence lcc
'ACTGGTGATA' asSequence occurrencesOfLetters
"a Dictionary($A->3
$C->1 $G->3 $T->3 )"
(BioSequence newAmbiguousDNA:'AHT') disambiguate
"#('AAT' 'ATT' 'ACT')"
BioSmalltalk: Sequence utilities
(BioSequence newAmbiguousDNA:'AHT') disambiguate
'ACGTACGTACGT' asSequence kmersCount:'CG'
'ACGTACGTACGT' asSequence longestConsecutive: $A
BioSmalltalk: Sequence utilities
'ACTGGTGATA' asSequence crc32.
'ACTGGTGATA' asSequence gcg.
'ACTGGTGATA' asSequence seguid.
(BioParser parseMultiFastaFile: 'ls_orchid.fasta') plot
550 600 650 700 750 800
0.0
10.0
20.0
30.0
40.0
50.0
Histogram of 94 FASTA sequences
Sequence
Count
Sequence lengths (bp)
BioSmalltalk: Sequence utilities
Sequence Alignment
BioSmalltalk: Sequence alignment
BioAlignment new
addSequence: 'ACTGCTAGCTAG';
addSequence: 'ACT-CTAGCTAG';
addSequence: 'ACTGGTANATGG';
addSequence: 'ACTGATTGCTGG';
addSequence: 'ACTGCTTGATTG';
yourself
BioSmalltalk: Sequence alignment
BioSmalltalk: Sequence alignment
latestBlast := BioBlastWrapper ncbi local latest.
latestBlast nucleotide
query: '555';
hitListSize: 10;
filterLowComplexity;
expectValue: 10;
blastn;
blastPlainService.
Program Query Type DB Type Comparison
blastn Nucleotide Nucleotide Nucleotide-
Nucleotide
blastp Protein Protein Protein-
Protein
tblastn Protein Nucleotide Protein-
Protein
blastx Nucleotide Protein Protein-
Protein
BioSmalltalk: Sequence alignment
aligner := BioMAFFTWrapper new.
aligner
input: 'COVID-19-01.fasta';
addOutputParameter: 'output.aln';
execute
BioSmalltalk: Sequence alignment
aligner := ALNeedlemanWunsch new.
aligner
align: 'AC-AATAGAC'
with: 'ACGAATAGAT'.
https://github.com/hernanmd/needleman-wunsch
Implementation of Needleman-Wunsh algorithm native to Pharo
BioSmalltalk: Plotting alignment pipeline
outputFilename := 'COVID-19-MAFFT-2023-08-24_21-37-49.aln'.
sarsCoV2SequencesUIDs := 'seqIDs.txt' asFileReference lines.
multiFasta := BioParser parseMultiFasta: (
BioEntrezClient new nuccore
uids: sarsCoV2SequencesUIDs;
setFasta;
setModeText;
fetch) result.
multiFastaCompleteGenomes := multiFasta
select: [ : f | f name endsWith: 'complete genome' ].
BioMAFFTWrapper new
auto;
maxiterate: 1000;
input: multiFastaCompleteGenomes;
addOutputParameter: outputFilename;
execute.
(BioParser parseMultiFastaAlignmentFile: outputFilename asFileReference) plot.
BioSmalltalk: Miscellany
• Sequences: Consensus, Repeats, Codon Tables, IUPAC
Alphabets, Features, Records.
• Genome downloads
• Wrappers: PLINK, Cutadapt, MUSCLE, BLAST, CLUSTAL,
STRUCTURE, ShapeIt, HH-Suite, ACANA, AGA, samtools, etc.
• Formatters: FASTA, GenBank, PED, BED, MEGA, Arlequin, etc
• Parsers: GenBank & Entrez XML, ID's, FASTA.
• Databases: NCBI Entrez, REBASE.
BioSmalltalk: Databases
BioSmalltalk: Databases
BioEntrezClient organization listAtCategoryNamed: 'accessing public -
databases' ].
"#(#gds #geo #genome #pmc #genomeprj #nlmcatalog #unigene #homologene
#nucest #peptidome #journals #domains #structure #omia #omim #pubmed
#biosystems #popset #cancerchromosomes #gensat #snp #books #ncbisearch
#gene #pcsubstance #nuccore #protein #cdd #sra #nucgss #proteinclusters
#biosample #taxonomy #unists #probe #mesh #pcassay #gap #pccompound)"
Applications
Evidence of positive selection towards Zebuine haplotypes
in the BoLA region of Brangus cattle
D. E. Goszczynski1†a
C. M. Corbi-Botto1a
, H. M. Durand1
, A. Rogberg-Muñoz1,2,3
,
S. Munilla2,3
, P. Peral-Garcia1
, R. J. C. Cantet2,3
and G. Giovambattista1
1
Facultad de Ciencias Veterinarias, Instituto de Genética Veterinaria (IGEVET) (UNLP-CONICET LA PLATA), La Plata, Buenos Aires, Argentina; 2
Departamento de
Producción, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina; 3
Instituto de Investigaciones en Producción Animal (INPA)
(UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina.
(Received 5 January 2017; Accepted 25 April 2017; First published online 14 July 2017)
The Brangus breed was developed to combine the superior characteristics of both of its founder breeds, Angus and Brahman.
It combines the high adaptability to tropical and subtropical environments, disease resistance, and overall hardiness of Zebu cattle
with the reproductive potential and carcass quality of Angus. It is known that the major histocompatibility complex (MHC, also
known as bovine leucocyte antigen: BoLA), located on chromosome 23, encodes several genes involved in the adaptive immune
response and may be responsible for adaptation to harsh environments. The objective of this work was to evaluate whether
the local breed ancestry percentages in the BoLA locus of a Brangus population diverged from the estimated genome-wide
proportions and to identify signatures of positive selection in this genomic region. For this, 167 animals (100 Brangus, 45 Angus
and 22 Brahman) were genotyped using a high-density single nucleotide polymorphism array. The local ancestry analysis showed
that more than half of the haplotypes (55.0%) shared a Brahman origin. This value was significantly different from the global
genome-wide proportion estimated by cluster analysis (34.7% Brahman), and the proportion expected by pedigree (37.5% Brahman).
The analysis of selection signatures by genetic differentiation ( Fst) and extended haplotype homozygosity-based methods (iHS and Rsb)
revealed 10 and seven candidate regions, respectively. The analysis of the genes located within these candidate regions showed mainly
genes involved in immune response-related pathway, while other genes and pathways were also observed (cell surface signalling
pathways, membrane proteins and ion-binding proteins). Our results suggest that the BoLA region of Brangus cattle may have been
enriched with Brahman haplotypes as a consequence of selection processes to promote adaptation to subtropical environments.
Keywords: Brangus, major histocompatibility complex, selection signatures, BoLA, ancestral haplotypes
Animal (2018), 12:2, pp 215–223 © The Animal Consortium 2017
doi:10.1017/S1751731117001380
animal
Case Report
DNA profile of dog feces as evidence to solve a homicide
L.S. Barrientos a,1,2
, J.A. Crespi a,1,2
, A. Fameli b
, D.M. Posik a,2
, H. Morales a,2
, P. Peral García a,2
,
G. Giovambattista a,⇑
a
IGEVET – Instituto de Genética Veterinaria (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias, UNLP, La Plata, Buenos Aires, Argentina
b
GECOBI – Grupo de Genética y Ecología en Conservación y Biodiversidad, Museo Argentino de Ciencias Naturales ‘‘Bernardino Rivadavia”, Av. Angel Gallardo 470, C1405DJR
Buenos Aires, Argentina
a r t i c l e i n f o
Article history:
Received 31 March 2016
Received in revised form 20 June 2016
Accepted 10 August 2016
Available online 10 August 2016
Keywords:
Forensic sciences
Non-human DNA
Dog
Mitochondrial DNA
Feces
a b s t r a c t
Dog fecal samples were collected at the crime scene and from the shoes of the suspect to see whether
they could be linked. DNA was genotyped using a 145 bp fragment containing a 60 bp hotspot region
of the mitochondrial DNA (mtDNA) control region. Once the species origin was identified, sequences were
aligned with the 23 canine haplotypes defined, showing that evidence and reference had 100% identity
with haplotype 5. The frequency of haplotype 5 and the exclusion power of the reference population were
0.056 and 0.89, respectively. The forensic index showed that it was 20 times more likely that the evidence
belonged to the reference dog than to some other unknown animal. The results support that the mtDNA
hypervariable region 1 (HV1) is a good alternative for typing in trace or degraded casework samples when
the STR panel fails, and demonstrate the utility of domestic animal samples to give additional information
to solve human legal cases.
! 2016 Published by Elsevier Ireland Ltd.
1. Introduction
Non-human DNA analysis in forensic science has seen growth in
recent years. Applications range from investigations of crimes of
humans to cruelty and poaching in animal/wildlife species, where
DNA evidence from animals, plants, bacteria and viruses has been
used in criminal investigations [1].
Animal Forensic Genetics is defined as ‘‘The application of rele-
vant genetic techniques and theory to legal matters, for enforce-
ment issues, concerning animal biological material” [2]. Domestic
close relationship with people, determination of the genetic profile
of pets would provide a valuable forensic tool.
Canine biological materials including hair, feces and saliva can
be found when contact between dogs and humans takes place.
Most of the described collection, sampling, and extraction are used
in medical diagnostic applications [8,9], wildlife population [10,11]
and wildlife illegal traffic studies [12]. Fecal DNA is often degraded
due to environmental factors and continued active deterioration by
the large numbers of bacteria present with the feces. Also, feces
contain many known PCR inhibitors such as bile salts [13]. As fecal
Legal Medicine 22 (2016) 54–57
Contents lists available at ScienceDirect
Legal Medicine
journal homepage: www.elsevier.com/locate/legalmed
RESEARCH ARTICLE
Runs of homozygosity in a selected cattle
population with extremely inbred bulls:
Descriptive and functional analyses revealed
highly variable patterns
Daniel Goszczynski1
, Antonio Molina2
, Ester Terán1
, Hernán Morales-Durand1
, Pablo Ross3
,
Hao Cheng3
, Guillermo Giovambattista1,2,3,4
, Sebastián Demyda-Peyrás1,2,3,4
*
1 IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA),
Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina, 2 Departamento de Genética, Universidad de
Córdoba, Córdoba, España, 3 Department of Animal Science, University of California, Davis, Davis,
California, United States of America, 4 Departamento de Producción Animal, Facultad de Ciencias
Veterinarias, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
* sdemyda@igevet.gob.ar
Abstract
The analysis of runs of homozygosity (ROH), using high throughput genomic data, has
become a valuable and frequently used methodology to characterize the genomic and
inbreeding variation of livestock and wildlife animal populations. However, this methodology
has been scarcely used in highly inbred domestic animals. Here, we analyzed and charac-
terized the occurrence of ROH fragments in highly inbred (HI; average pedigree-based
inbreeding coefficient FPED = 0.164; 0.103 to 0.306) and outbred Retinta bulls (LI; average
FPED = 0.008; 0 to 0.025). We studied the length of the fragments, their abundance, and
genome distribution using high-density microarray data. The number of ROH was signifi-
cantly higher in the HI group, especially for long fragments (>8Mb). In the LI group, the num-
ber of ROH continuously decreased with fragment length. Genome-wide distribution of
ROH was highly variable between samples. Some chromosomes presented a larger num-
ber of fragments (BTA1, BTA19, BTA29), others had longer fragments (BTA4, BTA12,
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Goszczynski D, Molina A, Terán E,
Morales-Durand H, Ross P, Cheng H, et al. (2018)
Runs of homozygosity in a selected cattle
population with extremely inbred bulls: Descriptive
and functional analyses revealed highly variable
patterns. PLoS ONE 13(7): e0200069. https://doi.
org/10.1371/journal.pone.0200069
Editor: Arda Yildirim, Gaziosmanpasa University,
TURKEY
Received: September 18, 2017
Accepted: June 19, 2018
Published: July 9, 2018
Vol. 29 no. 18 2013, pages 2355–2356
BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btt398
Sequence analysis Advance Access publication July 9, 2013
BioSmalltalk: a pure object system and library for bioinformatics
Hernán F. Morales* and Guillermo Giovambattista
Instituto de Genética Veterinaria (IGEVET), CONICET La Plata–Facultad de Ciencias Veterinarias, Universidad Nacional
de La Plata, La Plata B1900AVW, CC 296 Argentina
Associate Editor: Janet Kelso
ABSTRACT
Summary: We have developed BioSmalltalk, a new environment
system for pure object-oriented bioinformatics programming.
Adaptive end-user programming systems tend to become more im-
portant for discovering biological knowledge, as is demonstrated by
the emergence of open-source programming toolkits for bioinfor-
matics in the past years. Our software is intended to bridge the gap
between bioscientists and rapid software prototyping while preserving
the possibility of scaling to whole-system biology applications.
BioSmalltalk performs better in terms of execution time and memory
usage than Biopython and BioPerl for some classical situations.
Availability: BioSmalltalk is cross-platform and freely available (MIT
license) through the Google Project Hosting at http://code.google.
com/p/biosmalltalk
Contact: hernan.morales@gmail.com
Supplementary information: Supplementary data are available at
Bioinformatics online.
Received on January 12, 2013; revised on June 5, 2013; accepted on
July 3, 2013
1 INTRODUCTION
We present a novel free/open source software (FOSS) platform
for the development of bioinformatics software and applications.
BioSmalltalk attempts to reconcile the current de facto scripting
modalities of textual programming languages with the features of
Smalltalk (Goldberg and Robson, 1983), which has a pure object
dynamic programming environment.
BioSmalltalk provides similar functionality to other FOSS
toolkits for bioinformatics, such as BioPerl (Stajich et al.,
2002), Biopython (Cock et al., 2009) and BioJava (Holland
et al., 2008), based in industry-leading general-purpose textual
programming languages, and toolkits, including the Bio* pro-
jects. The Bio* toolkits’ usage of OO is commonly hybrid or
emulated through modules (Cock et al., 2009; Stajich et al.,
2002), mixing objects with primitive data types and hampering
the use of reflective functionalities (Maes, 1977). BioSmalltalk
benefits from decreased source code verbosity, and its execution
in a self-contained snapshot system that promotes run-time
adaptability, critical for systems where shutdown cycles cannot
be tolerated (Hirschfeld and Lämmel, 2005).
2 FEATURES
2.1 Bioinformatics
BioSmalltalk provides objects to manipulate biological se-
quences and data from databases like the Entrez system
(Schuler et al., 1996). It also contains wrappers for command-
line tools like ClustalW (Thompson, 1994) and HMMER (Finn,
2011) sequence visualization and format conversion.
We based implementation on existing FOSS bioinformatics
platforms, specifically BioPerl and Biopython, to prevent educa-
tional obsolescence, preserving the familiar object model inter-
faces for experienced bioinformaticians.
BioSmalltalk contains tokenizers, parsers and formatters for
common sequence identifiers, FASTA, BLAST and Entrez
XML, PHYLIP (Felsenstein, 1989), Arlequin (Excoffier, 2005)
and others. Most parsers use PetitParser (Renggli et al., 2010), a
dynamically reconfigurable parser library. Additional features
can be found in the project documentation. We did a microbe-
nchmark to compare the performance of our library using
the script in Figure 1. We have executed the scripts five times
immediately after booting without unnecessary processes (Tests
were performed on GNU/Linux Debian kernel 2.6.32-
Downloaded
from
https://academic.oup.com/bioinformatics/article/29/18/2355/240534
by
INRIA
user
on
https://github.com/hernanmd/BioSmalltalk
Thank you
1 de 41

Recomendados

20081216 06陳倩琪 紅麴菌基因體之定序與分析 por
20081216 06陳倩琪 紅麴菌基因體之定序與分析20081216 06陳倩琪 紅麴菌基因體之定序與分析
20081216 06陳倩琪 紅麴菌基因體之定序與分析Monascus2008
1.7K visualizações50 slides
Genome responses of trypanosome infected cattle por
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleLaurence Dawkins-Hall
58 visualizações1 slide
1st_Seminar_PHD.pptx por
1st_Seminar_PHD.pptx1st_Seminar_PHD.pptx
1st_Seminar_PHD.pptxAnanya Sinha
22 visualizações73 slides
Gmr2301 Breeding Transgenic Cattle For Human Therapeutics por
Gmr2301 Breeding Transgenic Cattle For Human Therapeutics Gmr2301 Breeding Transgenic Cattle For Human Therapeutics
Gmr2301 Breeding Transgenic Cattle For Human Therapeutics Avi Dey
214 visualizações14 slides
A systematic, data driven approach to the combined analysis of microarray and... por
A systematic, data driven approach to the combined analysis of microarray and...A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...Laurence Dawkins-Hall
63 visualizações7 slides
Genomic surveillance of Rift Valley fever virus por
Genomic surveillance of Rift Valley fever virusGenomic surveillance of Rift Valley fever virus
Genomic surveillance of Rift Valley fever virusILRI
11 visualizações1 slide

Mais conteúdo relacionado

Similar a BioSmalltalk

2013_WCBSURC.pptx por
2013_WCBSURC.pptx2013_WCBSURC.pptx
2013_WCBSURC.pptxJan Clement Santiago
121 visualizações1 slide
final AEGIS report por
final AEGIS reportfinal AEGIS report
final AEGIS reportElise Mason
68 visualizações15 slides
Lysosomal Porage Diseases Case Study por
Lysosomal Porage Diseases Case StudyLysosomal Porage Diseases Case Study
Lysosomal Porage Diseases Case StudyRachelle Lewis
2 visualizações40 slides
Functional annotation por
Functional annotationFunctional annotation
Functional annotationRavi Gandham
867 visualizações24 slides
Hepatitis C por
Hepatitis CHepatitis C
Hepatitis Cmafsana
466 visualizações15 slides
Isolation of microsatellites Channa por
Isolation of microsatellites ChannaIsolation of microsatellites Channa
Isolation of microsatellites ChannaMin Pau Tan
233 visualizações3 slides

Similar a BioSmalltalk(20)

final AEGIS report por Elise Mason
final AEGIS reportfinal AEGIS report
final AEGIS report
Elise Mason68 visualizações
Lysosomal Porage Diseases Case Study por Rachelle Lewis
Lysosomal Porage Diseases Case StudyLysosomal Porage Diseases Case Study
Lysosomal Porage Diseases Case Study
Rachelle Lewis2 visualizações
Functional annotation por Ravi Gandham
Functional annotationFunctional annotation
Functional annotation
Ravi Gandham867 visualizações
Hepatitis C por mafsana
Hepatitis CHepatitis C
Hepatitis C
mafsana466 visualizações
Isolation of microsatellites Channa por Min Pau Tan
Isolation of microsatellites ChannaIsolation of microsatellites Channa
Isolation of microsatellites Channa
Min Pau Tan233 visualizações
pMINERVA Cloning System por Michael Weiner
pMINERVA Cloning SystempMINERVA Cloning System
pMINERVA Cloning System
Michael Weiner264 visualizações
Tyler future of genomics thurs 0920 por Sucheta Tripathy
Tyler future of genomics thurs 0920Tyler future of genomics thurs 0920
Tyler future of genomics thurs 0920
Sucheta Tripathy228 visualizações
Cassava at CIAT por CIAT
Cassava at CIATCassava at CIAT
Cassava at CIAT
CIAT1.4K visualizações
Genovesio et al j biomol screen 2011-genovesio-1087057111415521 por Neil Emans, Ph.D
Genovesio et al j biomol screen 2011-genovesio-1087057111415521Genovesio et al j biomol screen 2011-genovesio-1087057111415521
Genovesio et al j biomol screen 2011-genovesio-1087057111415521
Neil Emans, Ph.D386 visualizações
Multigene engineering in plants por Senthil Natesan
Multigene engineering in plantsMultigene engineering in plants
Multigene engineering in plants
Senthil Natesan2.9K visualizações
CrossGen-Merck manuscript por Kush Sharma
CrossGen-Merck manuscriptCrossGen-Merck manuscript
CrossGen-Merck manuscript
Kush Sharma103 visualizações
Jan2016 pac bio giab por GenomeInABottle
Jan2016 pac bio giabJan2016 pac bio giab
Jan2016 pac bio giab
GenomeInABottle2.5K visualizações
CISGENICS por Dr Anjani Kumar
CISGENICSCISGENICS
CISGENICS
Dr Anjani Kumar701 visualizações
Functional annotation of invertebrate genomes por Surya Saha
Functional annotation of invertebrate genomesFunctional annotation of invertebrate genomes
Functional annotation of invertebrate genomes
Surya Saha331 visualizações
Bioinformatics.Practical Notebook por Naima Tahsin
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical Notebook
Naima Tahsin4.5K visualizações
Eradicating diseases (genome) por Utkarsh Verma
Eradicating diseases (genome)Eradicating diseases (genome)
Eradicating diseases (genome)
Utkarsh Verma926 visualizações
Annotated Bibliography On The Landscape Of Transcription por Lori Bowie
Annotated Bibliography On The Landscape Of TranscriptionAnnotated Bibliography On The Landscape Of Transcription
Annotated Bibliography On The Landscape Of Transcription
Lori Bowie2 visualizações
Recurrent Incidence Of Babesiosis Among Adults Essay por Nicole Savoie
Recurrent Incidence Of Babesiosis Among Adults EssayRecurrent Incidence Of Babesiosis Among Adults Essay
Recurrent Incidence Of Babesiosis Among Adults Essay
Nicole Savoie3 visualizações

Mais de ESUG

Workshop: Identifying concept inventories in agile programming por
Workshop: Identifying concept inventories in agile programmingWorkshop: Identifying concept inventories in agile programming
Workshop: Identifying concept inventories in agile programmingESUG
9 visualizações16 slides
Technical documentation support in Pharo por
Technical documentation support in PharoTechnical documentation support in Pharo
Technical documentation support in PharoESUG
27 visualizações39 slides
The Pharo Debugger and Debugging tools: Advances and Roadmap por
The Pharo Debugger and Debugging tools: Advances and RoadmapThe Pharo Debugger and Debugging tools: Advances and Roadmap
The Pharo Debugger and Debugging tools: Advances and RoadmapESUG
56 visualizações44 slides
Sequence: Pipeline modelling in Pharo por
Sequence: Pipeline modelling in PharoSequence: Pipeline modelling in Pharo
Sequence: Pipeline modelling in PharoESUG
86 visualizações22 slides
Migration process from monolithic to micro frontend architecture in mobile ap... por
Migration process from monolithic to micro frontend architecture in mobile ap...Migration process from monolithic to micro frontend architecture in mobile ap...
Migration process from monolithic to micro frontend architecture in mobile ap...ESUG
19 visualizações35 slides
Analyzing Dart Language with Pharo: Report and early results por
Analyzing Dart Language with Pharo: Report and early resultsAnalyzing Dart Language with Pharo: Report and early results
Analyzing Dart Language with Pharo: Report and early resultsESUG
106 visualizações30 slides

Mais de ESUG(20)

Workshop: Identifying concept inventories in agile programming por ESUG
Workshop: Identifying concept inventories in agile programmingWorkshop: Identifying concept inventories in agile programming
Workshop: Identifying concept inventories in agile programming
ESUG9 visualizações
Technical documentation support in Pharo por ESUG
Technical documentation support in PharoTechnical documentation support in Pharo
Technical documentation support in Pharo
ESUG27 visualizações
The Pharo Debugger and Debugging tools: Advances and Roadmap por ESUG
The Pharo Debugger and Debugging tools: Advances and RoadmapThe Pharo Debugger and Debugging tools: Advances and Roadmap
The Pharo Debugger and Debugging tools: Advances and Roadmap
ESUG56 visualizações
Sequence: Pipeline modelling in Pharo por ESUG
Sequence: Pipeline modelling in PharoSequence: Pipeline modelling in Pharo
Sequence: Pipeline modelling in Pharo
ESUG86 visualizações
Migration process from monolithic to micro frontend architecture in mobile ap... por ESUG
Migration process from monolithic to micro frontend architecture in mobile ap...Migration process from monolithic to micro frontend architecture in mobile ap...
Migration process from monolithic to micro frontend architecture in mobile ap...
ESUG19 visualizações
Analyzing Dart Language with Pharo: Report and early results por ESUG
Analyzing Dart Language with Pharo: Report and early resultsAnalyzing Dart Language with Pharo: Report and early results
Analyzing Dart Language with Pharo: Report and early results
ESUG106 visualizações
Transpiling Pharo Classes to JS ECMAScript 5 versus ECMAScript 6 por ESUG
Transpiling Pharo Classes to JS ECMAScript 5 versus ECMAScript 6Transpiling Pharo Classes to JS ECMAScript 5 versus ECMAScript 6
Transpiling Pharo Classes to JS ECMAScript 5 versus ECMAScript 6
ESUG37 visualizações
A Unit Test Metamodel for Test Generation por ESUG
A Unit Test Metamodel for Test GenerationA Unit Test Metamodel for Test Generation
A Unit Test Metamodel for Test Generation
ESUG49 visualizações
Creating Unit Tests Using Genetic Programming por ESUG
Creating Unit Tests Using Genetic ProgrammingCreating Unit Tests Using Genetic Programming
Creating Unit Tests Using Genetic Programming
ESUG46 visualizações
Threaded-Execution and CPS Provide Smooth Switching Between Execution Modes por ESUG
Threaded-Execution and CPS Provide Smooth Switching Between Execution ModesThreaded-Execution and CPS Provide Smooth Switching Between Execution Modes
Threaded-Execution and CPS Provide Smooth Switching Between Execution Modes
ESUG52 visualizações
Exploring GitHub Actions through EGAD: An Experience Report por ESUG
Exploring GitHub Actions through EGAD: An Experience ReportExploring GitHub Actions through EGAD: An Experience Report
Exploring GitHub Actions through EGAD: An Experience Report
ESUG17 visualizações
Pharo: a reflective language A first systematic analysis of reflective APIs por ESUG
Pharo: a reflective language A first systematic analysis of reflective APIsPharo: a reflective language A first systematic analysis of reflective APIs
Pharo: a reflective language A first systematic analysis of reflective APIs
ESUG57 visualizações
Garbage Collector Tuning por ESUG
Garbage Collector TuningGarbage Collector Tuning
Garbage Collector Tuning
ESUG20 visualizações
Improving Performance Through Object Lifetime Profiling: the DataFrame Case por ESUG
Improving Performance Through Object Lifetime Profiling: the DataFrame CaseImproving Performance Through Object Lifetime Profiling: the DataFrame Case
Improving Performance Through Object Lifetime Profiling: the DataFrame Case
ESUG43 visualizações
Pharo DataFrame: Past, Present, and Future por ESUG
Pharo DataFrame: Past, Present, and FuturePharo DataFrame: Past, Present, and Future
Pharo DataFrame: Past, Present, and Future
ESUG43 visualizações
thisContext in the Debugger por ESUG
thisContext in the DebuggerthisContext in the Debugger
thisContext in the Debugger
ESUG36 visualizações
Websockets for Fencing Score por ESUG
Websockets for Fencing ScoreWebsockets for Fencing Score
Websockets for Fencing Score
ESUG18 visualizações
ShowUs: PharoJS.org Develop in Pharo, Run on JavaScript por ESUG
ShowUs: PharoJS.org Develop in Pharo, Run on JavaScriptShowUs: PharoJS.org Develop in Pharo, Run on JavaScript
ShowUs: PharoJS.org Develop in Pharo, Run on JavaScript
ESUG46 visualizações
Advanced Object- Oriented Design Mooc por ESUG
Advanced Object- Oriented Design MoocAdvanced Object- Oriented Design Mooc
Advanced Object- Oriented Design Mooc
ESUG85 visualizações
A New Architecture Reconciling Refactorings and Transformations por ESUG
A New Architecture Reconciling Refactorings and TransformationsA New Architecture Reconciling Refactorings and Transformations
A New Architecture Reconciling Refactorings and Transformations
ESUG28 visualizações

Último

BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports por
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsBushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsRa'Fat Al-Msie'deen
5 visualizações49 slides
LAVADORA ROLO.docx por
LAVADORA ROLO.docxLAVADORA ROLO.docx
LAVADORA ROLO.docxSamuelRamirez83524
7 visualizações1 slide
A first look at MariaDB 11.x features and ideas on how to use them por
A first look at MariaDB 11.x features and ideas on how to use themA first look at MariaDB 11.x features and ideas on how to use them
A first look at MariaDB 11.x features and ideas on how to use themFederico Razzoli
45 visualizações36 slides
DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ... por
DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ...DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ...
DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ...Deltares
9 visualizações32 slides
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon por
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - AfternoonDSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - AfternoonDeltares
13 visualizações43 slides
Advanced API Mocking Techniques por
Advanced API Mocking TechniquesAdvanced API Mocking Techniques
Advanced API Mocking TechniquesDimpy Adhikary
19 visualizações11 slides

Último(20)

BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports por Ra'Fat Al-Msie'deen
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug ReportsBushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
BushraDBR: An Automatic Approach to Retrieving Duplicate Bug Reports
Ra'Fat Al-Msie'deen5 visualizações
A first look at MariaDB 11.x features and ideas on how to use them por Federico Razzoli
A first look at MariaDB 11.x features and ideas on how to use themA first look at MariaDB 11.x features and ideas on how to use them
A first look at MariaDB 11.x features and ideas on how to use them
Federico Razzoli45 visualizações
DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ... por Deltares
DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ...DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ...
DSD-INT 2023 Wave-Current Interaction at Montrose Tidal Inlet System and Its ...
Deltares9 visualizações
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon por Deltares
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - AfternoonDSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon
Deltares13 visualizações
Advanced API Mocking Techniques por Dimpy Adhikary
Advanced API Mocking TechniquesAdvanced API Mocking Techniques
Advanced API Mocking Techniques
Dimpy Adhikary19 visualizações
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker por Deltares
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - ParkerDSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
DSD-INT 2023 SFINCS Modelling in the U.S. Pacific Northwest - Parker
Deltares9 visualizações
Cycleops - Automate deployments on top of bare metal.pptx por Thanassis Parathyras
Cycleops - Automate deployments on top of bare metal.pptxCycleops - Automate deployments on top of bare metal.pptx
Cycleops - Automate deployments on top of bare metal.pptx
Thanassis Parathyras30 visualizações
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra... por Marc Müller
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra....NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
Marc Müller38 visualizações
Winter '24 Release Chat.pdf por melbourneauuser
Winter '24 Release Chat.pdfWinter '24 Release Chat.pdf
Winter '24 Release Chat.pdf
melbourneauuser9 visualizações
El Arte de lo Possible por Neo4j
El Arte de lo PossibleEl Arte de lo Possible
El Arte de lo Possible
Neo4j38 visualizações
ict act 1.pptx por sanjaniarun08
ict act 1.pptxict act 1.pptx
ict act 1.pptx
sanjaniarun0813 visualizações
DSD-INT 2023 Dam break simulation in Derna (Libya) using HydroMT_SFINCS - Prida por Deltares
DSD-INT 2023 Dam break simulation in Derna (Libya) using HydroMT_SFINCS - PridaDSD-INT 2023 Dam break simulation in Derna (Libya) using HydroMT_SFINCS - Prida
DSD-INT 2023 Dam break simulation in Derna (Libya) using HydroMT_SFINCS - Prida
Deltares18 visualizações
HarshithAkkapelli_Presentation.pdf por harshithakkapelli
HarshithAkkapelli_Presentation.pdfHarshithAkkapelli_Presentation.pdf
HarshithAkkapelli_Presentation.pdf
harshithakkapelli11 visualizações
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t... por Deltares
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...
DSD-INT 2023 Thermobaricity in 3D DCSM-FM - taking pressure into account in t...
Deltares9 visualizações
Software evolution understanding: Automatic extraction of software identifier... por Ra'Fat Al-Msie'deen
Software evolution understanding: Automatic extraction of software identifier...Software evolution understanding: Automatic extraction of software identifier...
Software evolution understanding: Automatic extraction of software identifier...
Ra'Fat Al-Msie'deen7 visualizações
What Can Employee Monitoring Software Do?​ por wAnywhere
What Can Employee Monitoring Software Do?​What Can Employee Monitoring Software Do?​
What Can Employee Monitoring Software Do?​
wAnywhere21 visualizações
SUGCON ANZ Presentation V2.1 Final.pptx por Jack Spektor
SUGCON ANZ Presentation V2.1 Final.pptxSUGCON ANZ Presentation V2.1 Final.pptx
SUGCON ANZ Presentation V2.1 Final.pptx
Jack Spektor22 visualizações
Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI... por Marc Müller
Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI...Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI...
Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI...
Marc Müller36 visualizações
Software testing company in India.pptx por SakshiPatel82
Software testing company in India.pptxSoftware testing company in India.pptx
Software testing company in India.pptx
SakshiPatel827 visualizações

BioSmalltalk

  • 1. BioSmalltalk Hernán Morales Durand - ESUG 2023 @ Lyon, France
  • 2. BioSmalltalk is... ... a library for Bioinformatics ...implemented in Pharo ...part of the Open Bioinformatics Foundation (OBF) ...not intended to be a replacement of +30000 awesome bioinformatics tools (but it could save some time).
  • 6. GTGCATCTGACTCCTGAGGAGAAG CACGTAGACTGAGGACTCCTCTTC GUGCAUCUGACUCCUGAGGAGAAG V H L T P E E K DNA (transcription) RNA (translation) protein BioSmalltalk: Basic operations
  • 8. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" BioSmalltalk: Basic operations
  • 9. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" DNA Alphabet Sequence length BioSmalltalk: Basic operations
  • 10. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" DNA Alphabet Sequence length BioSmalltalk: Basic operations
  • 11. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" transcribe "a BioSequence(10) ([GAUC] IUPAC -> RNA -> Unambiguous) [ACUGGUGAUA]" BioSmalltalk: Basic operations RNA Alphabet
  • 12. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" transcribe "a BioSequence(10) ([GAUC] IUPAC -> RNA -> Unambiguous) [ACUGGUGAUA]" backTranscribe BioSmalltalk: Basic operations
  • 13. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" complement "a BioSequence(10) ([GATCN] IUPAC DNA) [TGACCACTAT]" BioSmalltalk: Basic operations
  • 14. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" reverseComplement "a BioSequence(10) ([GATCN] IUPAC DNA) [TATCACCAGT]" BioSmalltalk: Basic operations
  • 15. 'ACTGGTGATA' asSequence "a BioSequence(10) ([GATCN] IUPAC DNA) [ACTGGTGATA]" translate "a BioSequence(3) ([ACDEFGHIKLMNPQRSTVWY] IUPAC -> Protein) [TGD]" BioSmalltalk: Basic operations
  • 19. 'ACTGGTGATA' asSequence gcContent BioSmalltalk: Sequence statistics 'ACTGGTGATA' asSequence molecularWeightNonDegen "3146.0499999999997"
  • 20. 'ACTGGTGATA' asSequence gcContent BioSmalltalk: Sequence statistics 'ACTGGTGATA' asSequence molecularWeightNonDegen 'ACTGGTGATA' asSequence lcc "an OrderedCollection(-1.4948676426993133 -0.16609640474436815 -1.4948676426993133 -1.4948676426993133)"
  • 21. 'ACTGGTGATA' asSequence gcContent BioSmalltalk: Sequence statistics 'ACTGGTGATA' asSequence molecularWeightNonDegen 'ACTGGTGATA' asSequence lcc 'ACTGGTGATA' asSequence occurrencesOfLetters "a Dictionary($A->3 $C->1 $G->3 $T->3 )"
  • 22. (BioSequence newAmbiguousDNA:'AHT') disambiguate "#('AAT' 'ATT' 'ACT')" BioSmalltalk: Sequence utilities
  • 23. (BioSequence newAmbiguousDNA:'AHT') disambiguate 'ACGTACGTACGT' asSequence kmersCount:'CG' 'ACGTACGTACGT' asSequence longestConsecutive: $A BioSmalltalk: Sequence utilities 'ACTGGTGATA' asSequence crc32. 'ACTGGTGATA' asSequence gcg. 'ACTGGTGATA' asSequence seguid.
  • 24. (BioParser parseMultiFastaFile: 'ls_orchid.fasta') plot 550 600 650 700 750 800 0.0 10.0 20.0 30.0 40.0 50.0 Histogram of 94 FASTA sequences Sequence Count Sequence lengths (bp) BioSmalltalk: Sequence utilities
  • 27. BioAlignment new addSequence: 'ACTGCTAGCTAG'; addSequence: 'ACT-CTAGCTAG'; addSequence: 'ACTGGTANATGG'; addSequence: 'ACTGATTGCTGG'; addSequence: 'ACTGCTTGATTG'; yourself BioSmalltalk: Sequence alignment
  • 28. BioSmalltalk: Sequence alignment latestBlast := BioBlastWrapper ncbi local latest. latestBlast nucleotide query: '555'; hitListSize: 10; filterLowComplexity; expectValue: 10; blastn; blastPlainService. Program Query Type DB Type Comparison blastn Nucleotide Nucleotide Nucleotide- Nucleotide blastp Protein Protein Protein- Protein tblastn Protein Nucleotide Protein- Protein blastx Nucleotide Protein Protein- Protein
  • 29. BioSmalltalk: Sequence alignment aligner := BioMAFFTWrapper new. aligner input: 'COVID-19-01.fasta'; addOutputParameter: 'output.aln'; execute
  • 30. BioSmalltalk: Sequence alignment aligner := ALNeedlemanWunsch new. aligner align: 'AC-AATAGAC' with: 'ACGAATAGAT'. https://github.com/hernanmd/needleman-wunsch Implementation of Needleman-Wunsh algorithm native to Pharo
  • 31. BioSmalltalk: Plotting alignment pipeline outputFilename := 'COVID-19-MAFFT-2023-08-24_21-37-49.aln'. sarsCoV2SequencesUIDs := 'seqIDs.txt' asFileReference lines. multiFasta := BioParser parseMultiFasta: ( BioEntrezClient new nuccore uids: sarsCoV2SequencesUIDs; setFasta; setModeText; fetch) result. multiFastaCompleteGenomes := multiFasta select: [ : f | f name endsWith: 'complete genome' ]. BioMAFFTWrapper new auto; maxiterate: 1000; input: multiFastaCompleteGenomes; addOutputParameter: outputFilename; execute. (BioParser parseMultiFastaAlignmentFile: outputFilename asFileReference) plot.
  • 32. BioSmalltalk: Miscellany • Sequences: Consensus, Repeats, Codon Tables, IUPAC Alphabets, Features, Records. • Genome downloads • Wrappers: PLINK, Cutadapt, MUSCLE, BLAST, CLUSTAL, STRUCTURE, ShapeIt, HH-Suite, ACANA, AGA, samtools, etc. • Formatters: FASTA, GenBank, PED, BED, MEGA, Arlequin, etc • Parsers: GenBank & Entrez XML, ID's, FASTA. • Databases: NCBI Entrez, REBASE.
  • 34. BioSmalltalk: Databases BioEntrezClient organization listAtCategoryNamed: 'accessing public - databases' ]. "#(#gds #geo #genome #pmc #genomeprj #nlmcatalog #unigene #homologene #nucest #peptidome #journals #domains #structure #omia #omim #pubmed #biosystems #popset #cancerchromosomes #gensat #snp #books #ncbisearch #gene #pcsubstance #nuccore #protein #cdd #sra #nucgss #proteinclusters #biosample #taxonomy #unists #probe #mesh #pcassay #gap #pccompound)"
  • 36. Evidence of positive selection towards Zebuine haplotypes in the BoLA region of Brangus cattle D. E. Goszczynski1†a C. M. Corbi-Botto1a , H. M. Durand1 , A. Rogberg-Muñoz1,2,3 , S. Munilla2,3 , P. Peral-Garcia1 , R. J. C. Cantet2,3 and G. Giovambattista1 1 Facultad de Ciencias Veterinarias, Instituto de Genética Veterinaria (IGEVET) (UNLP-CONICET LA PLATA), La Plata, Buenos Aires, Argentina; 2 Departamento de Producción, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina; 3 Instituto de Investigaciones en Producción Animal (INPA) (UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina. (Received 5 January 2017; Accepted 25 April 2017; First published online 14 July 2017) The Brangus breed was developed to combine the superior characteristics of both of its founder breeds, Angus and Brahman. It combines the high adaptability to tropical and subtropical environments, disease resistance, and overall hardiness of Zebu cattle with the reproductive potential and carcass quality of Angus. It is known that the major histocompatibility complex (MHC, also known as bovine leucocyte antigen: BoLA), located on chromosome 23, encodes several genes involved in the adaptive immune response and may be responsible for adaptation to harsh environments. The objective of this work was to evaluate whether the local breed ancestry percentages in the BoLA locus of a Brangus population diverged from the estimated genome-wide proportions and to identify signatures of positive selection in this genomic region. For this, 167 animals (100 Brangus, 45 Angus and 22 Brahman) were genotyped using a high-density single nucleotide polymorphism array. The local ancestry analysis showed that more than half of the haplotypes (55.0%) shared a Brahman origin. This value was significantly different from the global genome-wide proportion estimated by cluster analysis (34.7% Brahman), and the proportion expected by pedigree (37.5% Brahman). The analysis of selection signatures by genetic differentiation ( Fst) and extended haplotype homozygosity-based methods (iHS and Rsb) revealed 10 and seven candidate regions, respectively. The analysis of the genes located within these candidate regions showed mainly genes involved in immune response-related pathway, while other genes and pathways were also observed (cell surface signalling pathways, membrane proteins and ion-binding proteins). Our results suggest that the BoLA region of Brangus cattle may have been enriched with Brahman haplotypes as a consequence of selection processes to promote adaptation to subtropical environments. Keywords: Brangus, major histocompatibility complex, selection signatures, BoLA, ancestral haplotypes Animal (2018), 12:2, pp 215–223 © The Animal Consortium 2017 doi:10.1017/S1751731117001380 animal
  • 37. Case Report DNA profile of dog feces as evidence to solve a homicide L.S. Barrientos a,1,2 , J.A. Crespi a,1,2 , A. Fameli b , D.M. Posik a,2 , H. Morales a,2 , P. Peral García a,2 , G. Giovambattista a,⇑ a IGEVET – Instituto de Genética Veterinaria (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias, UNLP, La Plata, Buenos Aires, Argentina b GECOBI – Grupo de Genética y Ecología en Conservación y Biodiversidad, Museo Argentino de Ciencias Naturales ‘‘Bernardino Rivadavia”, Av. Angel Gallardo 470, C1405DJR Buenos Aires, Argentina a r t i c l e i n f o Article history: Received 31 March 2016 Received in revised form 20 June 2016 Accepted 10 August 2016 Available online 10 August 2016 Keywords: Forensic sciences Non-human DNA Dog Mitochondrial DNA Feces a b s t r a c t Dog fecal samples were collected at the crime scene and from the shoes of the suspect to see whether they could be linked. DNA was genotyped using a 145 bp fragment containing a 60 bp hotspot region of the mitochondrial DNA (mtDNA) control region. Once the species origin was identified, sequences were aligned with the 23 canine haplotypes defined, showing that evidence and reference had 100% identity with haplotype 5. The frequency of haplotype 5 and the exclusion power of the reference population were 0.056 and 0.89, respectively. The forensic index showed that it was 20 times more likely that the evidence belonged to the reference dog than to some other unknown animal. The results support that the mtDNA hypervariable region 1 (HV1) is a good alternative for typing in trace or degraded casework samples when the STR panel fails, and demonstrate the utility of domestic animal samples to give additional information to solve human legal cases. ! 2016 Published by Elsevier Ireland Ltd. 1. Introduction Non-human DNA analysis in forensic science has seen growth in recent years. Applications range from investigations of crimes of humans to cruelty and poaching in animal/wildlife species, where DNA evidence from animals, plants, bacteria and viruses has been used in criminal investigations [1]. Animal Forensic Genetics is defined as ‘‘The application of rele- vant genetic techniques and theory to legal matters, for enforce- ment issues, concerning animal biological material” [2]. Domestic close relationship with people, determination of the genetic profile of pets would provide a valuable forensic tool. Canine biological materials including hair, feces and saliva can be found when contact between dogs and humans takes place. Most of the described collection, sampling, and extraction are used in medical diagnostic applications [8,9], wildlife population [10,11] and wildlife illegal traffic studies [12]. Fecal DNA is often degraded due to environmental factors and continued active deterioration by the large numbers of bacteria present with the feces. Also, feces contain many known PCR inhibitors such as bile salts [13]. As fecal Legal Medicine 22 (2016) 54–57 Contents lists available at ScienceDirect Legal Medicine journal homepage: www.elsevier.com/locate/legalmed
  • 38. RESEARCH ARTICLE Runs of homozygosity in a selected cattle population with extremely inbred bulls: Descriptive and functional analyses revealed highly variable patterns Daniel Goszczynski1 , Antonio Molina2 , Ester Terán1 , Hernán Morales-Durand1 , Pablo Ross3 , Hao Cheng3 , Guillermo Giovambattista1,2,3,4 , Sebastián Demyda-Peyrás1,2,3,4 * 1 IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina, 2 Departamento de Genética, Universidad de Córdoba, Córdoba, España, 3 Department of Animal Science, University of California, Davis, Davis, California, United States of America, 4 Departamento de Producción Animal, Facultad de Ciencias Veterinarias, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina * sdemyda@igevet.gob.ar Abstract The analysis of runs of homozygosity (ROH), using high throughput genomic data, has become a valuable and frequently used methodology to characterize the genomic and inbreeding variation of livestock and wildlife animal populations. However, this methodology has been scarcely used in highly inbred domestic animals. Here, we analyzed and charac- terized the occurrence of ROH fragments in highly inbred (HI; average pedigree-based inbreeding coefficient FPED = 0.164; 0.103 to 0.306) and outbred Retinta bulls (LI; average FPED = 0.008; 0 to 0.025). We studied the length of the fragments, their abundance, and genome distribution using high-density microarray data. The number of ROH was signifi- cantly higher in the HI group, especially for long fragments (>8Mb). In the LI group, the num- ber of ROH continuously decreased with fragment length. Genome-wide distribution of ROH was highly variable between samples. Some chromosomes presented a larger num- ber of fragments (BTA1, BTA19, BTA29), others had longer fragments (BTA4, BTA12, a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 OPEN ACCESS Citation: Goszczynski D, Molina A, Terán E, Morales-Durand H, Ross P, Cheng H, et al. (2018) Runs of homozygosity in a selected cattle population with extremely inbred bulls: Descriptive and functional analyses revealed highly variable patterns. PLoS ONE 13(7): e0200069. https://doi. org/10.1371/journal.pone.0200069 Editor: Arda Yildirim, Gaziosmanpasa University, TURKEY Received: September 18, 2017 Accepted: June 19, 2018 Published: July 9, 2018
  • 39. Vol. 29 no. 18 2013, pages 2355–2356 BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btt398 Sequence analysis Advance Access publication July 9, 2013 BioSmalltalk: a pure object system and library for bioinformatics Hernán F. Morales* and Guillermo Giovambattista Instituto de Genética Veterinaria (IGEVET), CONICET La Plata–Facultad de Ciencias Veterinarias, Universidad Nacional de La Plata, La Plata B1900AVW, CC 296 Argentina Associate Editor: Janet Kelso ABSTRACT Summary: We have developed BioSmalltalk, a new environment system for pure object-oriented bioinformatics programming. Adaptive end-user programming systems tend to become more im- portant for discovering biological knowledge, as is demonstrated by the emergence of open-source programming toolkits for bioinfor- matics in the past years. Our software is intended to bridge the gap between bioscientists and rapid software prototyping while preserving the possibility of scaling to whole-system biology applications. BioSmalltalk performs better in terms of execution time and memory usage than Biopython and BioPerl for some classical situations. Availability: BioSmalltalk is cross-platform and freely available (MIT license) through the Google Project Hosting at http://code.google. com/p/biosmalltalk Contact: hernan.morales@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. Received on January 12, 2013; revised on June 5, 2013; accepted on July 3, 2013 1 INTRODUCTION We present a novel free/open source software (FOSS) platform for the development of bioinformatics software and applications. BioSmalltalk attempts to reconcile the current de facto scripting modalities of textual programming languages with the features of Smalltalk (Goldberg and Robson, 1983), which has a pure object dynamic programming environment. BioSmalltalk provides similar functionality to other FOSS toolkits for bioinformatics, such as BioPerl (Stajich et al., 2002), Biopython (Cock et al., 2009) and BioJava (Holland et al., 2008), based in industry-leading general-purpose textual programming languages, and toolkits, including the Bio* pro- jects. The Bio* toolkits’ usage of OO is commonly hybrid or emulated through modules (Cock et al., 2009; Stajich et al., 2002), mixing objects with primitive data types and hampering the use of reflective functionalities (Maes, 1977). BioSmalltalk benefits from decreased source code verbosity, and its execution in a self-contained snapshot system that promotes run-time adaptability, critical for systems where shutdown cycles cannot be tolerated (Hirschfeld and Lämmel, 2005). 2 FEATURES 2.1 Bioinformatics BioSmalltalk provides objects to manipulate biological se- quences and data from databases like the Entrez system (Schuler et al., 1996). It also contains wrappers for command- line tools like ClustalW (Thompson, 1994) and HMMER (Finn, 2011) sequence visualization and format conversion. We based implementation on existing FOSS bioinformatics platforms, specifically BioPerl and Biopython, to prevent educa- tional obsolescence, preserving the familiar object model inter- faces for experienced bioinformaticians. BioSmalltalk contains tokenizers, parsers and formatters for common sequence identifiers, FASTA, BLAST and Entrez XML, PHYLIP (Felsenstein, 1989), Arlequin (Excoffier, 2005) and others. Most parsers use PetitParser (Renggli et al., 2010), a dynamically reconfigurable parser library. Additional features can be found in the project documentation. We did a microbe- nchmark to compare the performance of our library using the script in Figure 1. We have executed the scripts five times immediately after booting without unnecessary processes (Tests were performed on GNU/Linux Debian kernel 2.6.32- Downloaded from https://academic.oup.com/bioinformatics/article/29/18/2355/240534 by INRIA user on