SlideShare a Scribd company logo
1 of 26
Download to read offline
Ab Initio Protein Structure
Prediction
ag1805xag1805x
Protein structure prediction
● Protein structure prediction (PSP) is the
prediction of the three-dimensional structure
of a protein from its amino acid sequence
i.e. the prediction of its tertiary structure
from its primary structure.
Protein Structure Prediction: Methods
Similar Protein
Structure
Available
Not
Available
Template Based
Method
ab initio modelling
Threading
ab initio modelling
● ab initio modelling conducts a
conformational search under the guidance of
a designed energy function.
● This procedure usually generates a number of
possible conformations (structure decoys),
and final models are selected from them.
● A successful ab initio modelling depends on
three factors:
➢ an accurate energy function with which the
native structure of a protein corresponds to the
most thermodynamically stable state, compared
to all possible decoy structures;
➢ an efficient search method which can quickly
identify the low-energy states through
conformational search;
➢ selection of native-like models from a pool
of decoy structures.
Energy Functions
● Energy classified into two groups:
➔ Physics-based energy functions
➔ Knowledge-based energy functions
Physics-Based Energy Functions
“In a strictly-defined physics-based ab initio method,
interactions between atoms should be based on
quantum mechanics and the coulomb potential with
only a few fundamental parameters such as the
electron charge and the Planck constant; all atoms
should be described by their atom types where only
the number of electrons is relevant.”
(Hagler et al. 1974; Weiner et al. 1984)
Physics-Based Energy Functions
“In a strictly-defined physics-based ab initio method,
interactions between atoms should be based on
quantum mechanics and the coulomb potential with
only a few fundamental parameters such as the
electron charge and the Planck constant; all atoms
should be described by their atom types where only
the number of electrons is relevant.”
(Hagler et al. 1974; Weiner et al. 1984)
A compromised force field with a large
number of selected atom types is used. In
each atom type, the chemical and physical
properties of the atoms are enough alike
with the parameters calculated from crystal
packing or quantum mechanical theory.
● Well-known examples of such all-atom physics-
based force fields include:
✔ AMBER
✔ CHARMM
✔ OPLS
✔ GROMOS96
● These potentials contain terms associated with
bond lengths, angles, torsion angles, van der
Waals, and electrostatics interactions.
● The major difference between them lies in the
selection of atom types and the interaction
parameters.
Knowledge-Based Energy Function
● Refers to the empirical energy terms derived from the
statistics of the solved structures in deposited PDB.
● Can be divided into two types:
➢ generic and sequence-independent terms such as the
hydrogen bonding and the local backbone stiffness of a
polypeptide chain
➢ amino-acid or protein-sequence dependent terms, e.g. pair
wise residue contact potential, distance dependent atomic
contact potential , and secondary structure propensities
Conformational Search Methods
● Successful ab initio modelling of protein structures
depends on the availability of a powerful conformation
search method which can efficiently find the global
minimum energy structure for a given energy function
with complicated energy landscape.
● Types:
➔ Monte Carlo Simulations
➔ Molecular Dynamics
➔ Genetic Algorithm
➔ Mathematical Optimization
Monte Carlo Simulations
● Its core idea is to use random samples of
parameters or inputs to explore the behavior
of a complex system or process.
Initial configuration of particles
in a system
Monte Carlo move is attempted
that changes the configuration of
the particles
Move is accepted or rejected
based on an acceptance
criterion
Calculates the value of a
property of interest
An accurate average value of this
property can be obtained
StepsinMCsimulation
Molecular Dynamics
● MD simulation solves Newton’s equations of motion at each step of
atom movement, which is probably the most faithful method
depicting atomistically what is occurring in proteins.
● The method is therefore most-often used for the study of protein
folding pathways
● The long simulation time is one of the major issues of this method,
since the incremental time scale is usually in the order of
femtoseconds (10 15 s) while the fastest folding time of a small−
protein (less than 100 residues) is in the millisecond range in nature.
Genetic Algorithm
● The genetic algorithm is a method for solving problems
that is based on natural selection, the process that drives
biological evolution.
● The genetic algorithm repeatedly modifies a population of
individual solutions.
● At each step, the genetic algorithm selects individuals at
random from the current population to be parents and uses
them to produce the children for the next generation.
● Over successive generations, the population "evolves"
toward an optimal solution.
Mathematical Optimization
● Mathematical optimization is the selection of a best
element (with regard to some criteria) from some
set of available alternatives.
Model Selection
● The selection of protein models has been
emerged as a new field called Model Quality
Assessment Programs (MQAP)
● Modelling selection approaches can be
classified into two types:
 energy based
 free-energy based
Physics-Based Energy Function
● Selects the decoy with the lowest energy.
Knowledge-Based Energy Function
● Sippl developed a pair wise residue-distance based
potential (Sippl 1990) using the statistics of known PDB
structures in 1990 (its newest version is PROSA II (Sippl
1993; Wiederstein and Sippl 2007) ).
● A variety of knowledge-based potentials have been
proposed, which include atomic interaction potential,
solvation potential, hydrogen bond potential, torsion angle
potential, etc.
Sequence-Structure Compatibility Function
● Best models are selected not purely based on energy functions.
● They are selected based on the compatibility of target sequences
to model structures.
● The earliest and still successful example is that by Luthy et al.
(1992), who used threading scores to evaluate structures.
● Colovos and Yeates (1993) later used a quadratic error function
to describe the non-covalently bonded interactions among CC,
CN, CO, NN, NO and OO, where near-native structures have
fewer errors than other decoys
Clustering of Decoy Structures
● Cluster analysis or clustering is the task of grouping a set of objects in such a
way that objects in the same group (called a cluster) are more similar (in
some sense or another) to each other than to those in other groups (clusters).
● The cluster-centre conformation of the largest cluster is considered closer to
native structures than the majority of decoys.
● In the work by Shortle et al. (1998), for all 12 cases tested, the cluster-centre
conformation of the largest cluster was closer to native structures than the
majority of decoys. Cluster-centre structures were ranked as the top 1–5%
closest to their native structures.
Algorithms&Serversofabinitiomodelling
Fig.: Flowchart of the
ROSETTA protocol
Fig.:Flowchart of I-TASSER protein structure modelling
Thank You
ag1805xag1805x

More Related Content

What's hot

What's hot (20)

Structural databases
Structural databases Structural databases
Structural databases
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Rasmol
RasmolRasmol
Rasmol
 
Dynamic programming
Dynamic programming Dynamic programming
Dynamic programming
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
 
Fasta
FastaFasta
Fasta
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
 
Cath
CathCath
Cath
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & Domain
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Prosite
PrositeProsite
Prosite
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Fasta
FastaFasta
Fasta
 

Viewers also liked (10)

Genome Assembly
Genome AssemblyGenome Assembly
Genome Assembly
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Ddbj
DdbjDdbj
Ddbj
 
Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScape
 
Protein database
Protein databaseProtein database
Protein database
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources
 
Biological databases
Biological databasesBiological databases
Biological databases
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Protein structure: details
Protein structure: detailsProtein structure: details
Protein structure: details
 

Similar to Ab Initio Protein Structure Prediction

13C Chemical shifts of SUMO protein in the
13C Chemical shifts of SUMO protein in the13C Chemical shifts of SUMO protein in the
13C Chemical shifts of SUMO protein in the
Abhilash Kannan
 

Similar to Ab Initio Protein Structure Prediction (20)

Computational chemistry
Computational chemistryComputational chemistry
Computational chemistry
 
Molecular modelling (1)
Molecular modelling (1)Molecular modelling (1)
Molecular modelling (1)
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayi
 
Monte Carlo Simulations & Membrane Simulation and Dynamics
Monte Carlo Simulations & Membrane Simulation and DynamicsMonte Carlo Simulations & Membrane Simulation and Dynamics
Monte Carlo Simulations & Membrane Simulation and Dynamics
 
docking
docking docking
docking
 
Computational methodologies
Computational methodologiesComputational methodologies
Computational methodologies
 
Molecular modelling
Molecular modelling Molecular modelling
Molecular modelling
 
A systematic approach for the generation and verification of structural hypot...
A systematic approach for the generation and verification of structural hypot...A systematic approach for the generation and verification of structural hypot...
A systematic approach for the generation and verification of structural hypot...
 
STUDY OF NANO-SYSTEMS FOR COMPUTER SIMULATIONS
STUDY OF NANO-SYSTEMS FOR COMPUTER SIMULATIONSSTUDY OF NANO-SYSTEMS FOR COMPUTER SIMULATIONS
STUDY OF NANO-SYSTEMS FOR COMPUTER SIMULATIONS
 
Molecular modelling
Molecular modellingMolecular modelling
Molecular modelling
 
HOMOLOGY MODELLING.pptx
HOMOLOGY MODELLING.pptxHOMOLOGY MODELLING.pptx
HOMOLOGY MODELLING.pptx
 
Molecular mechanics and dynamics
Molecular mechanics and dynamicsMolecular mechanics and dynamics
Molecular mechanics and dynamics
 
protein Modeling Abi.pptx
protein Modeling Abi.pptxprotein Modeling Abi.pptx
protein Modeling Abi.pptx
 
Conformational_Analysis.pptx
Conformational_Analysis.pptxConformational_Analysis.pptx
Conformational_Analysis.pptx
 
CADD
CADDCADD
CADD
 
13C Chemical shifts of SUMO protein in the
13C Chemical shifts of SUMO protein in the13C Chemical shifts of SUMO protein in the
13C Chemical shifts of SUMO protein in the
 
molecular docking screnning. pptx
molecular docking screnning. pptxmolecular docking screnning. pptx
molecular docking screnning. pptx
 
Molecular maodeling and drug design
Molecular maodeling and drug designMolecular maodeling and drug design
Molecular maodeling and drug design
 
Molecular dynamics Simulation.pptx
Molecular dynamics Simulation.pptxMolecular dynamics Simulation.pptx
Molecular dynamics Simulation.pptx
 
Quantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingQuantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modeling
 

More from Arindam Ghosh

More from Arindam Ghosh (18)

Network embedding in biomedical data science
Network embedding in biomedical data scienceNetwork embedding in biomedical data science
Network embedding in biomedical data science
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Pharmacogenomics & its ethical issues
Pharmacogenomics & its ethical  issuesPharmacogenomics & its ethical  issues
Pharmacogenomics & its ethical issues
 
Limb development in vertebrates
Limb development in vertebratesLimb development in vertebrates
Limb development in vertebrates
 
Canning fish
Canning fishCanning fish
Canning fish
 
Polymerase Chain Reaction (PCR)
Polymerase Chain Reaction (PCR)Polymerase Chain Reaction (PCR)
Polymerase Chain Reaction (PCR)
 
Carbon Nanotubes
Carbon NanotubesCarbon Nanotubes
Carbon Nanotubes
 
Java - Interfaces & Packages
Java - Interfaces & PackagesJava - Interfaces & Packages
Java - Interfaces & Packages
 
Freshers day anchoring script
Freshers day anchoring scriptFreshers day anchoring script
Freshers day anchoring script
 
Artificial Vectors
Artificial VectorsArtificial Vectors
Artificial Vectors
 
Pseudo code
Pseudo codePseudo code
Pseudo code
 
Hamiltonian path
Hamiltonian pathHamiltonian path
Hamiltonian path
 
Cedrus of Himachal Pradesh
Cedrus of Himachal PradeshCedrus of Himachal Pradesh
Cedrus of Himachal Pradesh
 
MySQL and bioinformatics
MySQL and bioinformatics MySQL and bioinformatics
MySQL and bioinformatics
 
Protein sorting in mitochondria
Protein sorting in mitochondriaProtein sorting in mitochondria
Protein sorting in mitochondria
 
Survey of softwares for phylogenetic analysis
Survey of softwares for phylogenetic analysisSurvey of softwares for phylogenetic analysis
Survey of softwares for phylogenetic analysis
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 

Recently uploaded

Recently uploaded (20)

Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Ab Initio Protein Structure Prediction

  • 1. Ab Initio Protein Structure Prediction ag1805xag1805x
  • 2. Protein structure prediction ● Protein structure prediction (PSP) is the prediction of the three-dimensional structure of a protein from its amino acid sequence i.e. the prediction of its tertiary structure from its primary structure.
  • 3. Protein Structure Prediction: Methods Similar Protein Structure Available Not Available Template Based Method ab initio modelling Threading
  • 4. ab initio modelling ● ab initio modelling conducts a conformational search under the guidance of a designed energy function. ● This procedure usually generates a number of possible conformations (structure decoys), and final models are selected from them.
  • 5. ● A successful ab initio modelling depends on three factors: ➢ an accurate energy function with which the native structure of a protein corresponds to the most thermodynamically stable state, compared to all possible decoy structures; ➢ an efficient search method which can quickly identify the low-energy states through conformational search; ➢ selection of native-like models from a pool of decoy structures.
  • 6. Energy Functions ● Energy classified into two groups: ➔ Physics-based energy functions ➔ Knowledge-based energy functions
  • 7. Physics-Based Energy Functions “In a strictly-defined physics-based ab initio method, interactions between atoms should be based on quantum mechanics and the coulomb potential with only a few fundamental parameters such as the electron charge and the Planck constant; all atoms should be described by their atom types where only the number of electrons is relevant.” (Hagler et al. 1974; Weiner et al. 1984)
  • 8. Physics-Based Energy Functions “In a strictly-defined physics-based ab initio method, interactions between atoms should be based on quantum mechanics and the coulomb potential with only a few fundamental parameters such as the electron charge and the Planck constant; all atoms should be described by their atom types where only the number of electrons is relevant.” (Hagler et al. 1974; Weiner et al. 1984)
  • 9. A compromised force field with a large number of selected atom types is used. In each atom type, the chemical and physical properties of the atoms are enough alike with the parameters calculated from crystal packing or quantum mechanical theory.
  • 10. ● Well-known examples of such all-atom physics- based force fields include: ✔ AMBER ✔ CHARMM ✔ OPLS ✔ GROMOS96 ● These potentials contain terms associated with bond lengths, angles, torsion angles, van der Waals, and electrostatics interactions. ● The major difference between them lies in the selection of atom types and the interaction parameters.
  • 11. Knowledge-Based Energy Function ● Refers to the empirical energy terms derived from the statistics of the solved structures in deposited PDB. ● Can be divided into two types: ➢ generic and sequence-independent terms such as the hydrogen bonding and the local backbone stiffness of a polypeptide chain ➢ amino-acid or protein-sequence dependent terms, e.g. pair wise residue contact potential, distance dependent atomic contact potential , and secondary structure propensities
  • 12. Conformational Search Methods ● Successful ab initio modelling of protein structures depends on the availability of a powerful conformation search method which can efficiently find the global minimum energy structure for a given energy function with complicated energy landscape. ● Types: ➔ Monte Carlo Simulations ➔ Molecular Dynamics ➔ Genetic Algorithm ➔ Mathematical Optimization
  • 13. Monte Carlo Simulations ● Its core idea is to use random samples of parameters or inputs to explore the behavior of a complex system or process.
  • 14. Initial configuration of particles in a system Monte Carlo move is attempted that changes the configuration of the particles Move is accepted or rejected based on an acceptance criterion Calculates the value of a property of interest An accurate average value of this property can be obtained StepsinMCsimulation
  • 15. Molecular Dynamics ● MD simulation solves Newton’s equations of motion at each step of atom movement, which is probably the most faithful method depicting atomistically what is occurring in proteins. ● The method is therefore most-often used for the study of protein folding pathways ● The long simulation time is one of the major issues of this method, since the incremental time scale is usually in the order of femtoseconds (10 15 s) while the fastest folding time of a small− protein (less than 100 residues) is in the millisecond range in nature.
  • 16. Genetic Algorithm ● The genetic algorithm is a method for solving problems that is based on natural selection, the process that drives biological evolution. ● The genetic algorithm repeatedly modifies a population of individual solutions. ● At each step, the genetic algorithm selects individuals at random from the current population to be parents and uses them to produce the children for the next generation. ● Over successive generations, the population "evolves" toward an optimal solution.
  • 17. Mathematical Optimization ● Mathematical optimization is the selection of a best element (with regard to some criteria) from some set of available alternatives.
  • 18. Model Selection ● The selection of protein models has been emerged as a new field called Model Quality Assessment Programs (MQAP) ● Modelling selection approaches can be classified into two types:  energy based  free-energy based
  • 19. Physics-Based Energy Function ● Selects the decoy with the lowest energy.
  • 20. Knowledge-Based Energy Function ● Sippl developed a pair wise residue-distance based potential (Sippl 1990) using the statistics of known PDB structures in 1990 (its newest version is PROSA II (Sippl 1993; Wiederstein and Sippl 2007) ). ● A variety of knowledge-based potentials have been proposed, which include atomic interaction potential, solvation potential, hydrogen bond potential, torsion angle potential, etc.
  • 21. Sequence-Structure Compatibility Function ● Best models are selected not purely based on energy functions. ● They are selected based on the compatibility of target sequences to model structures. ● The earliest and still successful example is that by Luthy et al. (1992), who used threading scores to evaluate structures. ● Colovos and Yeates (1993) later used a quadratic error function to describe the non-covalently bonded interactions among CC, CN, CO, NN, NO and OO, where near-native structures have fewer errors than other decoys
  • 22. Clustering of Decoy Structures ● Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters). ● The cluster-centre conformation of the largest cluster is considered closer to native structures than the majority of decoys. ● In the work by Shortle et al. (1998), for all 12 cases tested, the cluster-centre conformation of the largest cluster was closer to native structures than the majority of decoys. Cluster-centre structures were ranked as the top 1–5% closest to their native structures.
  • 24. Fig.: Flowchart of the ROSETTA protocol
  • 25. Fig.:Flowchart of I-TASSER protein structure modelling