IMP Reference Guide
develop.63b38c487d,2024/12/26
The Integrative Modeling Platform
|
This script performs Docking, Monte Carlo optimization, gathering of solutions from the Monte Carlo runs, DOMINO sampling, and finally writing the resulting models.
This uses HEXDOCK in text mode to perform a docking of a subunit (the ligand) into another subunit (the receptor). The script can be used as a standalone program to perform a docking or write the solutions. The script can be modified to use any docking program, by making a couple of changes:
HexDocking
is used only from emagefit and uses the dock()
method. This class can be replaced with another class that provides a dock()
method that saves the results to a file.read_hex_transforms()
and filter_docking_results()
. The only function that needs to be adapted for a different docking program is parse_hex_transform()
, which both of them use.This can be run as a standalone program. The help of the script gives the parameters required, and a typical command is:
To write the elements of the first cluster:
This script returns the em2d score. It is useful for comparing models obtained by other sampling algorithms apart from the one described in the EMageFit paper. To score a model the parameters required are:
An example:
This utility can be used to convert one or more EM images from Spider format to JPG.
The IMP.em2d Python package contains a number of utility modules:
IMP.em2d.buildxlinks - Contains all the code for generating the order of the dockings. It also contains the class InitialDockingFromXlinks
, which is used to move the position of the subunits acting as ligand close to the receptor.
IMP.em2d.DominoModel - Contains the DominoModel
class, which has an IMP.Model as the main member. The class manages the details of setting the model restraints, performing the Monte Carlo runs, configuring the DOMINO sampler, and storing the results in a database.
IMP.em2d.MonteCarloRelativeMoves - Contains the class MonteCarloRelativeMoves
for setting and configuring a simulated annealing Monte Carlo optimizer. It also manages the profiles of temperature and iterations for the sampling. The optimizer uses one IMP.em2d.RelativePositionMover object per docking to propose relative moves of a ligand respect to the receptor.
IMP.em2d.restraints - Creates the restraints used for the modeling. It is called from DominoModel.
IMP.em2d.sampling - Sets the positions and orientations for the components of the assembly before combining them using DOMINO. In the EMageFit benchmark the set of Monte Carlo solutions was used as input to DOMINO, but any other combination of positions and orientations for the subunits could be used.
IMP.em2d.solutions_io - Contains the class ResultsDB
for managing the database of solutions obtained during modeling.
IMP.em2d.Database, IMP.em2d.argminmax, IMP.em2d.csv_related, and IMP.em2d.utility.py are supporting modules. ResultsDB
inherits all the basic functionality from IMP.em2d.Database, which is a wrapper for SQLite databases. The wrapper is easy to use, general, and it does not depend on IMP
.
Some other scripts can be found in the IMP.em2d.imp_general Python package. These scripts are general and perform basic and/or frequent tasks in IMP
. In principle they could be used in other IMP
scripts.
IMP.em2d.imp_general.representation - The main script. It contains functions for obtaining the representation of an assembly from one or more PDB files, creating rigid bodies for the components of the assembly, simplifying the structure of a protein using beads, getting coordinates and distance between residues, etc.
IMP.em2d.imp_general.alignments - A couple of functions to align assemblies.
IMP.em2d.imp_general.comparisons - Functions to compute the cross-correlation coefficient between density maps, RMSD and DRMS between models, and placement score for the subunits of an assembly as defined in the EMageFit paper.
IMP.em2d.imp_general.movement - Functions for transforming a rigid body or a structure.