IMP  2.4.0
The Integrative Modeling Platform
EMageFit scripts and tools

Table of Contents

Command line tools

emagefit: Run all steps of modeling

This script performs Docking, Monte Carlo optimization, gathering of solutions from the Monte Carlo runs, DOMINO sampling, and finally writing the resulting models.

emagefit_dock: Docking using the HEXDOCK program

This uses HEXDOCK in text mode to perform a docking of a subunit (the ligand) into another subunit (the receptor). The script can be used as a standalone program to perform a docking or write the solutions. The script can be modified to use any docking program, by making a couple of changes:

emagefit_cluster: Performs clustering of the solutions stored in a database file.

This can be run as a standalone program. The help of the script gives the parameters required, and a typical command is:

emagefit_cluster --exp config_step_3.py --db domino_solutions.db --o clusters.db --n 100 --orderby em2d --log clusters.log --rmsd 10

To write the elements of the first cluster:

emagefit --exp config_step_3.py --o domino_solutions.db --wcl clusters.db 1

emagefit_score: Scores a model using EM images.

This script returns the em2d score. It is useful for comparing models obtained by other sampling algorithms apart from the one described in the EMageFit paper. To score a model the parameters required are:

An example:

emagefit_score structure.pdb myimages.sel 3.6 20 5 100

convert_spider_to_jpg: Image conversion utility for EMageFit.

This utility can be used to convert one or more EM images from Spider format to JPG.

IMP.em2d Python package utilities

The IMP.em2d Python package contains a number of utility modules:

IMP.em2d.buildxlinks - Contains all the code for generating the order of the dockings. It also contains the class InitialDockingFromXlinks, which is used to move the position of the subunits acting as ligand close to the receptor.

IMP.em2d.DominoModel - Contains the DominoModel class, which has an IMP.Model as the main member. The class manages the details of setting the model restraints, performing the Monte Carlo runs, configuring the DOMINO sampler, and storing the results in a database.

IMP.em2d.MonteCarloRelativeMoves - Contains the class MonteCarloRelativeMoves for setting and configuring a simulated annealing Monte Carlo optimizer. It also manages the profiles of temperature and iterations for the sampling. The optimizer uses one IMP.em2d.RelativePositionMover object per docking to propose relative moves of a ligand respect to the receptor.

IMP.em2d.restraints - Creates the restraints used for the modeling. It is called from DominoModel.

IMP.em2d.sampling - Sets the positions and orientations for the components of the assembly before combining them using DOMINO. In the EMageFit benchmark the set of Monte Carlo solutions was used as input to DOMINO, but any other combination of positions and orientations for the the subunits could be used.

IMP.em2d.solutions_io - Contains the class ResultsDB for managing the database of solutions obtained during modeling.

IMP.em2d.Database, IMP.em2d.argminmax, IMP.em2d.csv_related, and IMP.em2d.utility.py are supporting modules. ResultsDB inherits all the basic functionality from IMP.em2d.Database, which is a wrapper for SQLite databases. The wrapper is easy to use, general, and it does not depend on IMP.

IMP.em2d.imp_general Python package utilities

Some other scripts can be found in the IMP.em2d.imp_general Python package. These scripts are general and perform basic and/or frequent tasks in IMP. In principle they could be used in other IMP scripts.

IMP.em2d.imp_general.representation - The main script. It contains functions for obtaining the representation of an assembly from one or more PDB files, creating rigid bodies for the components of the assembly, simplifying the structure of a protein using beads, getting coordinates and distance between residues, etc.

IMP.em2d.imp_general.alignments - A couple of functions to align assemblies.

IMP.em2d.imp_general.comparisons - Functions to compute the cross-correlation coefficient between density maps, RMSD and DRMS between models, and placement score for the subunits of an assembly as defined in the EMageFit paper.

IMP.em2d.imp_general.movement - Functions for transforming a rigid body or a structure.