IMP Tutorial  for IMP version 2.4.0
Modeling of RNA Polymerase II stalk


This example utilizes chemical cross-linking coupled with mass spectrometry, negative-stain electron microscopy (EM), and x-ray crystallography data to determine the localization of two subunits of RNA Polymerase II. We will try to reconstruct the stalk of the complex, comprising of subunits Rpb4 and Rpb7, hypothesizing that we know already the structure of the remaining 10-subunit complex. The example can be easily generalized to any other set of subunits.

To work through the example on your own system, you will need the following packages installed:

(On a Mac you can get the Python packages above using the pip tool, e.g. by running a command like sudo easy_install pip, then install the packages with something like sudo pip install scikit-learn; sudo pip install matplotlib; sudo pip install biopython. numpy and scipy are already installed on modern Macs. Something similar may also work on a Linux box, although it's probably better to install the packages using the distribution's package manager, such as yum or apt-get.)

Then download the input files, either by cloning the github repository or by downloading the zip file.

The rnapolii example contains three directories: analysis, data and modeling.

Background of RNA Polymerase II

RNA Pol II is a eukaryotic complex that catalyzes DNA transcription to synthesize mRNA strands. Eukaryotic RNA polymerase II contains 12 subunits, Rpb1 to Rpb12. The yeast RNA Pol II dissociates into a 10-subunit core and a Rpb4/Rpb7 heterodimer. Rpb4 and Rpb7 are conserved from yeast to humans, and form a stalk-like protrusion extending from the main body of the RNA Pol II complex.

Integrative Modeling using IMP

This example will use data from chemical cross linking, EM and x-ray crystallography to localize the two subunits of the RNA Polymerase II stalk (Rpb4, Rpb7) to a static core of the remaining ten subunits.

The four stages of Integrative Modeling

Structural modeling using IMP is divided into four stages.

Click the links below to see a breakdown of all the modeling steps.

Running the script

The first three modeling stages are all contained within one script, You can get started by simply changing into the rnapolii/modeling directory and then running the script with Python:


It will take a very long time to complete the sampling; to get an idea of what's going on you can run it with only 50 output frames by adding the --test option:

python --test

On to stage 1...