IMP logo
IMP Tutorial
MultiFoXS: determination of multi-state models

Table of Contents

Introduction

In many cases our data comes from a heterogeneous sample. Heterogeneity can be both compositional and conformational. Interpretation of data collected on a heterogeneous sample requires a multi-state model. A multi-state model is a model that specifies two or more co-existing structural states and values for other parameters, such as the weights of each state.

IMP includes a command line tool multi_foxs for enumeration and scoring of multi-state models against SAXS profiles. There is also a web interface available, which functions similarly.

In this example, we will demonstrate how to use it using PDB structures of replication protein A and a simulated SAXS profile.

This tutorial can be followed in several ways:

For full help on the multi_foxs tool, run from a command line:

multi_foxs --help

Calculation

The structure of the RPA in complex with ssDNA is available in the Protein Data Bank (PDB) as code 1jmc (file 1jmc.pdb in the multi_foxs/rpa directory), and the unbound RPA structures as code 1fgu (file 1fguA.pdb, 1fguB.pdb), while the SAXS profile is given in the weighted.dat file. The SAXS profile was simulated from the 3 structures with the following weights:

1jmc.pdb 0.5
1fguA.pdb 0.25
1fguB.pdb 0.25

3% Gaussian noise was added to the simulated profile.

The atomic structures can be fit against the SAXS profile by running MultiFoXS:

cd multi_foxs/rpa
multi_foxs weighted.dat 1fguA.pdb 1fguB.pdb 1jmc.pdb

FoXS calculates the theoretical profiles of the input structures, enumerates multi-state models, and fits them to the input SAXS profile. The output is a list of best scoring multi-state models of size 1, 2, and 3 in files ensembles_size_1.txt, ensembles_size_2.txt, and ensembles_size_3.txt, respectively. It also generates multi_state_model_x_x_x.dat files, containing the computed theoretical profile for the multi-state model and its fit against the experimental profile.

In this example, the best fit is obtained using a 3-state model (𝝌=2.80) ensembles_size_3.txt, with weights of 0.53, 0.23, and 0.25, showing a maximal deviation of 2% compared to the weights that were used to simulate the profile:

1 | 2.80 | x1 2.80 (1.03, 0.53)
0 | 0.525 (0.525, 1.000) | 1jmc.pdb (0.333)
1 | 0.226 (0.226, 1.000) | 1fguA.pdb (0.333)
2 | 0.249 (0.249, 1.000) | 1fguB.pdb (0.333)
CC BY-SA logo