These scripts are extracted from TADbit, a computational framework to analyze and model the chromatin fiber in three dimensions, to illustrate the use of IMP to model a genomic region with 5 Topologically Associating Domains (TADs) from the fly genome.
data: contains the input data and configuration files.
01_model_and_analyze.pythe main IMP script modeling to model the region
IMPOptimizer.pycontains classes and scripts to find the optimal parameters in the modelling for IMP
outputswill contain the results of the script as a folder structure with:
- .log file
- .tsv optimal parameters' file for IMP
- .xyz file for each model
- .cmm UCSF Chimera file for each model
- .cmd UCSF Chimera file to display all the models at the same time
- .json file to visualize and analyze the model using TADkit. TADkit creates interactive 3D representations of chromatin conformations.
testfile to run the test
Running the IMP scripts for the fly genome:
To run the modeling script, just change into its directory and run it from the command line, e.g.
python 01_model_and_analyze.py --cfg ../data/chr4.cfg --ncpus 12
A prebuild optimal imp parameters it's already included in the data directory to speed up the test which should take around 2 hours in a single cpu. If you wish to test the optimization and the modeling just delete the configuration file or change the extension to something else than .tsv. The whole computation should take around 40 hours in a single cpu.
The results will be produced in the outputs directory as described above.
Author(s): François Serra, Davide Baù, David Castillo, Guillaume Filion, Marc A. Marti-Renom
Date: October 1st, 2016
- Serra, F., Baù, D., Filion, G., & Marti-Renom, M. A. (2016). Structural features of the fly chromatin colors revealed by automatic three-dimensional modeling. bioRxiv. doi:10.1101/036764