IMP Reference Guide
develop.63b38c487d,2024/12/21
The Integrative Modeling Platform
Function for preparing spatiotemporal models for sampling.
Functions
def prepare_protein_library
Function that reads in experimental stoichiometry data and calculates which compositions and location assignments should be sampled for spatiotemporal modeling, which are saved as config files.
def IMP.spatiotemporal.prepare_protein_library.prepare_protein_library(
    times,
    exp_comp_map,
    expected_subcomplexes,
    nmodels,
    output_dir = '',
    template_topology = '',
    template_dict = {},
    match_final_state = True
)
Function that reads in experimental stoichiometry data and calculates which compositions and location assignments should be sampled for spatiotemporal modeling. The results are saved as config files.
Optionally, a PMI topology file can be provided, in which case topology files for each composition and location assignment are also written. The output is 3 types of files:
times | list of strings, the times at which the stoichiometry data should be read. |
exp_comp_map | dictionary, which describes protein stoichiometry. The key describes the protein, which should correspond to names within the expected_subcomplexes. Only copy numbers for proteins or subcomplexes included in this dictionary will be scored. For each of these proteins, a csv file should be provided with protein copy number data. The csv file should have 3 columns: 1) "Time", which matches up to the possible times in the graph, 2) "mean", the average protein copy number at that time point from experiment, and 3) "std", the standard deviation of that protein copy number from experiment. |
expected_subcomplexes | list of all possible subcomplex strings in the model. Should be a list without duplicates of all components in the subcomplex configuration files. |
nmodels | int, number of models with different protein copy numbers to generate at each time point. |
output_dir | string, directory where the output will be written. Empty string assumes the current working directory. |
template_topology | string, name of the topology file for the complete complex. (default: '', no topology files are output) |
template_dict | dictionary for connecting the spatiotemporal model to the topology file. The keys (string) are the names of the proteins, defined by the expected_subcomplexes variable. The values (list) are the names of all proteins in the topology file that should have the same copy number as the labeled protein, specifically the "molecule_name". (default: {}, no topology files are output) |
match_final_state | Boolean, determines whether to fix the final state to the state defined by expected_subcomplexes. True enforces this match and thus ensures that the final time has only one state. (default: True) |
Definition at line 57 of file prepare_protein_library.py.
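As a rough illustration of the inputs described above, the following sketch writes copy-number csv files with the three documented columns ("Time", "mean", "std") and assembles the positional arguments. The protein names A and B, the time labels, the numbers, the file names, and the helper functions write_copy_number_csv, build_arguments, and run_prepare are all hypothetical. The sketch also assumes that each value in exp_comp_map is the path to that protein's csv file, which the parameter description does not state explicitly. The call to prepare_protein_library is wrapped in a function and is not executed here, so the block runs without IMP installed.

```python
import csv
import os
import tempfile


def write_copy_number_csv(path, rows):
    """Write (time, mean, std) rows under the documented column names."""
    with open(path, "w", newline="") as fh:
        writer = csv.writer(fh)
        writer.writerow(["Time", "mean", "std"])
        writer.writerows(rows)


def build_arguments(data_dir):
    """Create example csv files; return (times, exp_comp_map, expected_subcomplexes)."""
    # Hypothetical time labels and copy-number data, for illustration only.
    times = ["0min", "1min", "2min"]
    copy_numbers = {
        "A": [("0min", 0.5, 0.3), ("1min", 1.2, 0.4), ("2min", 2.0, 0.2)],
        "B": [("0min", 1.0, 0.5), ("1min", 1.8, 0.6), ("2min", 2.0, 0.3)],
    }
    exp_comp_map = {}
    for protein, rows in copy_numbers.items():
        path = os.path.join(data_dir, "exp_comp_" + protein + ".csv")
        write_copy_number_csv(path, rows)
        # Assumption: the dictionary value is the path to the protein's csv file.
        exp_comp_map[protein] = path
    # Keys of exp_comp_map should correspond to names in expected_subcomplexes.
    expected_subcomplexes = sorted(exp_comp_map)
    return times, exp_comp_map, expected_subcomplexes


def run_prepare(times, exp_comp_map, expected_subcomplexes, nmodels, output_dir=""):
    """Call the documented function; needs an IMP build with IMP.spatiotemporal."""
    from IMP.spatiotemporal.prepare_protein_library import prepare_protein_library
    prepare_protein_library(times, exp_comp_map, expected_subcomplexes,
                            nmodels, output_dir=output_dir)


data_dir = tempfile.mkdtemp()
times, exp_comp_map, expected_subcomplexes = build_arguments(data_dir)
print(times, expected_subcomplexes)
```

With IMP available, run_prepare(times, exp_comp_map, expected_subcomplexes, nmodels=2, output_dir=data_dir) would pass these inputs to the function using the documented parameter names. The value nmodels=2 is an arbitrary choice for the example.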