[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [IMP-users] running imp with DNA sequence



On 12/10/17 1:17 AM, מירב בריטברד wrote:
WARNING: Replacing sequence residue 1 "THR" with PDB type "DC"
WARNING: Replacing sequence residue 2 "GLY" with PDB type "DT"
WARNING: Replacing sequence residue 3 "CYS" with PDB type "DG"
WARNING: Replacing sequence residue 4 "ALA" with PDB type "DC"

1. Looks like you have an offset between your FASTA and PDB files. Your FASTA file reads "TGCA..." but your PDB is "CTGC...". Either fix the FASTA sequence to match or use the pdb_offset field in your topology file (assuming you're using one).

2. PMI assumes all sequences are of proteins by default. If you're calling the API directly, add is_nucleic=True when you call create_molecule(). If you're using a topology file, it doesn't seem to be documented anywhere, but add ',DNA' to the FASTA id in that file (you presumably have "DNA1" right now - just change it to "DNA1,DNA").

BTW, to my knowledge no DNA or RNA modeling has been done with IMP, so you may need to fix some bugs in the code here and there as you proceed.

	Ben
--
                      https://salilab.org/~ben/
"It is a capital mistake to theorize before one has data."
	- Sir Arthur Conan Doyle