Re: [IMP-users] problems with a sample script (basic IMP optimization)

I think the main problem is the roughness of the scoring function caused by the excluded volume terms (as I had cryptically mentioned before). Your scoring function looks something like EM + a sum over many pairs, where EM goes from 0 to 1 and each of the terms being summed goes from, more or less, 0 to 1 as well. So the EM score is swamped by the excluded volume terms as soon as there is any overlap. And MC just moves things randomly, hoping for a good solution, so it will take a while to find anything good. In addition, if you use Conjugate Gradients, the derivatives of the excluded volume score on atoms are pretty useless, since they are computed using each atom ball (an explanation of the problem would probably benefit from a picture, I'll work on that :-).
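To make the "swamped" point concrete, here is a tiny arithmetic sketch. It does not use IMP at all; the function total_score and all the numbers are made up for illustration, assuming only that the EM term and each pair term lie roughly in [0, 1].

```python
def total_score(em_score, pair_terms):
    """Toy total score: one EM term plus a sum of per-pair terms.

    Assumption for illustration only: every term lies roughly in [0, 1].
    """
    return em_score + sum(pair_terms)


# Hypothetical case 1: good fit to the map (EM = 0.1), but 20 pairs overlap a bit.
good_fit_with_overlap = total_score(0.1, [0.5] * 20)

# Hypothetical case 2: poor fit to the map (EM = 0.9), no overlapping pairs.
poor_fit_no_overlap = total_score(0.9, [])

# The well-fitted configuration scores far worse, because the whole range
# of the EM term (0 to 1) is smaller than the sum of a few pair terms.
print(good_fit_with_overlap, poor_fit_no_overlap)
```

With a full atomic representation the number of pairs is large, so even a small overlap can outweigh any improvement the EM term could offer.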

My suggestions are

- don't use a full atomic representation for your proteins (e.g., simplify them using IMP.atom.create_simplified_along_backbone()). This will smooth out the excluded volume scoring function and make scoring faster.

- use Conjugate Gradients in conjunction with MC (create a conjugate gradients optimizer and add it to the MC one using mc.set_local_optimizer()). This will then perform local minimization after each MC step.

- when using Monte Carlo for optimization (as opposed to sampling), you should call set_return_best(True) on the MC object so that it saves the best state it finds, rather than just returning the last state accepted (which may have a higher score).
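The last two suggestions can be sketched in plain Python, without IMP, to show what they do. This is only an illustration under stated assumptions: score, local_minimize and monte_carlo are hypothetical helpers on a toy 1D function, plain gradient descent stands in for Conjugate Gradients, and the return_best flag mimics what set_return_best(True) is described as doing above.

```python
import math
import random


def score(x):
    # Toy rough landscape (not an IMP score): a broad bowl with ripples.
    return 0.1 * x * x + math.sin(3.0 * x) + 1.0


def local_minimize(x, step=0.01, iterations=200):
    # Plain gradient descent with a numerical derivative.
    # It stands in for the Conjugate Gradients local optimizer.
    h = 1e-6
    for _ in range(iterations):
        grad = (score(x + h) - score(x - h)) / (2.0 * h)
        x -= step * grad
    return x


def monte_carlo(x0, steps=200, max_move=2.0, kt=0.5, return_best=True, seed=0):
    rng = random.Random(seed)  # fixed seed so runs are reproducible
    current = local_minimize(x0)
    current_score = score(current)
    best, best_score = current, current_score
    for _ in range(steps):
        # Random move, then local minimization of the moved state.
        trial = local_minimize(current + rng.uniform(-max_move, max_move))
        trial_score = score(trial)
        # Metropolis criterion: always accept downhill, sometimes uphill.
        delta = trial_score - current_score
        if delta <= 0 or rng.random() < math.exp(-delta / kt):
            current, current_score = trial, trial_score
            if current_score < best_score:
                best, best_score = current, current_score
    # Return the best state seen, or just the last accepted one.
    if return_best:
        return best, best_score
    return current, current_score


best_x, best_s = monte_carlo(5.0, return_best=True)
last_x, last_s = monte_carlo(5.0, return_best=False)
print("best state:", best_x, best_s)
print("last accepted state:", last_x, last_s)
```

Both calls follow the same trajectory because they share a seed, so the only difference is which state is returned. The last accepted state can never score better than the best one, which is why returning the best state is the safer choice when the goal is optimization.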

On Jul 27, 2010, at 6:30 AM, Benjamin SCHWARZ wrote:

Hi list,

   In order to have a better insight I modified my script, keeping only the em.FitRestraint in my model (script attached).
   The score for the initial model (5.96046447754e-08) is reassuringly near 0; which is reasonable when comparing two maps sampled from the same structure.
   Nevertheless, when chains are randomly moved, the score does not radically improve, even when the transformations move them far away from their initial position. Moreover, I have an example where the score is minimized from 0.37 for the initial (moved chains) position to 0.49 for the optimized solution !?

   I may have missed something, and I would need some technical support to fix the problem quickly.

   --Ben

<fitWithRestraints.py>

Dr. Benjamin SCHWARZ
Biocomputing group
Email : schwarz@igbmc.fr
Voice : +33 (0)3 68 85 47 30
FAX : +33 (0)3 68 85 47 18

Structural Biology & Genomics Dept. - IGBMC
1 rue Laurent Fries
BP 10142
F - 67404 Illkirch CEDEX

_______________________________________________
IMP-users mailing list
IMP-users@salilab.org
https://salilab.org/mailman/listinfo/imp-users