[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [IMP-dev] [Fwd: PDB lib]

To: imp-dev@salilab.org
Subject: Re: [IMP-dev] [Fwd: PDB lib]
From: Ben Webb <ben@salilab.org>
Date: Fri, 16 Nov 2007 10:37:12 -0800

Daniel Russel wrote:

I think Ben and I have a disagreement about the significance of pickinga PDB reader at this point. As I see it, any PDB reading should behidden behind some very general interface (I have a proposal using theMolecularHierarchyDecorator).

Agreed. We can certainly use whichever PDB reader you're happy with inthe short term. But I have a lot of experience with reading dodgy PDBfiles (although Eashwar is probably the lab expert in this regard) soknow there's a lot of corner cases to worry about, and I don't relishfixing all of these again with a new PDB reader. And that's before westart worrying about heterogens.

I picked my pdb reader since it is small and so can be stuck in the withrest of imp so that no one has to worry about installing externallibraries and it does what I want, namely give me a hierarchy forproteins and a bond information.

We're talking about two different things here. You want to distributeyour PDB reader with IMP. I don't want to include the code in IMP SVN.The two issues are orthogonal; if you really want to bundle the code, wecan do it at makedist time (for tarball releases) or we can use an SVNexternals definition if you want it for SVN users. For the latter, justpoint me to the URL of your SVN repository (presumably this is a pathwithin the CGAL SVN, or if you prefer I can make a repository for yourPDB reader at svn.salilab.org).

the one Frido sent- I don't see how to get bonds out of it, butotherwise fine. The documentation really sucks, so I might be missingsomething.

How are you "getting bonds" out of a PDB file? PDB files don't providethat information. (The most you can get is the CONECT and SSBONDrecords.) For that you generally need a description of the topology,which is generally covered by the topology file portion of most MMforcefields. I really hope this stuff isn't hard-coded, because thatwould really have trouble with patches and other residue modifications(think covalently-bonded ligands, acetylated termini, disulfides, MSE's,cyclic proteins, nucleic acids, etc.).

Hao's project absolutely requires HETATMs, for example. And I
don't share your concern for runtime checks, since PDB reading is not
performance-critical.
My concern on checks was not for efficiency, it was for correctness.Depending on strings is poor as capitalization or abbreviation errorsdon't easily get caught.

I didn't mean you wouldn't have actual atom type objects, just that theyshouldn't be hardcoded. For example, Modeller reads a set of residuetypes from its parameter files at runtime, and after that maps everyresidue type in the PDB file from the string to an integer residue type.Unknown residue types result in a warning, and the generation of a newinteger residue type at runtime. You could of course use Residue objectsrather than integer types.

	Ben
--
ben@salilab.org                      http://salilab.org/~ben/
"It is a capital mistake to theorize before one has data."
	- Sir Arthur Conan Doyle

Follow-Ups:
- Re: [IMP-dev] [Fwd: PDB lib]
  - From: Daniel Russel <drussel@salilab.org>

References:
- [IMP-dev] [Fwd: PDB lib]
  - From: Ben Webb <ben@salilab.org>
- Re: [IMP-dev] [Fwd: PDB lib]
  - From: Ben Webb <ben@salilab.org>
- Re: [IMP-dev] [Fwd: PDB lib]
  - From: Daniel Russel <drussel@salilab.org>

Prev by Date: Re: [IMP-dev] [Fwd: PDB lib]
Next by Date: [IMP-dev] [Fwd: Re: [Fwd: PDB lib]]
Previous by thread: Re: [IMP-dev] [Fwd: PDB lib]
Next by thread: Re: [IMP-dev] [Fwd: PDB lib]
Index(es):
- Date
- Thread