
Re: [IMP-dev] [Fwd: PDB lib]

To: imp-dev@salilab.org
Subject: Re: [IMP-dev] [Fwd: PDB lib]
From: Ben Webb <ben@salilab.org>
Date: Wed, 14 Nov 2007 18:29:45 -0800

At any rate, this PDB reader stuff needs to be discussed on imp-dev before we proceed. For example, what's wrong with the BALL stuff you were playing with before?
BALL is dead. No activity on email list. No response to bugs. No move to
actually document their newest version even though it was released a
year ago. I don't think we want to tie ourselves to it. Sure we can take
it to IMP dev. No one else seems to care much :-)

People certainly care (they keep coming to talk to me, anyway). But I guess they don't like writing emails.

If that's really the case for BALL, then we should probably explore other possibilities, as per Frido's email. I know that BALL's Python interface is rather lacking, certainly.

I have looked around and asked around and couldn't find any decent PDB
readers (in C or C++) which are not buried in some huge project.

Why can't we link against this PDB library, rather than cut-and-pasting thousands of lines of code?

The nice thing about it is that it is small and simple and mine so we
can just ship it along with IMP and not worry about dependencies, name
collisions etc. I don't want people to have to get another library from
somewhere else, hence my desire to put a copy into imp svn. Soon enough
the lib will make it to fedora extras (whenever the next CGAL release
is) so we could potentially just use that.

If it's an external library, it should be a dependency, not part of IMP. Otherwise, regardless of whether you describe it as a "fork", it'll fork as versions of it elsewhere change. CGAL source control sounds like the best place for it if it's going to be part of CGAL. Embedded copies of other projects are a great way to ensure that bugs never get fixed (think of all the projects that bundle zlib).

and 3. from a brief reading, it looks like a not-very-good PDB library anyway (hard-coded atom names - what's with that?)

Well, it is either that or use strings which pushes the checks to
runtime rather than compile time. Adding to an enum and recompiling is
trivial (and adding a constant externally works just as well for most
purposes). Checking everywhere that an object falls in a small set of
allowed strings is hard (especially if you can't specify that set of
strings anywhere). BALL has hardcoded atoms for that matter (just a lot
more of them :-)

A PDB reader which needs to be recompiled for every new HETATM type is simply not going to work. See http://www.bmrb.wisc.edu/elec_dep/pdb_het_library/pdbhetn.htm for example. Hao's project absolutely requires HETATMs, for example. And I don't share your concern for runtime checks, since PDB reading is not performance-critical.

Any PDB reader that we adopt needs to be extendable at runtime. Even Modeller can do that. PyMol, for example, has a library of HETATM fragments (stored as Python pickles, I believe). It also needs to be extensible to be able to read PDBML or possibly mmCIF.
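
To make the runtime-extension point concrete, here is a minimal Python sketch of the idea, not a proposal for the actual reader. Everything in it is hypothetical: AtomRecord, HetLibrary, parse_atom_line and format_atom_line are made-up names that do not come from IMP, BALL, CGAL, Modeller or PyMol. The only outside fact it relies on is the fixed-column layout of ATOM/HETATM records in the PDB format. Known residue and HET group names live in an ordinary dictionary, so a new HETATM type is one register() call rather than a recompile.

```python
# Hypothetical sketch: these names are illustrative only and are not
# taken from IMP, BALL, CGAL, Modeller or PyMol.
from dataclasses import dataclass


@dataclass(frozen=True)
class AtomRecord:
    record: str    # "ATOM" or "HETATM"
    serial: int
    name: str      # atom name, e.g. "CA"
    res_name: str  # residue or HET group name, e.g. "ALA" or "HEM"
    chain: str
    res_seq: int
    x: float
    y: float
    z: float


class HetLibrary:
    """Residue / HET group names mapped to their allowed atom names.

    Entries are added while the program runs, so supporting a new
    HETATM type never requires recompiling anything.
    """

    def __init__(self):
        self._groups = {}

    def register(self, res_name, atom_names):
        self._groups.setdefault(res_name, set()).update(atom_names)

    def knows(self, atom):
        return atom.name in self._groups.get(atom.res_name, ())


def parse_atom_line(line):
    """Parse one fixed-column ATOM/HETATM line; return None for other records."""
    record = line[0:6].strip()
    if record not in ("ATOM", "HETATM"):
        return None
    return AtomRecord(
        record=record,
        serial=int(line[6:11]),
        name=line[12:16].strip(),
        res_name=line[17:20].strip(),
        chain=line[21],
        res_seq=int(line[22:26]),
        x=float(line[30:38]),
        y=float(line[38:46]),
        z=float(line[46:54]),
    )


def format_atom_line(record, serial, name, res_name, chain, res_seq, x, y, z):
    """Build a line with the same column layout, for demonstration only."""
    return (f"{record:<6}{serial:>5} {name:<4} {res_name:>3} {chain}"
            f"{res_seq:>4}    {x:8.3f}{y:8.3f}{z:8.3f}")


library = HetLibrary()
library.register("ALA", ["N", "CA", "C", "O", "CB"])

line = format_atom_line("HETATM", 1234, "FE", "HEM", "A", 154,
                        12.345, 23.456, 34.567)
atom = parse_atom_line(line)
print(library.knows(atom))       # False: HEM has not been registered yet
library.register("HEM", ["FE"])  # extend the library at runtime
print(library.knows(atom))       # True
```

The check happens at runtime, which is the trade-off under discussion: a mistyped name is caught when the file is read rather than when the code is compiled. Since the library is separate from the line parser, a PDBML or mmCIF front end could in principle feed the same AtomRecord objects into it.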

Everybody and his dog has written a PDB reader. Andrej wrote one. Maya wrote one. Javi wrote one. Keren wrote one. There's one in biopython, one in BALL, one in PyMol, one in Chimera, and one in Biskit, all free and widely available software. I can't believe we have to burden the world with another one.

	Ben
--
ben@salilab.org                      http://salilab.org/~ben/
"It is a capital mistake to theorize before one has data."
	- Sir Arthur Conan Doyle

Follow-Ups:
- Re: [IMP-dev] [Fwd: PDB lib]
  - From: Daniel Russel <drussel@salilab.org>

References:
- [IMP-dev] [Fwd: PDB lib]
  - From: Ben Webb <ben@salilab.org>
