IMP logo

IMP::statistics Namespace Reference


Detailed Description

This module provides methods for clustering.

Author:
Keren Lasker, Daniel Russel
Version:
1.0
Overview:
This module provides code to compute clusterings. Adaptors are provided that allow easy clustering of points, and configurations of models in IMP::ConfigurationSet objects among other things.
Examples
Examples can be found on the IMP.statistics examples page.
License:
LGPL. This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
Publications:
  • Daniel Russel, Keren Lasker, Ben Webb, Dina Schneidman, Javier Velazquez-Muriel, Andrej Sali, “Integrative assembly modeling using IMP”, submitted, 2010. This paper provides an overview of the key concepts in IMP and how to apply them to biological problems.
  • Frank Alber, Friedrich Foerster, Dmitry Korkin, Maya Topf, Andrej Sali, “Integrating diverse data for structure determination of macromolecular assemblies”, Annual Review of Biochemistry, 2008. This paper provides a review of the integrative structure determination methodology and various data sources that can be used.


Data Structures

class  ConfigurationSetXYZEmbedding
 Embed a configuration using the XYZ coordinates of a set of particles. More...
class  Embedding
 Map clustering data to spatial positions. More...
class  HighDensityEmbedding
class  KMeansClustering
class  KMLProxy
 Proxy to apply k-means clustering on a set of Particles. More...
class  ParticleEmbedding
class  PartitionalClustering
 The base class for clusterings of data sets. More...
class  VectorDEmbedding
 Simply return the coordinates of a VectorD. More...

Functions

std::string get_data_path (std::string file_name)
 Return the path to installed data for this module.
std::string get_example_path (std::string file_name)
 Return the path to installed example data for this module.
KMeansClusteringget_lloyds_kmeans (Embedding *embedding, unsigned int k, unsigned int iterations)
std::string get_module_name ()
const VersionInfoget_module_version_info ()

Function Documentation

std::string IMP::statistics::get_data_path ( std::string  file_name  ) 

Return the path to installed data for this module.

Each module has its own data directory, so be sure to use the version of this function in the correct module. To read the data file "data_library" that was placed in the data directory of module "mymodule", do something like

    std::ifstream in(IMP::mymodule::get_data_path("data_library"));
This will ensure that the code works when IMP is installed or used via the tools/imppy.sh script.

std::string IMP::statistics::get_example_path ( std::string  file_name  ) 

Return the path to installed example data for this module.

Each module has its own example directory, so be sure to use the version of this function in the correct module. For example to read the file example_protein.pdb located in the examples directory of the IMP::atom module, do

    IMP::atom::read_pdb(IMP::atom::get_example_path("example_protein.pdb", model));
This will ensure that the code works when IMP is installed or used via the tools/imppy.sh script.

KMeansClustering* IMP::statistics::get_lloyds_kmeans ( Embedding *  embedding,
unsigned int  k,
unsigned int  iterations 
)

Return a k-means clustering of all points contained in the embedding (ie [0... embedding->get_number_of_embeddings())). These points are then clustered into k clusters. More iterations takes longer but produces a better clustering.


Generated on Mon Mar 8 23:09:03 2010 for IMP by doxygen 1.5.8