ParaSAM: a parallelized version of the significance analysis of microarrays algorithm.
Abstract
MOTIVATION: Significance analysis of microarrays (SAM) is a widely used permutation-based approach to identifying differentially expressed genes in microarray datasets. While SAM is freely available as an Excel plug-in and as an R package, analyses of large datasets are often limited by very high memory requirements.
SUMMARY: We have developed a parallelized version of the SAM algorithm, called ParaSAM, to overcome these memory limitations. This high-performance multithreaded application provides the scientific community with an easy-to-manage client-server Windows application that has a graphical user interface and requires no programming experience to run. The parallelism comes from the use of web services to perform the permutations. Our results indicate that ParaSAM is not only faster than the serial version but can also analyze extremely large datasets that cannot be analyzed with existing implementations.
AVAILABILITY: A web version open to the public is available at http://bioanalysis.genomics.mcg.edu/parasam. For local installations, both the Windows and web implementations of ParaSAM are available for free at http://www.amdcc.org/bioinformatics/software/parasam.aspx.
Citation
Bioinformatics. 2010 Jun 1; 26(11):1465-1467
10.1093/bioinformatics/btq161
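Illustrative sketch
The abstract describes splitting SAM's label permutations across parallel workers. The following is a minimal sketch of that idea, not the ParaSAM implementation: it assumes a two-class design, uses a fixed fudge constant s0 (SAM itself estimates it from the data), and lets local threads stand in for the web-service calls. All function names are hypothetical. Because of Python's global interpreter lock, threads here show how the work is partitioned rather than a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from math import sqrt
import random


def d_statistics(rows, labels, s0=0.1):
    """SAM-style relative difference d = (mean_b - mean_a) / (s + s0) per gene.

    rows: one list of expression values per gene; labels: 0/1 class per sample.
    s0 is an assumed fixed constant in this sketch.
    """
    out = []
    for row in rows:
        a = [x for x, g in zip(row, labels) if g == 0]
        b = [x for x, g in zip(row, labels) if g == 1]
        na, nb = len(a), len(b)
        mean_a, mean_b = sum(a) / na, sum(b) / nb
        ss = sum((x - mean_a) ** 2 for x in a) + sum((x - mean_b) ** 2 for x in b)
        s = sqrt((1 / na + 1 / nb) * ss / (na + nb - 2))  # pooled standard error
        out.append((mean_b - mean_a) / (s + s0))
    return out


def permuted_batch(rows, labels, n_perm, seed, s0=0.1):
    """Run n_perm label permutations; return summed sorted d-values."""
    rng = random.Random(seed)  # per-batch seed keeps results reproducible
    totals = [0.0] * len(rows)
    for _ in range(n_perm):
        shuffled = list(labels)
        rng.shuffle(shuffled)
        for i, d in enumerate(sorted(d_statistics(rows, shuffled, s0))):
            totals[i] += d
    return totals


def expected_order_statistics(rows, labels, n_perm=100, workers=4, s0=0.1):
    """Average the sorted permuted d-values, with permutations split over workers."""
    counts = [n_perm // workers + (1 if w < n_perm % workers else 0)
              for w in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(permuted_batch, rows, labels, n, seed, s0)
                   for seed, n in enumerate(counts)]
        batches = [f.result() for f in futures]
    return [sum(col) / n_perm for col in zip(*batches)]
```

In SAM, the observed d-values are sorted and compared with these expected order statistics; genes whose observed value departs from the expected value by more than a chosen threshold are called significant. Each batch keeps only a running sum of length equal to the number of genes, so no worker has to hold the full permutation matrix in memory.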