Author manuscript; available in PMC: 2012 Jan 1.
Published in final edited form as: Curr Protoc Hum Genet. 2011 Jan;CHAPTER:Unit1.19. doi: 10.1002/0471142905.hg0119s68

Table 1.

Useful software packages for data management, quality control, and statistical analysis in genome-wide association studies.

| Software package | URL | Purpose |
| --- | --- | --- |
| PLINK | http://pngu.mgh.harvard.edu/~purcell/plink/ | Free, open-source GWAS analysis software package. Contains many tools for data management, quality control, and statistical analysis. (PC, Mac, Linux). |
| PLATO | https://chgr.mc.vanderbilt.edu/plato | PLatform for the Analysis, Translation, and Organization of large-scale data – software for GWAS analysis similar to PLINK. |
| R | http://www.r-project.org/ | Free, open-source statistical computing software with excellent graphical capabilities. (PC, Mac, Linux). |
| Eigensoft | http://genepath.med.harvard.edu/~reich/Software.htm | Free, open-source software implementing a principal components analysis-based method for detecting and correcting for population stratification in GWAS. (Linux only). |
| Structure | http://pritch.bsd.uchicago.edu/structure.html | Free, open-source software for inferring the presence of distinct populations and assigning individuals to those populations for a stratified analysis. (Windows, Mac, Linux). |
| MySQL Workbench | http://wb.mysql.com/ | Free, open-source software for creating, administering, and querying relational databases. This is helpful for subsetting data, merging results, and joining QC metrics (e.g., HWE) to final association results. (Windows, Mac, Linux). |
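
The database use case in the last table entry (joining QC metrics such as HWE to final association results) can be sketched as a single SQL join. The example below is a minimal illustration, not a procedure taken from the unit: it uses Python's built-in sqlite3 module with an in-memory database as a stand-in for a MySQL server, and the table names (`assoc`, `qc`), column names, SNP identifiers, p-values, and the HWE threshold are all hypothetical placeholders.

```python
import sqlite3


def join_qc_to_results(assoc_rows, qc_rows, hwe_threshold):
    """Join per-SNP QC metrics to association results and keep SNPs
    whose HWE p-value is at or above the given threshold."""
    # In-memory SQLite stands in for a MySQL database (assumption).
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: one table of association results, one of QC metrics.
    conn.execute(
        "CREATE TABLE assoc (snp TEXT PRIMARY KEY, chrom INTEGER, assoc_p REAL)"
    )
    conn.execute("CREATE TABLE qc (snp TEXT PRIMARY KEY, hwe_p REAL)")
    conn.executemany("INSERT INTO assoc VALUES (?, ?, ?)", assoc_rows)
    conn.executemany("INSERT INTO qc VALUES (?, ?)", qc_rows)
    rows = conn.execute(
        """
        SELECT a.snp, a.chrom, a.assoc_p, q.hwe_p
        FROM assoc AS a
        JOIN qc AS q ON q.snp = a.snp
        WHERE q.hwe_p >= ?
        ORDER BY a.assoc_p
        """,
        (hwe_threshold,),
    ).fetchall()
    conn.close()
    return rows


# Made-up toy rows for illustration only; not real study data.
ASSOC = [("snp_a", 1, 0.5), ("snp_b", 1, 0.001), ("snp_c", 2, 0.02)]
QC = [("snp_a", 0.8), ("snp_b", 1e-9), ("snp_c", 0.3)]

if __name__ == "__main__":
    # The threshold value here is an arbitrary placeholder.
    for row in join_qc_to_results(ASSOC, QC, hwe_threshold=1e-6):
        print(row)
```

Keeping the QC metrics in their own table means a filter threshold can be changed and the join rerun without touching the association results themselves.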