Previous: QTL Projection, Up: Meta Analysis
Here we want to address the following question: How many “real” QTL do the QTL detected in the different mapping experiments represent - one, two, three, four,... or as many as the number detected throughout the studies ? The meta-analysis of QTL can be viewed as a clustering procedure. To do so, MetaQTL implements tow kinds of clustering algorithm. Whatever the procedure used to perform the clustering, the QTL locations are assumed to be normally distributed around their true locations with variances which can be derived from the reported CI or r-square values. This Gaussian and unbiased approximation comes from the classical asymptotic Gaussian distribution of the maximum-likelihood estimation of the parameters.
ClustQTL implements a clustering procedure based on a Gaussian mixture model which parameter estimates are obtained by applying a EM-algorithm.
| Option | Usage | Type | Explanation
|
|---|---|---|---|
-q,--qtlmap | required | string | The map with the QTL to clusterize (XML format).
|
-o,--output | required | string | The output file stem.
|
-t,--tonto | optional | string | The trait ontology.
|
-k,--kmax | optional | integer | The maximal number of clusters.
|
-c,--chr | optional | string | The name of the chromosome on which to perform the meta-analysis.
|
--cimode | optional | integer | The CI computation mode.
|
--cimiss | optional | integer | The imputation mode for missing CI.
|
--emrs | optional | integer | the number of random starting points for the EM algorithm
|
--emeps | optional | double | the convergence threshold for the EM algorithm
|
The option --cimode controls the mode of computation of the variances of the QTL. There are four modes:
--cimiss defines how to deal with QTL for which no variance can be computed. There are two possibilities:
The output of ClustQTL is divided into 3 plain text files:
| Identifier | Value
|
|---|---|
| CR | The name of the linkage group.
|
| TR | The trait name following by the number of related QTL on the chromosome.
|
| QT | A QTL with its identifier, its name, its position on the chromosome and its estimated standard deviation.
|
| CL | Indicates the beginning of a clustering result. It is followed by the number of QTL involved in the clustering, the number of clusters, the log-likelihood and the complete log-likelihood of this clustering.
|
| CC | The name of a model choice criterion followed by its value.
|
| CP | This tag recovers four kinds of entry:
|
|
|
|
Another way to clusterize the observed QTL is to use standard hierarchical clustering procedures. QTLTree implements two kinds of hierarchical clustering algorithm :
| Option | Usage | Type | Explanation
|
|---|---|---|---|
-q,--qtlmap | required | string | The map with the QTL to clusterize (XML format).
|
-o,--output | required | string | The output file.
|
-m,--mode | optional | integer | The clustering mode (default is 2).
|
-t,--tonto | optional | string | The trait ontology.
|
--cimode | optional | integer | The variance computation mode.
|
--cimiss | optional | integer | The imputation mode for missing variances.
|
The option -m (or --mode) allows user to switch between the two possible clustering algorithms:
--cimode and --cimiss works as for QTLClust.
The output of QTLTree consists in one plain text file. The file is organized as follows:
| Identifier | Value
|
|---|---|
| CR | The name of the linkage group.
|
| TR | The name of the trait followed by the number of related QTL on the chromosome.
|
| QT | A QTL involved in the clustering with its identifier, its name, its most probable position on the chromosome and its estimated standard deviation.
|
| HC | The tree obtained by the clustering algorithm in Newick's format.
|
|