Here we evaluate statistical methods for detecting differences between two sample correlation matrices. Let C1 be the correlation matrix between a set of features in dataset Y1 with N1 samples, and let C2 be the correlation matrix in dataset Y2 with N2 samples. Alternatively, let Y be the combined dataset, with the two subsets indicated by a categorical variable.

Statistical methods

Methods implemented in the psych R package:
- Steiger method with Fisher transform
  - ‘Steiger.fisher’: cortest(C1, C2, fisher=TRUE)
- Steiger method without Fisher transform
  - ‘Steiger’: cortest(C1, C2, N1, N2, fisher=FALSE)
- Jennrich method
  - ‘Jennrich’: cortest.jennrich(C1, C2, N1, N2)
- Factor analysis method
  - ‘Factor’: cortest.mat(C1, C2, N1, N2)
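As a rough illustration of what a Fisher-transform comparison computes, the base R sketch below converts each off-diagonal correlation to Fisher's z and sums the squared, scaled differences. It treats the correlations as independent and uses the standard large-sample variance 1/(N1-3) + 1/(N2-3) for a difference of two z values. These are assumptions made for illustration, and the computation is not necessarily identical to the one in psych. The helper name fisher_diff_test and the simulated data are hypothetical.

```r
# Hypothetical helper: chi-square style comparison of two correlation matrices
fisher_diff_test <- function(C1, C2, N1, N2) {
  idx <- lower.tri(C1)
  z1 <- atanh(C1[idx])               # Fisher z-transform of each correlation
  z2 <- atanh(C2[idx])
  v <- 1 / (N1 - 3) + 1 / (N2 - 3)   # approximate variance of z1 - z2
  stat <- sum((z1 - z2)^2) / v
  df <- sum(idx)                     # p * (p - 1) / 2 pairwise correlations
  list(statistic = stat, df = df,
       p.value = pchisq(stat, df, lower.tail = FALSE))
}

set.seed(1)
N1 <- 200; N2 <- 150; p <- 5
Y1 <- matrix(rnorm(N1 * p), N1, p)   # simulated data with no true difference
Y2 <- matrix(rnorm(N2 * p), N2, p)
res <- fisher_diff_test(cor(Y1), cor(Y2), N1, N2)
res$p.value
```

Because the sketch ignores dependence between correlations that share a feature, its p-values should be read as approximate.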

Paired Mann-Whitney test comparing the elements of the two correlation matrices (with paired=TRUE, wilcox.test performs the Wilcoxon signed-rank test):
- ‘Mann.Whitney’: wilcox.test( C1[lower.tri(C1)], C2[lower.tri(C2)], paired=TRUE)
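A small worked example of the call above, using simulated data. The sample sizes, number of features and variable names are arbitrary choices for illustration.

```r
set.seed(2)
N1 <- 100; N2 <- 120; p <- 6
Y1 <- matrix(rnorm(N1 * p), N1, p)
Y2 <- matrix(rnorm(N2 * p), N2, p)
C1 <- cor(Y1)
C2 <- cor(Y2)

# The lower triangle holds each pairwise correlation exactly once
r1 <- C1[lower.tri(C1)]
r2 <- C2[lower.tri(C2)]

# Pairs the same feature pair across the two datasets
fit <- wilcox.test(r1, r2, paired = TRUE)
fit$p.value
```

Since the test compares the paired elements by rank, it mainly responds to a systematic shift in the correlations between the two datasets.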

Sparse Leading Eigen-Value (sLED) test, available here.
- Uses permutations, so it is very computationally demanding. Decorate implements a parallelized, adaptive permutation approach that stops early for tests that are not close to significant.
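The idea of an adaptive permutation test can be sketched in base R as follows. This is a generic illustration of early stopping, not the implementation in decorate or sLED. The helper names adaptive_perm_test and lead_eig_diff, the stopping threshold, and the use of the ordinary (non-sparse) leading eigenvalue are all assumptions.

```r
# Hypothetical helper: permutation p-value estimate with early stopping
adaptive_perm_test <- function(Y, group, stat_fun, max_perm = 1000,
                               stop_after = 20) {
  observed <- stat_fun(Y, group)
  exceed <- 0
  n_done <- 0
  for (b in seq_len(max_perm)) {
    permuted <- sample(group)          # shuffle the group labels
    n_done <- b
    if (stat_fun(Y, permuted) >= observed) exceed <- exceed + 1
    # Many exceedances mean the test is clearly not significant: stop early
    if (exceed >= stop_after) break
  }
  list(p.value = (exceed + 1) / (n_done + 1), n_perm = n_done)
}

# Stand-in statistic: largest absolute eigenvalue of the difference matrix
# (sLED uses a sparse version of this quantity)
lead_eig_diff <- function(Y, group) {
  D <- cor(Y[group == 1, ]) - cor(Y[group == 2, ])
  max(abs(eigen(D, symmetric = TRUE, only.values = TRUE)$values))
}

set.seed(3)
Y <- matrix(rnorm(80 * 5), 80, 5)
group <- rep(1:2, each = 40)
res <- adaptive_perm_test(Y, group, lead_eig_diff)
res$n_perm   # number of permutations actually run
```

Tests far from significance accumulate exceedances quickly and finish after a few dozen permutations, so most of the compute is spent on the tests that are close to the threshold.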

Methods implemented in the sLED package:
- ‘Cai.max’: Cai.max.test( Y1, Y2 )
- ‘Schott.Frob’: Schott.Frob.test( Y1, Y2 )
- ‘Chang.maxBoot’: Chang.maxBoot.test( Y1, Y2 )
- ‘WL.randProj’: WL.randProj.test( Y1, Y2 )
- ‘LC.U’: LC.U.test( Y1, Y2 )

Box’s M-test for homogeneity of covariance matrices, implemented in the heplots package:
- ‘Box’: boxM(Y, variable)
- Proposed here: Box’s M-test with empirical degrees of freedom of the \(\chi^2\) null distribution estimated by fast permutations.
  - ‘Box.permute’: boxM_permute(Y, variable)
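For reference, the standard Box's M statistic and its \(\chi^2\) approximation can be sketched in base R as below. The helper name box_m_sketch is hypothetical and the code is not taken from heplots. It uses the fixed theoretical degrees of freedom, so it does not implement the permutation-based estimate of the degrees of freedom proposed above.

```r
# Hypothetical helper: Box's M statistic with the usual chi-square approximation
box_m_sketch <- function(Y, variable) {
  variable <- factor(variable)
  p <- ncol(Y)
  g <- nlevels(variable)
  n <- as.numeric(table(variable))   # group sizes, in level order
  N <- sum(n)
  covs <- lapply(levels(variable),
                 function(l) cov(Y[variable == l, , drop = FALSE]))
  # Pooled covariance, weighting each group by its degrees of freedom
  pooled <- Reduce(`+`, Map(function(S, ni) (ni - 1) * S, covs, n)) / (N - g)
  logdet <- function(S) as.numeric(determinant(S, logarithm = TRUE)$modulus)
  M <- (N - g) * logdet(pooled) - sum((n - 1) * sapply(covs, logdet))
  # Box's scaling factor for the chi-square approximation
  cf <- (sum(1 / (n - 1)) - 1 / (N - g)) *
    (2 * p^2 + 3 * p - 1) / (6 * (p + 1) * (g - 1))
  stat <- M * (1 - cf)
  df <- p * (p + 1) * (g - 1) / 2
  list(statistic = stat, df = df,
       p.value = pchisq(stat, df, lower.tail = FALSE))
}

set.seed(4)
Y <- matrix(rnorm(120 * 4), 120, 4)
variable <- rep(c("a", "b"), each = 60)
res <- box_m_sketch(Y, variable)
res$p.value
```

The log-determinants require every group covariance matrix to be positive definite, so the sketch only applies when each group has more samples than features.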

Test developed by Delaneau et al. Evaluates the influence of sample i by comparing the correlations based on the full dataset to the correlations after dropping sample i. This gives a score for each sample. A test of association between this sample-level score and the variable of interest is then evaluated. If this variable has two categories, a Wilcoxon test is used, and for more than two categories a Kruskal-Wallis test is used. If the variable is continuous, a Spearman correlation test is used.
- ‘Delaneau’: delaneau.test( Y, variable)
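The leave-one-out scoring described above can be sketched in base R. The helper name loo_cor_score is hypothetical, and summarizing the change across feature pairs by its mean is an assumption, since the text does not specify how the per-pair differences are combined. This is not the code behind delaneau.test.

```r
# Hypothetical helper: influence of each sample on the pairwise correlations
loo_cor_score <- function(Y) {
  idx <- lower.tri(diag(ncol(Y)))
  full <- cor(Y)[idx]
  sapply(seq_len(nrow(Y)), function(i) {
    dropped <- cor(Y[-i, , drop = FALSE])[idx]
    mean(full - dropped)     # assumed summary across feature pairs
  })
}

set.seed(5)
n <- 60; p <- 4
# Group 1: features share a common factor; group 2: independent features
shared <- rnorm(n)
Y1 <- sapply(1:p, function(j) shared + rnorm(n))
Y2 <- matrix(rnorm(n * p), n, p)
Y <- rbind(Y1, Y2)
variable <- factor(rep(c("g1", "g2"), each = n))

score <- loo_cor_score(Y)
fit <- wilcox.test(score ~ variable)   # two categories: Wilcoxon test
fit$p.value
```

For a variable with more than two categories, kruskal.test(score ~ variable) would take the place of the Wilcoxon test, and for a continuous variable, cor.test(score, variable, method = "spearman").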

Proposed here: Evaluates the influence of sample i by comparing the sparse leading eigen-value of the correlation matrix based on the full dataset to the sparse leading eigen-value of the correlation matrix after dropping sample i. This gives a score for each sample. A test of association between this sample-level score and the variable of interest is then evaluated. If this variable has two categories, a Wilcoxon test is used, and for more than two categories a Kruskal-Wallis test is used. If the variable is continuous, a Spearman correlation test is used.
- ‘deltaSLE’: sle.test( Y, variable)
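A minimal sketch of the eigen-value based score, using a three-category variable so that the Kruskal-Wallis branch applies. The ordinary leading eigenvalue from eigen() is used as a stand-in for the sparse leading eigen-value, which is an assumption made to keep the example in base R. The helper names lead_eig and loo_eig_score are hypothetical and this is not the code behind sle.test.

```r
# Ordinary leading eigenvalue, standing in for the sparse one
lead_eig <- function(C) {
  eigen(C, symmetric = TRUE, only.values = TRUE)$values[1]
}

# Hypothetical helper: change in leading eigenvalue when sample i is dropped
loo_eig_score <- function(Y) {
  full <- lead_eig(cor(Y))
  sapply(seq_len(nrow(Y)),
         function(i) full - lead_eig(cor(Y[-i, , drop = FALSE])))
}

set.seed(6)
Y <- matrix(rnorm(90 * 5), 90, 5)
variable <- factor(rep(c("a", "b", "c"), each = 30))

score <- loo_eig_score(Y)
fit <- kruskal.test(score ~ variable)   # more than two categories
fit$p.value
```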

Method properties

Simulation 1

Estimate false positive rate under the null.

Simulation results are shown comparing correlation matrices of p features across N samples. Most methods are only applicable to positive definite matrices, which requires N > p. Only Mann-Whitney, sLED, Delaneau and deltaSLE are applicable to datasets with N < p, so the remaining methods give no results for simulations in this case (i.e. top right of figures).

To determine control of the false positive rate, 5000 simulations were performed under the null model of no difference in correlation structure between the two datasets (i.e. C1 == C2).
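The null simulation can be sketched in base R as follows, using the paired test on correlation elements as the example method. The number of simulations is reduced for speed, and the sample size, number of features, shared correlation of 0.5 and significance level are arbitrary assumptions that are not taken from the report.

```r
set.seed(7)
n_sim <- 200          # the report uses 5000; reduced here for speed
N <- 50; p <- 5; alpha <- 0.05

# Shared correlation structure for both datasets (0.5 is an arbitrary choice)
Sigma <- matrix(0.5, p, p)
diag(Sigma) <- 1
R <- chol(Sigma)      # t(R) %*% R equals Sigma

pvals <- replicate(n_sim, {
  # Both datasets come from the same distribution, so C1 == C2 in truth
  Y1 <- matrix(rnorm(N * p), N, p) %*% R
  Y2 <- matrix(rnorm(N * p), N, p) %*% R
  C1 <- cor(Y1)
  C2 <- cor(Y2)
  wilcox.test(C1[lower.tri(C1)], C2[lower.tri(C2)], paired = TRUE)$p.value
})

# Fraction of null simulations called significant
fpr <- mean(pvals < alpha)
fpr
```

A method that controls the false positive rate should give a value close to alpha here.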

Note that the x-axis stops at 0.2, but the false positive rate of the Factor and Jennrich methods often exceeds this value.

Simulation 2

In group 1, all pairwise correlations are 0.80 and in group 2 all pairwise correlations are 0.75.

To test the power of each method, 1000 null simulations were performed in addition to 1000 simulations with different correlation structure (i.e. C1 != C2).

Performance based on Area Under the Precision Recall (AUPR) curve
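One way to compute the area under the precision-recall curve from a set of p-values and true labels is sketched below in base R. The helper name aupr is hypothetical, the step-wise (average precision) form is one of several common definitions, and the p-values are stand-ins drawn from simple distributions instead of results from the methods above.

```r
# Hypothetical helper: area under the precision-recall curve.
# score: larger means stronger evidence; label: 1 = true difference, 0 = null
aupr <- function(score, label) {
  ord <- order(score, decreasing = TRUE)
  label <- label[ord]
  tp <- cumsum(label == 1)              # true positives at each cutoff
  precision <- tp / seq_along(label)
  recall <- tp / sum(label == 1)
  # Step-wise area: precision weighted by each increase in recall
  sum(diff(c(0, recall)) * precision)
}

set.seed(8)
p_null <- runif(1000)                   # p-values from null simulations
p_alt <- rbeta(1000, 0.3, 1)            # stand-in p-values skewed toward zero
score <- -log10(c(p_null, p_alt))
label <- rep(c(0, 1), each = 1000)
res <- aupr(score, label)
res
```

With equal numbers of null and non-null simulations, a method with no power is expected to score about 0.5, and a method that ranks every true difference above every null scores 1.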

Precision Recall curves

Simulation 3

In group 1, all pairwise correlations are 0.80, and in group 2 half of the pairwise correlations are set to 0.75 while the rest remain at 0.80. This is followed by a small correction to make the matrix positive definite.
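The two correlation structures can be constructed in base R along the following lines. Choosing the modified pairs at random and correcting by raising small eigenvalues are both assumptions, since the text does not state how the pairs were selected or which correction was applied. The number of features is also arbitrary.

```r
set.seed(9)
p <- 10

# Group 1: all pairwise correlations equal 0.80
C1 <- matrix(0.80, p, p)
diag(C1) <- 1

# Group 2: half of the pairs (chosen at random here) are set to 0.75
C2 <- C1
pairs <- which(lower.tri(C2))
chosen <- sample(pairs, length(pairs) %/% 2)
C2[chosen] <- 0.75
C2[upper.tri(C2)] <- t(C2)[upper.tri(C2)]   # copy lower triangle to upper

# Assumed correction: raise any non-positive eigenvalues, then rescale
eig <- eigen(C2, symmetric = TRUE)
vals <- pmax(eig$values, 1e-6)
C2 <- cov2cor(eig$vectors %*% diag(vals) %*% t(eig$vectors))
C2 <- (C2 + t(C2)) / 2                      # remove floating point asymmetry

min(eigen(C2, symmetric = TRUE, only.values = TRUE)$values)
```

When the modified matrix is already positive definite the correction leaves it essentially unchanged, so it only matters for larger p.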

To test the power of each method, 1000 null simulations were performed in addition to 1000 simulations with different correlation structure (i.e. C1 != C2).

Simulations for testing difference in correlation matrices

Developed by Gabriel Hoffman

Run on 2019-09-13
