-
Notifications
You must be signed in to change notification settings - Fork 10
/
snpm_uc_FDR.m
180 lines (167 loc) · 5.74 KB
/
snpm_uc_FDR.m
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
function [u,Ps,Ts] = snpm_uc_FDR(q,df,STAT,n,Vs,Vm)
% False Discovery critical height threshold
% FORMAT [u,Ps,Ts] = snpm_uc_FDR(q,df,STAT,n,Vs[,Vm])
%
% q - critical expected False Discovery Rate
% df - [df{interest} df{residuals}]
% STAT - Statisical field (see comments below about FWER and EFDR)
% 'Z' - Gaussian field
% 'T' - T - field
% 'X' - Chi squared field
% 'F' - F - field
% 'P' - P - value
% n - abs(n), number of component SPMs in conjunction
% Vs - Mapped statistic image(s)
% -or-
% Vector of sorted p-values, p1<p2<... (saves i/o w/ repeated calls)
% Vm - Mask in 1 of 3 forms
% o Scalar, indicating implicit mask value in statistic image(s)
% o Vector of indicies of elements within mask
% o Mapped mask image
% (Ignored if Vs is a vector.)
%
% u - critical height
% Ps - Sorted p-values
% Ts - Sorted statistic values
%
%___________________________________________________________________________
%
% The Benjamini & Hochberch (1995) False Discovery Rate (FDR) procedure
% finds a threshold u such that the expected FDR is at most q.
% snpm_uc_FDR returns this critical threshold u.
%
% For repeated use of a given statistic image, return Ps in the place
% of Vs:
% [P Ps] = snpm_uc_FDR(Z1,df,STAT,n,Vs); %-Initialization, image read
% P = snpm_uc_FDR(Z2,df,STAT,n,Ps); %-Image not read, Ps used
%
% Note that a threshold of Inf is possible if q is very small. This
% means that there is no threshold such that FDR is controlled at q.
%
% If abs(n) > 1, a P-value for a minimum of n values of the statistic
% is returned. If n>0, the P-value assesses the conjunction null
% hypothesis of one or more of the null hypotheses being true. If n<0,
% then the P-value assess the global null of all nulls being true.
%
%
% Background
%
% For a given threshold on a statistic image, the False Discovery Rate
% is the proportion of suprathreshold voxels which are false positives.
% Recall that the thresholding of each voxel consists of a hypothesis
% test, where the null hypothesis is rejected if the statistic is larger
% than threshold. In this terminology, the FDR is the proportion of
% rejected tests where the null hypothesis is actually true.
%
% A FDR proceedure produces a threshold that controls the expected FDR
% at or below q. The FDR adjusted p-value for a voxel is the smallest q
% such that the voxel would be suprathreshold.
%
% In comparison, a traditional multiple comparisons proceedure
% (e.g. Bonferroni or random field methods) controls Familywise Error
% rate (FWER) at or below alpha. FWER is the *chance* of one or more
% false positives anywhere (whereas FDR is a *proportion* of false
% positives). A FWER adjusted p-value for a voxel is the smallest alpha
% such that the voxel would be suprathreshold.
%
% If there is truely no signal in the image anywhere, then a FDR
% proceedure controls FWER, just as Bonferroni and random field methods
% do. (Precisely, controlling E(FDR) yeilds weak control of FWE). If
% there *is* some signal in the image, a FDR method should be more powerful
% than a traditional method.
%
%
% References
%
% Benjamini & Hochberg (1995), "Controlling the False Discovery Rate: A
% Practical and Powerful Approach to Multiple Testing". J Royal Stat Soc,
% Ser B. 57:289-300.
%
% Benjamini & Yekutieli (2001), "The Control of the false discovery rate
% in multiple testing under dependency". To appear, Annals of Statistics.
% Available at http://www.math.tau.ac.il/~benja
%_______________________________________________________________________
% Copyright (C) 2013 The University of Warwick
% Id: snpm_uc_FDR.m SnPM13 2013/10/12
% Thomas Nichols
% Based on FIL spm_uc_FDR.m
if (nargin<6), Vm = []; end
if n>0
Cnj_n = 1; % Inf on Conjunction Null
else
Cnj_n = abs(n); % Inf on Global Null
end
% Set Benjamini & Yeuketeli cV for independence/PosRegDep case
%-----------------------------------------------------------------------
cV = 1;
% Load, mask & sort statistic image (if needed)
%-----------------------------------------------------------------------
if isstruct(Vs)
if (abs(n) ~= length(Vs))
error('SnPM:Invalidn', sprintf('n & number of mapped images doesn''t match (%d,%d)',...
abs(n),length(Vs)));
end
Ts = spm_read_vols(Vs(1));
for i = 2:abs(n)
Ts = min(Ts,spm_read_vols(Vs(i)));
end
if ~isempty(Vm)
if isstruct(Vm)
Ts(spm_read_vols(Vm)==0) = [];
elseif (prod(size(Vm))==1)
Ts(Ts==Vm) = [];
else
Ts = Ts(Vm);
end
end
Ts(isnan(Ts)) = [];
Ts = sort(Ts(:));
if STAT ~= 'P', Ts = flipud(Ts); end
end
% Calculate p values of image (if needed)
%-----------------------------------------------------------------------
if isstruct(Vs)
if STAT == 'Z'
Ps = (1-spm_Ncdf(Ts)).^Cnj_n;
elseif STAT == 'T'
Ps = (1-spm_Tcdf(Ts,df(2))).^Cnj_n;
elseif STAT == 'X'
Ps = (1-spm_Xcdf(Ts,df(2))).^Cnj_n;
elseif STAT == 'F'
Ps = (1-spm_Fcdf(Ts,df)).^Cnj_n;
end
else
Ps = Vs;
end
S = length(Ps);
% Calculate FDR inequality RHS
%-----------------------------------------------------------------------
Fi = (1:S)'/S*q/cV;
% Find threshold
%-----------------------------------------------------------------------
I = max(find(Ps<=Fi));
if isempty(I)
if STAT == 'P'
u = 0;
else
u = Inf;
end
else
if isstruct(Vs)
u = Ts(I);
else
% We don't have original statistic values; determine from p-value...
if STAT == 'Z'
u = spm_invNcdf(1-Ps(I));
elseif STAT == 'T'
u = spm_invTcdf(1-Ps(I),df(2));
elseif STAT == 'X'
u = spm_invXcdf(1-Ps(I),df(2));
elseif STAT == 'F'
u = spm_invFcdf(1-Ps(I),df);
% ... except in case when we want a P-value
elseif STAT == 'P'
u = Ps(I);
end
end
end