[BUG] Qvalue calculation is too conservative #171

bvenn · 2022-01-18T09:05:42Z

Describe the bug
In FSharp.Stats.Testing.Multiple.Qvalues, local FDRs are calculated and then smoothed so that the q value of p_i is the minimal FDR over all p values greater than p_i.

While the local FDR calculation is correct, the smoothing does not take the minimal FDR of the p values greater than p_i, but the maximal FDR of the p values lower than p_i, which makes the computation more conservative than necessary.
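To make the difference concrete, here is a minimal sketch of both smoothing directions. The function names are hypothetical and this is not the FSharp.Stats code; it assumes the local FDRs are already ordered by ascending p value and, as a simplification, lets each position take part in its own minimum or maximum.

```fsharp
// Hypothetical helpers for illustration; not the FSharp.Stats implementation.
// Both expect local FDR estimates ordered by ascending p value.

/// Intended smoothing: q_i = min of fdr_j over all j >= i.
let smoothByMinOfLarger (localFdrs: float[]) : float[] =
    // scanBack runs from the largest p value down, carrying the running minimum.
    // Its result ends with the initial state (infinity), which is dropped.
    Array.scanBack min localFdrs infinity
    |> Array.take localFdrs.Length

/// Overly conservative variant: q_i = max of fdr_j over all j <= i.
let smoothByMaxOfSmaller (localFdrs: float[]) : float[] =
    // scan runs from the smallest p value up, carrying the running maximum.
    // Its result starts with the initial state (negative infinity), which is dropped.
    Array.scan max System.Double.NegativeInfinity localFdrs
    |> Array.tail

let localFdrs = [| 0.02; 0.05; 0.03; 0.08; 0.06 |]
let intended = smoothByMinOfLarger localFdrs      // [|0.02; 0.03; 0.03; 0.06; 0.06|]
let conservative = smoothByMaxOfSmaller localFdrs // [|0.02; 0.05; 0.05; 0.08; 0.08|]
```

Both results are monotonically non-decreasing, but the second one never falls below the first, which is the extra conservativeness described above.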

Solution
Modify the bindBy function accordingly.

fix bindBy
add unit tests


bvenn · 2022-01-18T13:02:34Z

The issue is more complex than I thought. The strategy works for strictly monotonic p values, but if many identical p values exist, the sorting corrupts the q value smoothing: when many keys (p values) are identical, it is not clear which index to choose.

Reproduce


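#r "nuget: Plotly.NET, 2.0.0-preview.16"
open Plotly.NET

// Items that are reordered alongside the keys; initially 0 .. 9999.
let index = Array.init 10000 id

// 10000 keys in ascending order with a long run of ties at 5000.
let testValues =
    [|
        [|1. .. 5000.|]
        Array.create 2000 5000.
        [|5001. .. 8000.|]
    |]
    |> Array.concat

testValues |> Array.indexed |> Chart.Point |> Chart.show
// System.Array.Sort is an unstable sort: items of equal keys may be permuted.
System.Array.Sort(testValues, index)
// The keys were already sorted, so a stable sort would leave index unchanged.
index |> Array.indexed |> Chart.Point |> Chart.show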

Edit: When Seq.sort or List.sort is used instead of Array.Sort the problem seems to be solved.
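One way to make the order of tied p values deterministic, independent of whether the sort is stable, is to include the original index in the sort key. The following is only a sketch of that idea with a hypothetical helper name; it is not what the library does.

```fsharp
// Hypothetical helper for illustration; not part of FSharp.Stats.
// Returns (originalIndex, pValue) pairs ordered by ascending p value.
// Ties are ordered by original index, so every key is unique and the
// result does not depend on the stability of the underlying sort.
let sortPValuesWithIndex (pValues: float[]) : (int * float)[] =
    pValues
    |> Array.indexed
    |> Array.sortBy (fun (i, p) -> p, i)

let sorted = sortPValuesWithIndex [| 0.2; 0.01; 0.01; 0.01; 0.5 |]
// [|(1, 0.01); (2, 0.01); (3, 0.01); (0, 0.2); (4, 0.5)|]
```

The original indices travel with the values, so q values computed on the sorted array can be mapped back to the input order afterwards.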


bvenn · 2022-01-18T16:34:54Z

The standard q value implementation is fixed. I decided to omit the bindBy function, since it reduces the readability and causes harm when the p value collection is too large. The monotonization of the q values is now packed within the respective function. Unit tests must be corrected and the Qvalues.ofPvaluesRobust requires further inspection of validity and proper documentation.


bvenn · 2022-01-19T13:51:16Z

The robust q value version has an additional term that corrects small p values, especially when the number of tests is low. It is described in equation 9 of Storey, J.D. (2002), A direct approach to false discovery rates. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64: 479-498. https://doi.org/10.1111/1467-9868.00346
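For orientation, a common way to write the robust estimate is sketched below. The notation is an assumption and not taken from the library: $p_{(1)} \le \dots \le p_{(m)}$ are the sorted p values, $m$ is the number of tests, and $\hat{\pi}_0$ is the estimated proportion of true null hypotheses. Whether Qvalues.ofPvaluesRobust matches this form term by term is not confirmed here.

```latex
\hat{q}\bigl(p_{(i)}\bigr)
  = \min_{j \ge i}
    \frac{\hat{\pi}_0 \, m \, p_{(j)}}
         {j \, \bigl(1 - (1 - p_{(j)})^{m}\bigr)}
```

The factor $1 - (1 - p_{(j)})^{m}$ is at most 1, so it can only increase the estimate. It approaches 1 when $m$ is large and $p_{(j)}$ is not tiny, which is why the correction matters mainly for small p values and few tests.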


bvenn added the bug label Jan 18, 2022

bvenn self-assigned this Jan 18, 2022

bvenn added a commit that referenced this issue Jan 18, 2022: fix q value calculation (b62743d, #171)

bvenn added a commit that referenced this issue Jan 18, 2022: add q value test (7ba8c4a, #171)

bvenn added a commit that referenced this issue Jan 19, 2022: update q value tests (a815d5f, #171)

bvenn mentioned this issue Jan 19, 2022: fix q value calculation #172 (Merged, 2 tasks)

bvenn closed this as completed in #172 Jan 19, 2022

bvenn added a commit that referenced this issue Jan 19, 2022: Merge pull request #172 from fslaborg/#171-fix-qvalue, fix q value calculation (d424857)
