Fix MCC for DDP and multitask #1131

KnathanM · 2024-12-20T23:27:27Z

Our implementation of the Matthews correlation coefficient (MCC) does not work when using DDP. This is because torchmetrics automatically concatenates the state variables from different batches when DDP is used, so by the time our MCC compute method is called, the state variables are already tensors instead of lists of tensors. torchmetrics gets around this with its function dim_zero_cat, which checks whether its input is already a tensor before concatenating; see this example in cosine similarity.
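To illustrate the check described above, here is a minimal sketch of the idea. The helper name cat_state is hypothetical and only mirrors the behavior attributed to dim_zero_cat; it is not the torchmetrics source.

```python
import torch


def cat_state(state):
    """Hypothetical stand-in for the check described above.

    Without DDP the metric state is a list of per-batch tensors.
    With DDP it may already have been concatenated into one tensor.
    """
    if isinstance(state, torch.Tensor):
        # Already concatenated (the DDP case): return unchanged.
        return state
    # List of per-batch tensors: join them along dimension 0.
    return torch.cat(state, dim=0)


# Two batches of predictions, stored as a list (the non-DDP case).
batches = [torch.tensor([1, 0, 1]), torch.tensor([0, 1])]
from_list = cat_state(batches)

# The same data arriving as a single tensor (the DDP case).
from_tensor = cat_state(torch.cat(batches, dim=0))
```

Either form of the state yields the same tensor, so a compute method written this way does not need to know whether DDP was used.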

The multiclass MCC has the same problem, with the added difficulty that we dropped the batch and task dimensions and then stacked along a new dimension when collecting batches. I've changed this to keep the dimension until compute. Because we dropped the task dimension early, our MulticlassMCC has been giving incorrect results for multitask. I have updated the tests to reflect the actual expected values after comparing to sklearn. (sklearn doesn't support multitask, so I calculated the MCC for each task separately and then averaged.)

For reference, in MulticlassMCC p is the number of times each class was predicted, t is the number of times each class was the true value, c is the number of points predicted correctly for each task, and s is the number of points for each task.
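As a rough illustration of how these four quantities combine, the sketch below computes the multiclass MCC separately for each task and then averages, using the standard count-based formula MCC = (c*s - sum_k p_k*t_k) / sqrt((s^2 - sum_k p_k^2) * (s^2 - sum_k t_k^2)). The function name, the (n_points, n_tasks) input layout, and the choice to return 0 when the denominator is zero are assumptions made for this example, not the code in this PR, and missing targets are not handled.

```python
import torch
import torch.nn.functional as F


def multitask_multiclass_mcc(preds, targets, num_classes):
    """Sketch: per-task multiclass MCC, averaged over tasks.

    preds, targets: integer class labels, shape (n_points, n_tasks).
    The task dimension is kept until the final average.
    """
    pred_onehot = F.one_hot(preds, num_classes)    # (n_points, n_tasks, n_classes)
    targ_onehot = F.one_hot(targets, num_classes)

    p = pred_onehot.sum(0).double()                # times each class was predicted
    t = targ_onehot.sum(0).double()                # times each class was the true value
    c = (preds == targets).sum(0).double()         # correct points per task
    s = torch.full_like(c, preds.shape[0])         # points per task

    cov_pt = c * s - (p * t).sum(-1)
    cov_pp = s**2 - (p**2).sum(-1)
    cov_tt = s**2 - (t**2).sum(-1)
    denom = torch.sqrt(cov_pp * cov_tt)

    # Assumption: define MCC as 0 when the denominator vanishes.
    per_task = torch.where(denom == 0, torch.zeros_like(denom), cov_pt / denom)
    return per_task.mean()


# Task 0 is predicted perfectly; task 1 is uncorrelated with its targets.
targets = torch.tensor([[0, 0], [1, 0], [2, 1], [0, 1]])
preds = torch.tensor([[0, 0], [1, 1], [2, 0], [0, 1]])
mcc = multitask_multiclass_mcc(preds, targets, num_classes=3)
```

Here the per-task values are 1 and 0. Pooling the tasks before counting would instead treat all eight points as one task, which is the kind of mismatch that dropping the task dimension early can introduce.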

fix MCC for DDP and multitask

530b2ba
