Improve median filter #150

cgohlke · 2024-11-28T20:39:25Z

Description

This PR improves upon #147, simplifying the implementation and boosting performance of median filtering:

Do not call _quickselect twice in _median function for even-sized kernels.
The functions are now merged into one _median function.
Move "repeat" loop into _apply_2d_median_filter function nogil section.
Separate "repeat" loops for real and imag to improve memory efficiency.
Merge _median_filter_2d and _median_filter_nd into one _median_filter function.
Rename _apply_2d_median_filter to _median_filter_2d.
Use separate nan_mask for real and imag to match Cython implementation.
Move median filter after thresholding in introductory tutorial. That should be better practice now that the median filter correctly handles NaN, no? Added a note that "filtered coordinates can no longer be used to reconstruct the original signal" (is there a better phrasing? Should that be added to the function docstring?)

Checklist

The pull request title and description are concise.
Related issues are linked in the description.
New dependencies are explained.
The source code and documentation can be distributed under the MIT license.
The source code adheres to code standards.
New classes, functions, and features are thoroughly tested.
New, user-facing classes, functions, and features are documented.
New features are covered in tutorials.
No files other than source code, documentation, and project settings are added to the repository.

src/phasorpy/_phasorpy.pyx

src/phasorpy/phasor.py

bruno-pannunzio · 2024-11-29T15:24:51Z

Hi @cgohlke! Thanks for implementing this improvements for median filtering. The modifications helped improving a little bit more the execution times, since now it seems to be around 20% faster than Scipy's (compared to 10-15% faster from what is now in main).

Use separate nan_mask for real and imag to match Cython implementation.

This is something we can discuss if it makes sense to apply the same logic as the phasor_threshold for propagating NaN across real and imag so that same values are invalid in all components of the phasor. But we can keep it like this I think and remember to discuss it next meeting.

Move median filter after thresholding in introductory tutorial. That should be better practice now that the median filter correctly handles NaN, no?

I think it would be a good idea to have the input of @lmalacrida on this. I remember he once said to me it's better practice to first filter and then threshold. I tried first thresholding and then filtering and the result phasor has wider spread and doesn't look as nice (even with the current implementation which is NaN consistent).

This is an example result of first thresholding and then filtering:

And this is first filtering and then thresholding where you can see it is more compact and fewer pixels lie outside the semicircle:

Added a note that "filtered coordinates can no longer be used to reconstruct the original signal" (is there a better phrasing? Should that be added to the function docstring?)

This can be a good idea to add to the docstring as a warning that once performed the filter you can't go back to the signal. Also maybe add a note in the phasor_to_signal function stating the same warning?

cgohlke · 2024-11-29T16:40:17Z

it's better practice to first filter and then threshold

OK, I reverted the changes to the tutorial since they are disputed.

apply the same logic as the phasor_threshold for propagating NaN across real and imag so that same values are invalid in all components of the phasor.

This would need to be documented and tested. And it should be implemented for all code paths, not just one. The filter would no longer be independent for real and imag. The logic should also apply to the intensity (at least optionally). That would probably be better implemented in a separate function (?)

I think it is easier to implement and document if the median filter is applied independently (as it is now).

lmalacrida · 2024-11-29T17:08:32Z

@bruno-pannunzio I agree that Filtering should be performed on the entire image array, regardless of thresholding or before it. This is how i believe was performed at SimFCS.

cgohlke · 2024-11-29T17:13:18Z

I agree that Filtering should be performed on the entire image array, regardless of thresholding or before it. This is how i believe was performed at SimFCS.

And the scientific reason is?

cgohlke · 2024-11-29T17:15:43Z

That would probably be better implemented in a separate function (?)

That's what phasor_threshold already does by default.

cgohlke · 2024-11-29T18:20:01Z

This is how i believe was performed at SimFCS.

I skimmed through the SimFCS source code. At least in some places, a minimum intensity threshold is applied as part of calculating phasor coordinates, that is before applying median filter. Phasor coordinates for intensities below threshold are set to zero, which are then included in the median filter (I presume). Other code paths in SimFCS might or might not do things differently.

For comparison: PhasorPy does not apply any intensity threshold by default when calculating phasor coordinates and sets phasor coordinates with no intensity to NaN. The phasor_threshold function sets values below or above thresholds to NaN. NaN values are ignored in the median filter.

bruno-pannunzio

Christoph I think it makes sense what you propose here and keep the workflow as it is now.

Lets keep it like this with the modifications proposed here, where everything is very well documented.

Improve median filter

01a8e0f

cgohlke added the enhancement New feature or request label Nov 28, 2024

cgohlke requested a review from bruno-pannunzio November 28, 2024 20:39

cgohlke self-assigned this Nov 28, 2024

bruno-pannunzio reviewed Nov 29, 2024

View reviewed changes

src/phasorpy/_phasorpy.pyx Show resolved Hide resolved

bruno-pannunzio reviewed Nov 29, 2024

View reviewed changes

src/phasorpy/phasor.py Show resolved Hide resolved

bruno-pannunzio reviewed Nov 29, 2024

View reviewed changes

src/phasorpy/phasor.py Show resolved Hide resolved

Revert changes to introductory tutorial

2794806

cgohlke requested a review from bruno-pannunzio November 29, 2024 18:23

bruno-pannunzio approved these changes Nov 29, 2024

View reviewed changes

cgohlke merged commit f6b96c0 into phasorpy:main Nov 29, 2024
14 checks passed

cgohlke deleted the improve-median branch November 29, 2024 19:09

cgohlke mentioned this pull request Dec 1, 2024

Improve consistency of handling NaN by phasor.phasor_filter #87

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve median filter #150

Improve median filter #150

cgohlke commented Nov 28, 2024 •

edited

Loading

bruno-pannunzio commented Nov 29, 2024

cgohlke commented Nov 29, 2024

lmalacrida commented Nov 29, 2024

cgohlke commented Nov 29, 2024

cgohlke commented Nov 29, 2024

cgohlke commented Nov 29, 2024

bruno-pannunzio left a comment

Improve median filter #150

Improve median filter #150

Conversation

cgohlke commented Nov 28, 2024 • edited Loading

Description

Checklist

bruno-pannunzio commented Nov 29, 2024

cgohlke commented Nov 29, 2024

lmalacrida commented Nov 29, 2024

cgohlke commented Nov 29, 2024

cgohlke commented Nov 29, 2024

cgohlke commented Nov 29, 2024

bruno-pannunzio left a comment

Choose a reason for hiding this comment

cgohlke commented Nov 28, 2024 •

edited

Loading