Data structure for filters #2153

tcompa · 2024-12-19T09:16:53Z

As of fractal-analytics-platform/fractal-web#678 (comment), we are not anymore defining attribute filters for workflow tasks. But we still have type filters. On the other hand, we are introducing attribute filters for jobs, but not type filters. This would be a validation layer which is not defined in the database schema but rather in the application/API schemas, which is a scenario we always try to avoid (it's more complex and error prone, and it requires redundant definitions).

Our suggestion (with @mfranzon) is that we should also split the data structure in two parts, attributes and types, rather than having single homogeneous filters objects + validation.

We describe the different options below, and suggest that we pick the third (or even fourth) one.

Notes:

We describe the different options under the assumption that we don't need the "exclude" field any more (see Drop attributes_exclude and keep attributes_include #2154), but this is not a critical point.
Task input_types are not used for filtering, but rather to validate the image list after filtering took place. Thus they are not related to the current discussion, and they would not require any change.

1. Current situation

DatasetV2.filters = {"types": {"is_3D": false}, "attributes": {"well": "B03"}}
WorkflowTaskV2.intput_filters: {"types": {"is_3D": false}, "attributes": {"well": "B03"}}

2. Original plan for complex filters

DatasetV2.filters = {"types": {"is_3D": false}, "attributes": {"well": ["B03"]}}   # note that we now have a list of possible wells
WorkflowTaskV2.input_filters: {"types": {"is_3D": false}}                          # note that here we only accept "types"
JobV2.filters = {"attributes": {"well": ["B03"]}}                                  # note that here we only accept "attributes"

3. Improved plan for complex filters

DatasetV2.type_filters = {"is_3D": false}
DatasetV2.attribute_filters = {"well": ["B03"]}
WorkflowTaskV2.type_filters: {"is_3D": false}
JobV2.attribute_filters = {"well": ["B03"]}}

At the cost of one additional field, we are removing one level of nesting from all these JSON objects.

4. Improved plan for complex filters / without attribute filters for dataset

To discuss: are dataset attribute filters actually relevant? If not, we could go even further and have

DatasetV2.type_filters = {"is_3D": false}
WorkflowTaskV2.type_filters: {"is_3D": false}
JobV2.attribute_filters = {"well": "B03"}}

The text was updated successfully, but these errors were encountered:

jluethi · 2024-12-19T12:52:48Z

I'd be in favor of 3. I do think there is likely a point for dataset attribute filters.

tcompa · 2025-01-07T11:24:42Z

Recap:

New columns:

DatasetV2.type_filters = {"is_3D": false}
DatasetV2.attribute_filters = {"well": ["B03"]}
WorkflowTaskV2.type_filters: {"is_3D": false}
JobV2.attribute_filters = {"well": ["B03"]}}

Old columns cannot be removed from db models until the new release is deployed - because they are needed for the data migration.
Old columns should be removed from API schemas.
This then must be implemented in the runner as well - see upcoming issue

tcompa added the flexibility Support more workflow-execution use cases label Dec 19, 2024

jluethi added this to Fractal Project Management Dec 19, 2024

github-project-automation bot moved this to TODO in Fractal Project Management Dec 19, 2024

This was referenced Dec 19, 2024

To discuss: Replace attribute filters rather than merging them #2155

Open

Define backwards compatibility for complex filters #2156

Open

tcompa mentioned this issue Jan 7, 2025

Show list of images that would be processed by the first task of a job fractal-analytics-platform/fractal-web#682

Open

tcompa changed the title ~~To discuss: Data structure for filters~~ Data structure for filters Jan 7, 2025

tcompa assigned ychiucco Jan 7, 2025

ychiucco linked a pull request Jan 7, 2025 that will close this issue

Data structure for filters #2168

Draft

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data structure for filters #2153

Data structure for filters #2153

tcompa commented Dec 19, 2024 •

edited

Loading

jluethi commented Dec 19, 2024

tcompa commented Jan 7, 2025

Data structure for filters #2153

Data structure for filters #2153

Comments

tcompa commented Dec 19, 2024 • edited Loading

1. Current situation

2. Original plan for complex filters

3. Improved plan for complex filters

4. Improved plan for complex filters / without attribute filters for dataset

jluethi commented Dec 19, 2024

tcompa commented Jan 7, 2025

tcompa commented Dec 19, 2024 •

edited

Loading