Task label feature refined #983

ottointhesky · 2026-02-12T11:58:18Z

This merge request will contain small improvements to the documentation and unit tests of the label feature. For now, only unit tests were added, which check submitted labels with the dictDB and sqliteDB backends.

A couple of days ago I realized that the label is written to the DB twice, which is maybe unwanted (since it wastes resources):

We need the label as an explicit column to make it queryable. Hence, we could remove the entry from the metadata before writing the record to the database. This can be handled centrally. However, retrieving a record then requires re-adding the label to the metadata, which makes everything more complicated since it needs specific handling for the different DB backends. So is it worth the effort, since the label will be empty for most users anyway? Probably not...
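To make the trade-off concrete, here is a minimal sketch of the strip/restore round trip described above. The function names `strip_label` and `restore_label` and the plain-dict record layout are assumptions for illustration only, not existing ipyparallel code.

```python
def strip_label(record):
    """Return a copy of `record` with the label moved out of the metadata.

    Hypothetical helper: the label ends up only in the top-level
    'label' column, so it is stored once.
    """
    metadata = dict(record.get("metadata") or {})
    label = metadata.pop("label", None)
    stored = dict(record)
    stored["metadata"] = metadata
    stored["label"] = label
    return stored


def restore_label(stored):
    """Return a copy of `stored` with the label re-added to the metadata.

    Hypothetical helper: every backend would have to call something
    like this on each retrieved record.
    """
    metadata = dict(stored.get("metadata") or {})
    if stored.get("label") is not None:
        metadata["label"] = stored["label"]
    record = dict(stored)
    record["metadata"] = metadata
    return record
```

The write side is a single central call, but the read side has to run on every code path that returns records, which is where the backend-specific handling would come in.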

As mentioned earlier, we also need the possibility to find records based on substrings within DB columns. In mongoDB syntax this can be achieved using regex. E.g.
{'label': {'$regex': 'my'}}
would find any record whose label contains the string my (at any position). This would require a new comparison operator ($regex) for the filter definition in ipp. Supporting this operator in dictDB shouldn't be too difficult, but for sqlite only a strongly reduced regex definition could be supported via LIKE. SQL LIKE basically only supports wildcards (single- and multi-character matching), so a regex containing only ^ $ . .* .+ could be translated. Anything else isn't possible.

So the question here is: should we extend the supported operators by $regex, or should we go a different/new way by adding the possibility of passing backend-specific filter objects (e.g. a lambda object for dictDB and WHERE clauses for sqliteDB)? If you are thinking of dropping support for mongoDB, the second option might be more appealing. If you do not want to drop support for mongoDB yet, I suggest that we add a mongoDB installation to the GitHub Actions. Using the following action script this shouldn't be too difficult. No matter which way you want to go, I'm happy to provide the necessary implementation...
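As a rough illustration of the first option, the sketch below translates the reduced regex subset (^ $ . .* .+ plus literal characters) into a LIKE pattern and rejects everything else. `regex_to_like` and `filter_records` are hypothetical names, not part of ipyparallel, and the backslash escape character is an assumption that has to be matched by an `ESCAPE '\'` clause in the SQL statement.

```python
import re


def regex_to_like(pattern, escape="\\"):
    """Translate a reduced regex (^ $ . .* .+ and literals) to SQL LIKE.

    Hypothetical helper. Raises ValueError for any other regex syntax.
    Caveat: SQLite's LIKE is case-insensitive for ASCII by default,
    while the regex would be case-sensitive.
    """
    body = pattern
    anchored_start = body.startswith("^")
    if anchored_start:
        body = body[1:]
    anchored_end = body.endswith("$")
    if anchored_end:
        body = body[:-1]

    out = []
    i = 0
    while i < len(body):
        ch = body[i]
        nxt = body[i + 1] if i + 1 < len(body) else ""
        if ch == ".":
            if nxt == "*":
                out.append("%")    # any run of characters, possibly empty
                i += 2
            elif nxt == "+":
                out.append("_%")   # at least one character
                i += 2
            else:
                out.append("_")    # exactly one character
                i += 1
        elif ch in "^$[](){}|?*+\\":
            raise ValueError(f"unsupported regex syntax {ch!r} in {pattern!r}")
        elif ch in "%_":
            out.append(escape + ch)  # literal LIKE wildcard must be escaped
            i += 1
        else:
            out.append(ch)
            i += 1

    like = "".join(out)
    if not anchored_start:
        like = "%" + like
    if not anchored_end:
        like = like + "%"
    return like


def filter_records(records, key, pattern):
    """dictDB-style matching: apply the regex directly with re.search."""
    rx = re.compile(pattern)
    return [r for r in records if isinstance(r.get(key), str) and rx.search(r[key])]
```

With this, `{'label': {'$regex': 'my'}}` would become `label LIKE '%my%' ESCAPE '\'` on the sqlite side, while dictDB could evaluate the regex unchanged.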

minrk · 2026-02-12T17:09:20Z

I don't think we need to worry about the cost of writing the label twice to make it queryable. It's quite small compared to anything else, so the impact will be negligible.

I don't imagine full regex search is going to be that useful, since users would only craft the labels specifically to make them searchable, I imagine wildcard matching is plenty.

If you wanted to put some time into testing mongodb, that would be super appreciated! If it takes too much of your time, just say so, and we can probably drop it.

ottointhesky · 2026-02-12T19:57:37Z

> I don't think we need to worry about the cost of writing the label twice to make it queryable. It's quite small compared to anything else, so the impact will be negligible.

Ok & thanks. I just wanted to double check with you...

> If you wanted to put some time into testing mongodb, that would be super appreciated! If it takes too much of your time, just say so, and we can probably drop it.

As presumed, adding mongodb to the GitHub tests was easy. supercharge/mongodb-github-action only works for Linux containers, but that's definitely better than no test. I also switched to the pymongo 4.x API and raise an exception if the pymongo version is below 4.
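For reference, a version guard of this kind can be sketched as follows. This is an illustration, not the code in the PR: the helper name `require_pymongo_4` and the choice of `ImportError` are assumptions. The tuple it expects is what pymongo exposes as `pymongo.version_tuple`.

```python
def require_pymongo_4(version_tuple):
    """Raise if the given pymongo version tuple is older than 4.x.

    Hypothetical helper; pass in `pymongo.version_tuple`.
    """
    major = version_tuple[0]
    if major < 4:
        found = ".".join(str(part) for part in version_tuple)
        raise ImportError(f"pymongo >= 4 is required, found {found}")
    return major
```

It would be called once at import time of the mongodb backend, e.g. `require_pymongo_4(pymongo.version_tuple)`.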

> I don't imagine full regex search is going to be that useful, since users would only craft the labels specifically to make them searchable, I imagine wildcard matching is plenty.

Agreed, but what should wildcard matching look like in Python code? So far the query object syntax is defined by mongodb (query objects are passed to mongodb untouched), and there is no wildcard syntax there. If we come up with something new, e.g. based on SQL LIKE

{'label': {'$like': '%my%'}}

query objects will need preprocessing for mongodb as well, which is NOT currently the case. Which direction should we go?
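To show what that preprocessing could look like for mongodb, here is a minimal sketch that rewrites a `$like` entry into mongodb's `$regex` operator before the query is passed on. `$like` itself, `like_to_regex` and `preprocess_query` are all hypothetical and only illustrate the idea; escape handling for literal % or _ is deliberately left out.

```python
import re


def like_to_regex(like):
    """Translate a LIKE pattern (% and _ wildcards) to an anchored regex.

    Hypothetical helper. '.' does not match newlines here, which is
    assumed to be fine for labels.
    """
    parts = []
    for ch in like:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return "^" + "".join(parts) + "$"


def preprocess_query(query):
    """Return a copy of `query` with every {'$like': ...} replaced by $regex."""
    if isinstance(query, list):
        return [preprocess_query(item) for item in query]
    if not isinstance(query, dict):
        return query
    out = {}
    for key, value in query.items():
        if key == "$like":
            out["$regex"] = like_to_regex(value)
        else:
            out[key] = preprocess_query(value)
    return out
```

With this, `{'label': {'$like': '%my%'}}` would be sent to mongodb as `{'label': {'$regex': '^.*my.*$'}}`, while sqlite could use the LIKE pattern directly and dictDB could reuse `like_to_regex`.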

ottointhesky · 2026-02-13T12:33:05Z

FYI: for whatever reason the mongodb container seems to interfere with the slurm container. Sometimes it works, but most of the time it doesn't. Deactivating mongodb via an if condition for the slurm test doesn't seem to work. Hopefully I can find a solution to this problem...

test code for labels added

fbab54a

Johannes Otepka and others added 11 commits February 12, 2026 18:52

tzinfo got lost when storing in mongodb

171bc98

Merge branch 'task_label_feature' of github.com:ottointhesky/ipyparal…

07923f8

…lel into task_label_feature

[pre-commit.ci] auto fixes from pre-commit.com hooks

2f1293f

for more information, see https://pre-commit.ci

make mongodb tzinfo aware

7f0400a

Merge remote-tracking branch 'origin/task_label_feature' into task_la…

6a887c1

…bel_feature # Conflicts: # ipyparallel/controller/mongodb.py

make mongodb tzinfo aware

a9e4266

[pre-commit.ci] auto fixes from pre-commit.com hooks

e473c81

for more information, see https://pre-commit.ci

switch to new mongodb api

5762cd6

mongodb installation added to github actions

6a36864

Merge remote-tracking branch 'origin/task_label_feature' into task_la…

29fe5e1

…bel_feature # Conflicts: # ipyparallel/controller/mongodb.py

ruff format changes

d6436d1

Johannes Otepka and others added 4 commits February 12, 2026 22:22

exclude mongodb isntall from slurm

1a2624e

Update test.yml

d4c6fab

Update test.yml

6a365f2

Update test.yml

3d20d63
