Conversation
… flow names
The `GET /flows/exists/{name}/{external_version}` endpoint broke for flows
with URI-unsafe characters in their names (e.g. sklearn flows with
parentheses). Replaced it with `POST /flows/exists`, accepting `name` and
`external_version` in the request body, resolving issue openml#166.
Closes openml#166
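The failure mode behind this change can be sketched with the standard library alone (the flow name below is hypothetical, not taken from the OpenML database):

```python
from urllib.parse import quote

# Hypothetical sklearn-style flow name with characters that are awkward
# inside a URL path segment.
name = "sklearn.pipeline.Pipeline(steps=[('clf', DecisionTree())])"

# Parentheses must be percent-encoded to travel in a path segment:
encoded = quote(name, safe="")
assert "%28" in encoded  # '(' became %28

# Worse, '/' is left unescaped by quote() by default, so a name containing
# it would be split into extra path segments by the router:
assert quote("a/b") == "a/b"
assert quote("a/b", safe="") == "a%2Fb"
```

Sending the name in a POST body sidesteps all of this, since request bodies are not subject to path parsing.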
for more information, see https://pre-commit.ci
…cated wrapper
- Move FlowExistsBody to schemas/flows.py for reusability
- Keep GET /flows/exists/{name}/{version} as a deprecated thin wrapper
around POST /flows/exists for backward compatibility
- Update test imports accordingly
Walkthrough

The flows existence endpoint was changed from `GET /flows/exists/{name}/{external_version}` to `POST /flows/exists`, accepting a new Pydantic model `FlowExistsBody(name, external_version)`. A deprecated GET endpoint at `/flows/exists/{name}/{external_version}` now delegates to the POST handler. Tests were updated to use the POST API (and include a legacy-GET test). The GitHub Actions workflow adds a job-level `TEST_MARKER` env var and splits tests into "Run internal tests" and "Run API tests"; the coverage and Codecov upload steps remain.

🚥 Pre-merge checks: 3 passed, 2 failed (1 warning, 1 inconclusive).
@PGijsbers
Actionable comments posted: 2
🧹 Nitpick comments (2)
.github/workflows/tests.yml (1)
37-44: The test split looks good and achieves the stated objective. Coverage is correctly handled with a fresh `coverage run` for the internal tests and append mode (`-a`) for the API tests. Both steps share the same container, preserving combined coverage. One minor observation: the `marker` expression is duplicated verbatim on lines 39 and 43. Consider extracting it to a job-level environment variable to reduce drift risk:

♻️ Optional: extract the marker to a job-level env var

```diff
 jobs:
   tests:
     name: "${{ matrix.php_api == true && 'Migration' || 'Python-only' }} ${{ matrix.mutations == true && 'with mutations' || 'read-only' }}"
+    env:
+      TEST_MARKER: "${{ matrix.php_api == true && 'php_api' || 'not php_api' }} and ${{ matrix.mutations == true && 'mut' || 'not mut' }}"
     strategy:
```

Then in each step:

```diff
   - name: Run internal tests
     run: |
-      marker="${{ matrix.php_api == true && 'php_api' || 'not php_api' }} and ${{ matrix.mutations == true && 'mut' || 'not mut' }}"
-      docker exec openml-python-rest-api coverage run -m pytest tests/database tests/config_test.py -n auto -v -m "$marker"
+      docker exec openml-python-rest-api coverage run -m pytest tests/database tests/config_test.py -n auto -v -m "$TEST_MARKER"
   - name: Run API tests
     run: |
-      marker="${{ matrix.php_api == true && 'php_api' || 'not php_api' }} and ${{ matrix.mutations == true && 'mut' || 'not mut' }}"
-      docker exec openml-python-rest-api coverage run -a -m pytest tests/routers -n auto -v -m "$marker"
+      docker exec openml-python-rest-api coverage run -a -m pytest tests/routers -n auto -v -m "$TEST_MARKER"
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `.github/workflows/tests.yml` around lines 37-44: extract the duplicated marker expression into a job-level environment variable (e.g., `TEST_MARKER`) and update the two steps "Run internal tests" and "Run API tests" to reference that env var instead of repeating the expression, so the value is defined once and reused, reducing duplication and drift risk.

src/schemas/flows.py (1)
9-11: Mirror the existing name/version bounds on the request model. `FlowExistsBody` currently accepts unbounded strings, while `Flow.name` and `Flow.external_version` in this file already cap the same fields at 1024 and 128 characters. Keeping the lookup payload consistent would fail fast on impossible values instead of sending them to the database.

Suggested diff:

```diff
 class FlowExistsBody(BaseModel):
-    name: str
-    external_version: str
+    name: str = Field(max_length=1024)
+    external_version: str = Field(max_length=128)
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `src/schemas/flows.py` around lines 9-11: `FlowExistsBody` accepts unbounded strings whereas the `Flow` model caps these fields; update `FlowExistsBody` to enforce the same length bounds by using constrained types (e.g., `pydantic.constr` or `Field` with `max_length`) so `name` has `max_length=1024` and `external_version` has `max_length=128`, keeping the field names exactly `FlowExistsBody.name` and `FlowExistsBody.external_version` to mirror `Flow.name` and `Flow.external_version`.
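A framework-free sketch of the proposed behaviour (the real model would use Pydantic's `Field(max_length=...)`; the 1024/128 bounds come from the existing `Flow` model):

```python
from dataclasses import dataclass

# Sketch only: enforce the same bounds as the Flow model, so an impossible
# lookup value is rejected before it ever reaches the database.
@dataclass
class FlowExistsBody:
    name: str
    external_version: str

    def __post_init__(self) -> None:
        if len(self.name) > 1024:
            raise ValueError("name exceeds 1024 characters")
        if len(self.external_version) > 128:
            raise ValueError("external_version exceeds 128 characters")

FlowExistsBody(name="x" * 1024, external_version="1.0")  # accepted
try:
    FlowExistsBody(name="x" * 1025, external_version="1.0")
except ValueError as err:
    print(err)  # rejected before any database round-trip
```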
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@src/routers/openml/flows.py`:
- Around line 34-41: Add a small smoke test that exercises the deprecated GET
shim function flow_exists_get by calling the route GET
/flows/exists/{name}/{external_version} via the test client and asserting it
returns the same structure/value as the POST /flows/exists helper (or directly
asserts {"flow_id": <expected>}), to prevent regressions; locate the test near
other flows tests, create or reuse a fixture to ensure the flow exists (or use
an existing FlowExistsBody/flow creation helper), call
client.get(f"/flows/exists/{name}/{external_version}"), and assert status code
200 and the JSON payload matches the expected flow_id.
In `@tests/routers/openml/flows_test.py`:
- Around line 65-75: Add a regression test for URI-unsafe names by creating (or
using a Flow fixture) whose name contains a reserved character like "/" and then
POSTing to "/flows/exists" with that name and its external_version; assert
HTTPStatus.OK and that the response body equals {"flow_id": flow.id}. Implement
this as a new test (e.g., test_flow_exists_uri_unsafe) alongside
test_flow_exists and test_flow_exists_not_exists, using the same py_api:
TestClient invocation pattern so the lookup works when names cannot be placed in
the URL path.
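The constraint noted above (the lookup only works when the name cannot be placed in the URL path) can be illustrated with a framework-free sketch; the flow name is hypothetical:

```python
import json

# A URI-unsafe name survives a JSON request body untouched, which is the
# point of moving the lookup from the URL path into a POST body.
name = "my.module/MyFlow"  # hypothetical name containing '/'
body = json.dumps({"name": name, "external_version": "1.0"})
assert json.loads(body)["name"] == name

# The same name interpolated into the old GET path is split by the router
# into extra segments, so the {name} placeholder can never match it:
path = f"/flows/exists/{name}/1.0"
assert path.split("/")[3:] == ["my.module", "MyFlow", "1.0"]
```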
ℹ️ Review info
⚙️ Run configuration
Configuration used: Repository UI
Review profile: CHILL
Plan: Pro
Run ID: d101f279-5c5e-4b51-b629-22d329ba2d0c
📒 Files selected for processing (5)
.github/workflows/tests.yml
src/routers/openml/flows.py
src/schemas/flows.py
tests/routers/openml/flows_test.py
tests/routers/openml/migration/flows_migration_test.py
```python
@router.get("/exists/{name}/{external_version}", deprecated=True)
def flow_exists_get(
    name: str,
    external_version: str,
    expdb: Annotated[Connection, Depends(expdb_connection)],
) -> dict[Literal["flow_id"], int]:
    """Deprecated: use POST /flows/exists instead."""
    return flow_exists(FlowExistsBody(name=name, external_version=external_version), expdb)
```
Please add coverage for the deprecated shim.

`flow_exists_get` is now part of the public compatibility surface, but none of the updated tests exercise `GET /flows/exists/{name}/{external_version}`. A small smoke test would keep that wrapper from silently drifting.
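The property such a smoke test should pin down can be sketched without FastAPI (handler bodies and values below are stand-ins, not the real implementation):

```python
# Stand-in for the POST /flows/exists handler; the real one queries expdb.
def flow_exists(name: str, external_version: str) -> dict:
    return {"flow_id": 1}

# Deprecated GET shim: pure delegation, no logic of its own.
def flow_exists_get(name: str, external_version: str) -> dict:
    return flow_exists(name, external_version)

# The smoke test's core assertion: both routes report the same flow_id.
assert flow_exists_get("weka.J48", "1") == flow_exists("weka.J48", "1")
assert flow_exists_get("weka.J48", "1") == {"flow_id": 1}
```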
- Add `Field(max_length=)` constraints to `FlowExistsBody.name` (1024) and `.external_version` (128) to mirror the bounds already on the `Flow` model
- Add smoke test for the deprecated GET shim (`flow_exists_get`) to prevent silent regressions on the backward-compatibility surface
- Add regression test for URI-unsafe flow names (containing '/') via POST /flows/exists, guarding the core motivation for the GET→POST migration
- Extract duplicated marker expression into a job-level `TEST_MARKER` env var in tests.yml to eliminate drift risk

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Force-pushed from 331ab1d to 45af02a.
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.github/workflows/tests.yml:
- Around line 39-44: The two GitHub Actions steps "Run internal tests" and "Run
API tests" are in the same job so a failure in the first step short-circuits the
second; update .github/workflows/tests.yml by either moving the "Run API tests"
step into its own job so it runs independently, or add an explicit condition to
the later step (the "Run API tests" step) such as if: ${{ always() }} so it
executes and records its result even if the "Run internal tests" step fails;
reference the step names "Run internal tests" and "Run API tests" when making
the change.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Repository UI
Review profile: CHILL
Plan: Pro
Run ID: f07db707-e2b6-4af9-a7bc-417f58966c78
📒 Files selected for processing (3)
.github/workflows/tests.yml
src/schemas/flows.py
tests/routers/openml/flows_test.py
Ensures both suites report independently so a failure in internal tests does not mask API test results, which is the whole point of the split.
🧹 Nitpick comments (1)
.github/workflows/tests.yml (1)
42-51: Keep coverage publishing running after a suite fails. Line 43 fixes the split-test short-circuit, but lines 46-51 still use the default `success()` condition. If either suite fails, `coverage xml` and the Codecov upload are skipped, so the combined `coverage run -a` data never gets surfaced on the runs where it is most useful.

Suggested follow-up:

```diff
   - name: Run API tests
     if: always()
     run: |
       docker exec openml-python-rest-api coverage run -a -m pytest tests/routers -n auto -v -m "$TEST_MARKER"
   - name: Produce coverage report
+    if: ${{ !cancelled() }}
     run: docker exec openml-python-rest-api coverage xml
   - name: Upload results to Codecov
+    if: ${{ !cancelled() }}
     uses: codecov/codecov-action@v4
     with:
       token: ${{ secrets.CODECOV_TOKEN }}
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `.github/workflows/tests.yml` around lines 42-51: the "Produce coverage report" and "Upload results to Codecov" steps currently run only on success and are skipped when a test suite fails; update those steps to run even after failures (e.g., `if: ${{ !cancelled() }}`) so the coverage XML and the Codecov upload are published for failed suites as well, ensuring the combined `coverage run -a` data is surfaced.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Repository UI
Review profile: CHILL
Plan: Pro
Run ID: ea7754ee-952d-4c2d-a897-337dd8ba8c8c
📒 Files selected for processing (1)
.github/workflows/tests.yml
Summary
Closes #177
Server testing previously ran as a single undifferentiated block, making it hard to tell whether a failure was an internal logic regression or a public API contract breakage.
This PR splits the `Run tests` step in the CI workflow into two explicit steps:

- `Run internal tests` runs `tests/database/` and `tests/config_test.py` to validate database logic and configuration internals.
- `Run API tests` runs `tests/routers/` to validate HTTP response contracts, using `coverage run -a` (append mode) so combined coverage is preserved correctly.

Changes

- Replaced the single `Run tests` step with two dedicated `Run internal tests` and `Run API tests` steps.

Testing
No new test logic is added — only the CI execution is reorganised. Existing tests cover all assertions.
Verified against the existing `docker compose up` setup.