BMDeep tutorial added by DanielaSchacherer · Pull Request #95 · ImagingDataCommons/IDC-Tutorials

DanielaSchacherer · 2025-12-09T17:51:42Z

As discussed, I added the BMDeep tutorial. Happy for feedback! :)

review-notebook-app · 2025-12-09T17:51:47Z

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

DanielaSchacherer · 2026-01-22T12:38:30Z

@fedorov did we already talk about this? If not, maybe we should briefly discuss it in the Slim meeting later today!

Copilot

Copilot wasn't able to review any files in this pull request.

fedorov · 2026-01-22T14:19:09Z

I admit I lost track of this PR...

DanielaSchacherer · 2026-01-22T15:43:04Z

no problem, me too :D

fedorov · 2026-02-02T18:32:52Z

@DanielaSchacherer I pushed some minor changes, and also left some suggestions in the comments here: https://colab.research.google.com/drive/1kD_mEbfi1ozhyyS0WK9zl3vyKYMJJJjA?usp=sharing

DanielaSchacherer · 2026-02-04T12:25:42Z

I incorporated the feedback and replied to your comments in the notebook linked above. I already changed the code to use ann_index, so let's not merge yet; let me test as soon as ann_index is out :)

DanielaSchacherer · 2026-02-09T19:34:03Z

@fedorov do you have any additional feedback?

fedorov · 2026-02-09T20:19:39Z

You rely on "monolayer" in SeriesDescription in one of the queries, which is not a good pattern.

Why not use ann_group_index and search by Selected region?

Search by unstructured data is not desirable, especially when the query can be resolved using structured data.

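from idc_index import IDCClient

client = IDCClient()
client.fetch_index("ann_group_index")

# Select all annotation groups from ANN series in the
# bonemarrowwsi_pediatricleukemia collection whose annotation property type
# is coded as "Selected region" (structured data, no SeriesDescription match)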
monolayer_ann_groups = client.sql_query("""
    SELECT
        ag.SeriesInstanceUID,
        ag.AnnotationGroupNumber,
        ag.AnnotationGroupUID,
        ag.AnnotationGroupLabel,
        ag.NumberOfAnnotations,
        ag.GraphicType,
        ag.AnnotationPropertyCategory_CodeMeaning,
        ag.AnnotationPropertyType_CodeMeaning,
        ai.referenced_SeriesInstanceUID
    FROM ann_group_index ag
    JOIN ann_index ai
      ON ag.SeriesInstanceUID = ai.SeriesInstanceUID
    JOIN index i
      ON ag.SeriesInstanceUID = i.SeriesInstanceUID
    WHERE i.collection_id = 'bonemarrowwsi_pediatricleukemia'
      AND ag.AnnotationPropertyType_CodeMeaning = 'Selected region'
    ORDER BY ag.SeriesInstanceUID, ag.AnnotationGroupNumber
""")

print(f"Found {len(monolayer_ann_groups)} annotation groups "
      f"across {monolayer_ann_groups['SeriesInstanceUID'].nunique()} ANN series")
display(monolayer_ann_groups)
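
To make the difference between the two patterns concrete, here is a minimal sketch on toy data. It uses an in-memory SQLite table as a stand-in for the IDC index tables, and every row (UIDs, descriptions, the "Cell" code meaning) is hypothetical, chosen only to show how a free-text match and a coded match can disagree.

```python
import sqlite3

# Hypothetical rows: neither the UIDs nor the descriptions come from IDC.
rows = [
    ("1.2.3.1", "Monolayer regions", "Selected region"),
    ("1.2.3.2", "ROI annotations", "Selected region"),
    ("1.2.3.3", "Cells inside monolayer", "Cell"),
]

con = sqlite3.connect(":memory:")
con.execute(
    "CREATE TABLE ann (SeriesInstanceUID TEXT, SeriesDescription TEXT, "
    "AnnotationPropertyType_CodeMeaning TEXT)"
)
con.executemany("INSERT INTO ann VALUES (?, ?, ?)", rows)


def uids(where_clause):
    """Return the sorted series UIDs matching a WHERE clause."""
    sql = f"SELECT SeriesInstanceUID FROM ann WHERE {where_clause} ORDER BY 1"
    return [uid for (uid,) in con.execute(sql)]


# Free-text match depends on how each description happens to be worded.
by_text = uids("lower(SeriesDescription) LIKE '%monolayer%'")
# Coded match depends only on the structured attribute.
by_code = uids("AnnotationPropertyType_CodeMeaning = 'Selected region'")

print("free-text match:", by_text)
print("coded match:    ", by_code)
```

In this toy table the free-text match misses one region series that is described differently and picks up a cell series that merely mentions the word, while the coded match returns exactly the two region series.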

fedorov · 2026-02-09T20:30:04Z

Also note that idc-index has been updated, and should allow fetching ann_group_index.
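
The thread does not say which release added ann_group_index, so a quick way to see what a given environment has is to print the installed versions of idc-index and idc-index-data. The sketch below uses only the standard library; the helper name `installed_version` is made up for illustration. If the versions turn out to be old, `pip install --upgrade idc-index` is the usual route.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


for name in ("idc-index", "idc-index-data"):
    print(name, installed_version(name) or "not installed")
```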

fedorov · 2026-02-10T15:38:50Z

I think a similar comment applies further down in the notebook, where you deal with unlabeled cells.
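
One way to find out which coded values are available for such a query is to summarize the annotation groups by their property type before writing any filter. The sketch below again uses an in-memory SQLite table as a stand-in for ann_group_index; the rows and the "Placeholder type" code meanings are hypothetical, since the thread does not state which code is used for unlabeled cells.

```python
import sqlite3

# Hypothetical annotation groups; the code meanings are placeholders.
groups = [
    ("1.2.3.1", 1, "Selected region", 4),
    ("1.2.3.2", 1, "Placeholder type A", 120),
    ("1.2.3.2", 2, "Placeholder type B", 35),
    ("1.2.3.3", 1, "Placeholder type A", 80),
]

con = sqlite3.connect(":memory:")
con.execute(
    "CREATE TABLE ann_group_index (SeriesInstanceUID TEXT, "
    "AnnotationGroupNumber INTEGER, AnnotationPropertyType_CodeMeaning TEXT, "
    "NumberOfAnnotations INTEGER)"
)
con.executemany("INSERT INTO ann_group_index VALUES (?, ?, ?, ?)", groups)

# Count groups and annotations per coded property type.
summary = con.execute("""
    SELECT AnnotationPropertyType_CodeMeaning,
           COUNT(*) AS n_groups,
           SUM(NumberOfAnnotations) AS n_annotations
    FROM ann_group_index
    GROUP BY AnnotationPropertyType_CodeMeaning
    ORDER BY AnnotationPropertyType_CodeMeaning
""").fetchall()

for code_meaning, n_groups, n_annotations in summary:
    print(f"{code_meaning}: {n_groups} groups, {n_annotations} annotations")
```

The same GROUP BY could presumably be passed to client.sql_query against the real ann_group_index, restricted to the collection of interest, to pick the right code for the filter.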

DanielaSchacherer · 2026-02-15T14:46:00Z

@fedorov I have adapted the notebook. You are right, it is of course better to query by standardized values. However, in this case it added another layer of complexity. I will ask André to review the notebook and give me feedback on how understandable it is and how it could be improved.

Let me know what you think about the current version.

fedorov · 2026-02-16T15:47:09Z

The issue is that if we rely on collection-specific conventions to be able to use the data, it will be more difficult to discover and reuse - most importantly, these days, by the LLMs. This collection-specific approach that is oriented towards a human using a single collection does not scale, and this is exactly why we put the effort into using codes and standard DICOM attributes. But I am fine either way for the tutorial - it's your call. It will be an interesting experiment to ask Claude + IDC skill to write a tutorial on this topic, and compare!

fedorov · 2026-02-17T19:14:35Z

@DanielaSchacherer let me know when it is ready!

DanielaSchacherer and others added 14 commits October 27, 2025 19:09

Created using Colab

393795d

Add files via upload

f9772b4

moved bmdeep tutorial and image

5cb6e38

renamed notebook

2a49ff1

Merge branch 'ImagingDataCommons:master' into bmdeep_tutorial

80d4e0a

Add files via upload

d3a7cd4

updates with test against idc-etl-dev

9bd740f

Delete notebooks/collections_demos/bonemarrowwsi-pediatricleukemia.ipynb

bed62d5

was stored with a wrong name

cleared and tested notebook with idc-index-data v23

f0c39d3

added more text and reorganized code a little

e47f184

introduced demo mode for quicker overview

c24c840

Moved text cells above code cells

b9559f8

Smaller style changes

3f3b1c5

adapted Colab link, added warning about flawed session labels.

bdff1a9

fedorov requested a review from Copilot January 22, 2026 14:12

Copilot AI reviewed Jan 22, 2026

DanielaSchacherer and others added 3 commits February 2, 2026 17:28

ready to be tested on complete dataset for first time

ffce1c3

removed contraint to bmdeep collection

f512884

minor changes

aff0d46

DanielaSchacherer and others added 5 commits February 3, 2026 17:01

adapted query on ANN files

376de03

Integrated first round of feedback from Andrey.

5483e3a

adapted queries to use ann_index

d3b22c6

text changes, demo replaced by ann_to_process and warning in the last paragraph removed.

bde5c4c

adjusted numbers in text

c2d33e6

working if idc-index-data is force-upgraded

0d660e1

DanielaSchacherer and others added 3 commits February 14, 2026 20:07

updated information about ann_groups

015a485

updated to query ann_group_index.

2801aaa

added correct colab link (for when notebook is merged)

65f1ea4

Merge branch 'ImagingDataCommons:master' into bmdeep_tutorial

40ace17

DanielaSchacherer and others added 4 commits February 16, 2026 21:49

Merge branch 'ImagingDataCommons:master' into bmdeep_tutorial

e9e40bb

updated introduction

ca46d14

adjusted title and included tutorial header.

30d5c31

small cosmetic changes

384505f

DanielaSchacherer and others added 3 commits February 19, 2026 17:36

added AnnotationCoordinateType to query and reference in text

2b89ce7

added confusion matrix (to be tested with v24!), tiny text changes

91b06ed

Merge branch 'ImagingDataCommons:master' into bmdeep_tutorial

9c8c3a5

3 participants