
Commit 4a43f50

Merge pre/v2.0
Co-authored-by: dimitri-yatsenko <dimitri@datajoint.com>
2 parents bab4db8 + 895259f

File tree

7 files changed: +318 −48 lines changed

docs/src/design/tables/object-type-spec.md

Lines changed: 123 additions & 26 deletions
@@ -50,18 +50,41 @@ This is fundamentally different from **external references**, where DataJoint me
 
 ## Storage Architecture
 
-### Single Storage Backend Per Pipeline
+### Default and Named Stores
 
-Each DataJoint pipeline has **one** associated storage backend configured in `datajoint.json`. DataJoint fully controls the path structure within this backend.
+Each DataJoint pipeline has a **default storage backend** plus optional **named stores**, all configured in `datajoint.json`. DataJoint fully controls the path structure within each store.
 
-**Why single backend?** The object store is a logical extension of the schema—its integrity must be verifiable as a unit. With a single backend:
-- Schema completeness can be verified with one listing operation
-- Orphan detection is straightforward
-- Migration requires only config changes, not mass URL updates in the database
+```python
+@schema
+class Recording(dj.Manual):
+    definition = """
+    subject_id : int
+    session_id : int
+    ---
+    raw_data : object           # uses default store
+    published : object@public   # uses 'public' named store
+    """
+```
+
+**All stores follow OAS principles:**
+- DataJoint owns the lifecycle (insert/delete/fetch as a unit)
+- Same deterministic path structure (`project/schema/Table/objects/...`)
+- Same access control alignment with database
+- Each store has its own `datajoint_store.json` metadata file
+
+**Why support multiple stores?**
+- Different access policies (private vs public buckets)
+- Different storage tiers (hot vs cold storage)
+- Organizational requirements (data sovereignty, compliance)
+
+**Why require explicit store configuration?**
+- All stores must be registered for OAS semantics
+- Credential management aligns with database access control (platform-managed)
+- Orphan cleanup operates per-store with full knowledge of configured stores
 
 ### Access Control Patterns
 
-The deterministic path structure (`project/schema/Table/objects/pk=val/...`) enables **prefix-based access control policies** on the storage backend.
+The deterministic path structure (`project/schema/Table/objects/pk=val/...`) enables **prefix-based access control policies** on each storage backend.
 
 **Supported access control levels:**
 
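The relationship implied above — one relative path structure shared by every store, with only the store root differing — can be sketched as follows (the `stores` dict and `object_url` helper are illustrative assumptions, not part of this commit):

```python
# Sketch: how a store-relative path combines with a store's root to form a URL.
# The spec layout mirrors the datajoint.json keys shown later in this document.
stores = {
    None: {"protocol": "s3", "bucket": "internal-bucket", "location": "my_project"},
    "public": {"protocol": "s3", "bucket": "public-bucket", "location": "my_project"},
}

def object_url(store, relative_path):
    """Join a store's root with the store-relative path kept in the JSON metadata."""
    spec = stores[store]
    return f"{spec['protocol']}://{spec['bucket']}/{spec['location']}/{relative_path}"

path = "lab_schema/Recording/objects/subject_id=123/session_id=45/raw_data_Ax7bQ2kM.dat"
print(object_url(None, path))      # s3://internal-bucket/my_project/lab_schema/...
print(object_url("public", path))  # s3://public-bucket/my_project/lab_schema/...
```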
@@ -72,21 +95,23 @@ The deterministic path structure (`project/schema/Table/objects/pk=val/...`) ena
 | Table-level | IAM/bucket policy | `my-bucket/my_project/schema/SensitiveTable/*` |
 | Row-level | Per-object ACL or signed URLs | Future enhancement |
 
-**Example: Private and public data in one bucket**
-
-Rather than using separate buckets, use prefix-based policies:
+**Example: Private and public data in separate stores**
 
 ```
-s3://my-bucket/my_project/
-├── internal_schema/          ← restricted IAM policy
-│   └── ProcessingResults/
-│       └── objects/...
-└── publications/             ← public bucket policy
+# Default store (private)
+s3://internal-bucket/my_project/
+└── lab_schema/
+    └── ProcessingResults/
+        └── objects/...
+
+# Named 'public' store
+s3://public-bucket/my_project/
+└── lab_schema/
     └── PublishedDatasets/
         └── objects/...
 ```
 
-This achieves the same access separation as multiple buckets while maintaining schema integrity in a single backend.
+Alternatively, use prefix-based policies within a single bucket if preferred.
 
 **Row-level access control** (access to objects for specific primary key values) is not directly supported by object store policies. Future versions may address this via DataJoint-generated signed URLs that project database permissions onto object access.
 
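For the 'public' store above, the public-read half of the separation could be granted with a standard S3 bucket policy on the bucket's project prefix (illustrative only; policy management is outside this spec):

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "PublicReadPublishedObjects",
      "Effect": "Allow",
      "Principal": "*",
      "Action": "s3:GetObject",
      "Resource": "arn:aws:s3:::public-bucket/my_project/*"
    }
  ]
}
```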
@@ -156,6 +181,42 @@ For local filesystem storage:
 }
 ```
 
+### Named Stores
+
+Additional stores can be defined using the `object_storage.stores.<name>` prefix:
+
+```json
+{
+  "object_storage.project_name": "my_project",
+  "object_storage.protocol": "s3",
+  "object_storage.bucket": "internal-bucket",
+  "object_storage.location": "my_project",
+
+  "object_storage.stores.public.protocol": "s3",
+  "object_storage.stores.public.bucket": "public-bucket",
+  "object_storage.stores.public.location": "my_project"
+}
+```
+
+Named stores inherit `project_name` from the default configuration but can override all other settings. Use named stores with the `object@store_name` syntax:
+
+```python
+@schema
+class Dataset(dj.Manual):
+    definition = """
+    dataset_id : int
+    ---
+    internal_data : object          # default store (internal-bucket)
+    published_data : object@public  # public store (public-bucket)
+    """
+```
+
+Each named store:
+- Must be explicitly configured (no ad-hoc URLs)
+- Has its own `datajoint_store.json` metadata file
+- Follows the same OAS lifecycle semantics as the default store
+- Credentials are managed at the platform level, aligned with database access control
+
 ### Settings Schema
 
 | Setting | Type | Required | Description |
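The inheritance rule can be pictured with a small sketch (the `resolve_store_spec` helper is hypothetical; the accessor this commit actually calls appears in `fetch.py` below as `config.get_object_store_spec`):

```python
# Sketch: build a store spec from flat datajoint.json keys; named stores
# inherit project_name from the default configuration and override the rest.
settings = {
    "object_storage.project_name": "my_project",
    "object_storage.protocol": "s3",
    "object_storage.bucket": "internal-bucket",
    "object_storage.location": "my_project",
    "object_storage.stores.public.protocol": "s3",
    "object_storage.stores.public.bucket": "public-bucket",
    "object_storage.stores.public.location": "my_project",
}

def resolve_store_spec(settings, store=None):
    default = {
        k.removeprefix("object_storage."): v
        for k, v in settings.items()
        if k.startswith("object_storage.") and ".stores." not in k
    }
    if store is None:
        return default
    prefix = f"object_storage.stores.{store}."
    overrides = {k.removeprefix(prefix): v for k, v in settings.items() if k.startswith(prefix)}
    if not overrides:
        raise KeyError(f"store {store!r} is not configured")
    return {"project_name": default["project_name"], **overrides}

assert resolve_store_spec(settings, "public")["bucket"] == "public-bucket"
assert resolve_store_spec(settings)["bucket"] == "internal-bucket"
```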
@@ -320,20 +381,24 @@ class Recording(dj.Manual):
     subject_id : int
     session_id : int
     ---
-    raw_data : object      # managed file storage
-    processed : object     # another object attribute
+    raw_data : object           # uses default store
+    processed : object          # another object attribute (default store)
+    published : object@public   # uses named 'public' store
     """
 ```
 
-Note: No `@store` suffix needed - storage is determined by pipeline configuration.
+- `object` — uses the default storage backend
+- `object@store_name` — uses a named store (must be configured in settings)
 
 ## Database Storage
 
 The `object` type is stored as a `JSON` column in MySQL containing:
 
-**File example:**
+**File in default store:**
 ```json
 {
+  "store": null,
+  "url": "s3://my-bucket/my_project/my_schema/Recording/objects/subject_id=123/session_id=45/raw_data_Ax7bQ2kM.dat",
   "path": "my_schema/Recording/objects/subject_id=123/session_id=45/raw_data_Ax7bQ2kM.dat",
   "size": 12345,
   "hash": null,
@@ -344,10 +409,12 @@ The `object` type is stored as a `JSON` column in MySQL containing:
 }
 ```
 
-**File with optional hash:**
+**File in named store:**
 ```json
 {
-  "path": "my_schema/Recording/objects/subject_id=123/session_id=45/raw_data_Ax7bQ2kM.dat",
+  "store": "public",
+  "url": "s3://public-bucket/my_project/my_schema/Dataset/objects/dataset_id=1/published_data_Bx8cD3kM.dat",
+  "path": "my_schema/Dataset/objects/dataset_id=1/published_data_Bx8cD3kM.dat",
   "size": 12345,
   "hash": "sha256:abcdef1234...",
   "ext": ".dat",
@@ -360,6 +427,8 @@ The `object` type is stored as a `JSON` column in MySQL containing:
 **Folder example:**
 ```json
 {
+  "store": null,
+  "url": "s3://my-bucket/my_project/my_schema/Recording/objects/subject_id=123/session_id=45/raw_data_pL9nR4wE",
   "path": "my_schema/Recording/objects/subject_id=123/session_id=45/raw_data_pL9nR4wE",
   "size": 567890,
   "hash": null,
@@ -373,6 +442,8 @@ The `object` type is stored as a `JSON` column in MySQL containing:
 **Zarr example (large dataset, metadata fields omitted for performance):**
 ```json
 {
+  "store": null,
+  "url": "s3://my-bucket/my_project/my_schema/Recording/objects/subject_id=123/session_id=45/neural_data_kM3nP2qR.zarr",
   "path": "my_schema/Recording/objects/subject_id=123/session_id=45/neural_data_kM3nP2qR.zarr",
   "size": null,
   "hash": null,
@@ -386,7 +457,9 @@ The `object` type is stored as a `JSON` column in MySQL containing:
 
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
-| `path` | string | Yes | Full path/key within storage backend (includes token) |
+| `store` | string/null | Yes | Store name (e.g., `"public"`), or `null` for default store |
+| `url` | string | Yes | Full URL including protocol and bucket (e.g., `s3://bucket/path`) |
+| `path` | string | Yes | Relative path within store (excludes protocol/bucket, includes token) |
 | `size` | integer/null | No | Total size in bytes (sum for folders), or null if not computed. See [Performance Considerations](#performance-considerations). |
 | `hash` | string/null | Yes | Content hash with algorithm prefix, or null (default) |
 | `ext` | string/null | Yes | File extension as tooling hint (e.g., `.dat`, `.zarr`) or null. See [Extension Field](#extension-field). |
@@ -395,6 +468,11 @@ The `object` type is stored as a `JSON` column in MySQL containing:
 | `mime_type` | string | No | MIME type (files only, auto-detected from extension) |
 | `item_count` | integer | No | Number of files (folders only), or null if not computed. See [Performance Considerations](#performance-considerations). |
 
+**Why both `url` and `path`?**
+- `url`: Self-describing, enables cross-validation, robust to config changes
+- `path`: Enables store name re-derivation at migration time, consistent structure across stores
+- At migration, the store name can be derived by matching `url` against configured stores
+
 ### Extension Field
 
 The `ext` field is a **tooling hint** that preserves the original file extension or provides a conventional suffix for directory-based formats. It is:
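The re-derivation mentioned in the last bullet above can be as simple as a longest-prefix match of `url` against each configured store's root (hypothetical helper; store roots follow this spec's examples):

```python
# Sketch: recover the store name from a stored URL at migration time.
store_roots = {
    None: "s3://internal-bucket/my_project/",
    "public": "s3://public-bucket/my_project/",
}

def derive_store(url):
    """Return the name of the store whose root prefixes the URL (None = default)."""
    candidates = [name for name, root in store_roots.items() if url.startswith(root)]
    if not candidates:
        raise ValueError(f"no configured store matches {url!r}")
    # Prefer the most specific (longest) root if several match.
    return max(candidates, key=lambda name: len(store_roots[name]))

url = "s3://public-bucket/my_project/my_schema/Dataset/objects/dataset_id=1/published_data_Bx8cD3kM.dat"
assert derive_store(url) == "public"
```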
@@ -937,18 +1015,36 @@ Orphaned files (files in storage without corresponding database records) may acc
 
 ### Orphan Cleanup Procedure
 
-Orphan cleanup is a **separate maintenance operation** provided via the `schema.object_storage` utility object.
+Orphan cleanup is a **separate maintenance operation** provided via the `schema.object_storage` utility object. Cleanup operates **per-store**, iterating through all configured stores.
 
 ```python
 # Maintenance utility methods (not a hidden table)
-schema.object_storage.find_orphaned(grace_period_minutes=30)  # List orphaned files
+schema.object_storage.find_orphaned(grace_period_minutes=30)  # List orphaned files (all stores)
+schema.object_storage.find_orphaned(store="public")           # List orphaned files (specific store)
 schema.object_storage.cleanup_orphaned(dry_run=True)          # Delete orphaned files
 schema.object_storage.verify_integrity()                      # Check all objects exist
 schema.object_storage.stats()                                 # Storage usage statistics
 ```
 
 **Note**: `schema.object_storage` is a utility object, not a hidden table. Unlike `attach@store` which uses `~external_*` tables, the `object` type stores all metadata inline in JSON columns and has no hidden tables.
 
+**Efficient listing for Zarr and large stores:**
+
+For stores with Zarr arrays (potentially millions of chunk objects), cleanup uses **delimiter-based listing** to enumerate only root object names, not individual chunks:
+
+```python
+# S3 API with delimiter - lists "directories" only
+response = s3.list_objects_v2(
+    Bucket=bucket,
+    Prefix='project/schema/Table/objects/',
+    Delimiter='/'
+)
+# Returns: ['neural_data_kM3nP2qR.zarr/', 'raw_data_Ax7bQ2kM.dat']
+# NOT millions of individual chunk keys
+```
+
+Orphan deletion uses recursive delete to remove entire Zarr stores efficiently.
+
 **Grace period for in-flight inserts:**
 
 While random tokens prevent filename collisions, there's a race condition with in-flight inserts:
@@ -962,8 +1058,9 @@ While random tokens prevent filename collisions, there's a race condition with i
 **Solution**: The `grace_period_minutes` parameter (default: 30) excludes files created within that window, assuming they are in-flight inserts.
 
 **Important considerations:**
+- Cleanup enumerates all configured stores (default + named)
+- Uses delimiter-based listing for efficiency with Zarr stores
 - Grace period handles race conditions—cleanup is safe to run anytime
-- Running during low-activity periods reduces in-flight operations to reason about
 - `dry_run=True` previews deletions before execution
 - Compares storage contents against JSON metadata in table columns
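The whole procedure reduces to a set comparison per store, sketched below (hypothetical standalone helper; the real entry point is `schema.object_storage.find_orphaned` above):

```python
# Sketch: orphans = listed root objects that are neither referenced in the
# schema's JSON metadata columns nor young enough to be in-flight inserts.
from datetime import datetime, timedelta, timezone

def find_orphans(listed, referenced, grace_period_minutes=30):
    """listed: {root object name: creation time}; referenced: names found in JSON columns."""
    cutoff = datetime.now(timezone.utc) - timedelta(minutes=grace_period_minutes)
    return [
        name for name, created in listed.items()
        if name not in referenced and created < cutoff
    ]

listed = {
    "raw_data_Ax7bQ2kM.dat": datetime(2025, 1, 1, tzinfo=timezone.utc),
    "neural_data_kM3nP2qR.zarr/": datetime(2025, 1, 1, tzinfo=timezone.utc),
}
print(find_orphans(listed, referenced={"raw_data_Ax7bQ2kM.dat"}))
# -> ['neural_data_kM3nP2qR.zarr/'] (removed recursively as one unit)
```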

src/datajoint/declare.py

Lines changed: 4 additions & 1 deletion
@@ -65,7 +65,7 @@
     INTERNAL_ATTACH=r"attach$",
     EXTERNAL_ATTACH=r"attach@(?P<store>[a-z][\-\w]*)$",
     FILEPATH=r"filepath@(?P<store>[a-z][\-\w]*)$",
-    OBJECT=r"object$",  # managed object storage (files/folders)
+    OBJECT=r"object(@(?P<store>[a-z][\-\w]*))?$",  # managed object storage (files/folders)
     UUID=r"uuid$",
     ADAPTED=r"<.+>$",
 ).items()
@@ -469,6 +469,9 @@ def substitute_special_type(match, category, foreign_key_sql, context):
         match["type"] = "LONGBLOB"
     elif category == "OBJECT":
         # Object type stores metadata as JSON - no foreign key to external table
+        # Extract store name if present (object@store_name syntax)
+        if "@" in match["type"]:
+            match["store"] = match["type"].split("@", 1)[1]
         match["type"] = "JSON"
     elif category in EXTERNAL_TYPES:
         if category == "FILEPATH" and not _support_filepath_types():
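As a quick sanity check (not part of the commit), the widened `OBJECT` pattern accepts both forms and captures the store name:

```python
import re

OBJECT = re.compile(r"object(@(?P<store>[a-z][\-\w]*))?$")

assert OBJECT.match("object").group("store") is None
assert OBJECT.match("object@public").group("store") == "public"
assert OBJECT.match("objects") is None  # anchored: no trailing characters allowed
```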

src/datajoint/fetch.py

Lines changed: 3 additions & 1 deletion
@@ -53,8 +53,10 @@ def _get(connection, attr, data, squeeze, download_path):
     if attr.is_object:
         # Object type - return ObjectRef handle
         json_data = json.loads(data) if isinstance(data, str) else data
+        # Get the correct backend based on store name in metadata
+        store_name = json_data.get("store")  # None for default store
         try:
-            spec = config.get_object_storage_spec()
+            spec = config.get_object_store_spec(store_name)
             backend = StorageBackend(spec)
         except DataJointError:
             backend = None
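Downstream effect (usage sketch; table and attribute names come from the spec's examples): fetching an `object` attribute yields an `ObjectRef` whose backend was selected by the `store` field of the stored JSON, so default-store and named-store objects resolve transparently:

```python
ref = (Dataset & "dataset_id = 1").fetch1("published_data")
print(ref.store)  # 'public' -> backend built from config.get_object_store_spec('public')
print(ref.url)    # 's3://public-bucket/my_project/my_schema/Dataset/objects/...'
```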

src/datajoint/heading.py

Lines changed: 8 additions & 1 deletion
@@ -359,6 +359,13 @@ def _init_from_database(self):
                         {env} = TRUE or upgrade datajoint.
                     """.format(env=FILEPATH_FEATURE_SWITCH)
                 )
+            # Extract store name for external types and object types with named stores
+            store = None
+            if category in EXTERNAL_TYPES:
+                store = attr["type"].split("@")[1]
+            elif category == "OBJECT" and "@" in attr["type"]:
+                store = attr["type"].split("@")[1]
+
             attr.update(
                 unsupported=False,
                 is_attachment=category in ("INTERNAL_ATTACH", "EXTERNAL_ATTACH"),
@@ -368,7 +375,7 @@ def _init_from_database(self):
                 is_blob=category in ("INTERNAL_BLOB", "EXTERNAL_BLOB"),
                 uuid=category == "UUID",
                 is_external=category in EXTERNAL_TYPES,
-                store=(attr["type"].split("@")[1] if category in EXTERNAL_TYPES else None),
+                store=store,
             )
 
             if attr["in_key"] and any(
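The two new branches reduce to one rule — external types always carry a store suffix, `OBJECT` only sometimes — as this standalone sketch shows (`EXTERNAL_TYPES` membership abbreviated for illustration):

```python
EXTERNAL_TYPES = {"EXTERNAL_ATTACH", "EXTERNAL_BLOB", "FILEPATH"}

def extract_store(category, type_str):
    if category in EXTERNAL_TYPES:
        return type_str.split("@")[1]  # e.g. "filepath@raw" -> "raw"
    if category == "OBJECT" and "@" in type_str:
        return type_str.split("@")[1]  # e.g. "object@public" -> "public"
    return None                        # plain "object" -> default store

assert extract_store("FILEPATH", "filepath@raw") == "raw"
assert extract_store("OBJECT", "object@public") == "public"
assert extract_store("OBJECT", "object") is None
```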

src/datajoint/objectref.py

Lines changed: 18 additions & 3 deletions
@@ -34,7 +34,9 @@ class ObjectRef:
     from the storage backend.
 
     Attributes:
-        path: Full path/key within storage backend (includes token)
+        path: Relative path within the store (includes token)
+        url: Full URI to the object (e.g., 's3://bucket/path/to/object.dat')
+        store: Store name (None for default store)
         size: Total size in bytes (sum for folders), or None if not computed.
             For large hierarchical data like Zarr stores, size computation can
             be expensive and is optional.
@@ -53,6 +55,8 @@ class ObjectRef:
     ext: str | None
     is_dir: bool
     timestamp: datetime
+    url: str | None = None
+    store: str | None = None
     mime_type: str | None = None
     item_count: int | None = None
     _backend: StorageBackend | None = None
@@ -80,6 +84,8 @@ def from_json(cls, json_data: dict | str, backend: StorageBackend | None = None)
 
         return cls(
             path=data["path"],
+            url=data.get("url"),
+            store=data.get("store"),
             size=data["size"],
             hash=data.get("hash"),
             ext=data.get("ext"),
@@ -105,6 +111,10 @@ def to_json(self) -> dict:
             "is_dir": self.is_dir,
             "timestamp": self.timestamp.isoformat() if self.timestamp else None,
         }
+        if self.url:
+            data["url"] = self.url
+        if self.store:
+            data["store"] = self.store
         if self.mime_type:
             data["mime_type"] = self.mime_type
         if self.item_count is not None:
@@ -121,7 +131,9 @@ def to_dict(self) -> dict:
 
         Returns:
             Dict containing the object metadata:
-            - path: Storage path
+            - path: Relative storage path within the store
+            - url: Full URI (e.g., 's3://bucket/path') (optional)
+            - store: Store name (optional, None for default store)
             - size: File/folder size in bytes (or None)
             - hash: Content hash (or None)
             - ext: File extension (or None)
@@ -152,12 +164,15 @@ def fs(self) -> fsspec.AbstractFileSystem:
         return self._backend.fs
 
     @property
-    def store(self) -> fsspec.FSMap:
+    def fsmap(self) -> fsspec.FSMap:
         """
         Return FSMap suitable for Zarr/xarray.
 
         This provides a dict-like interface to the storage location,
         compatible with zarr.open() and xarray.open_zarr().
+
+        Example:
+            >>> z = zarr.open(obj_ref.fsmap, mode='r')
         """
         self._ensure_backend()
         full_path = self._backend._full_path(self.path)
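Renaming the property from `store` to `fsmap` frees the name `store` for the new metadata field (the store name) and makes the Zarr entry point explicit. A usage sketch, assuming `zarr` is installed and `neural_data` is the Zarr-backed attribute from the spec's examples:

```python
import zarr

ref = (Recording & "subject_id=123 AND session_id=45").fetch1("neural_data")
z = zarr.open(ref.fsmap, mode="r")  # dict-like FSMap over the object's location
print(ref.store)  # now the store *name* (None for the default store), not an FSMap
```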
