Conversation
Pull request overview
This PR adds two new bulk-oriented API endpoints to improve high-volume skeleton access by (1) separating cached retrieval from generation and (2) enabling direct GCS downloads via a downscoped OAuth2 token, addressing reports of the existing bulk endpoint hitting limits even when skeletons are already cached.
Changes:
- Add a bulk endpoint to fetch already-cached skeletons with a higher per-call RID limit and optional async-queueing of missing RIDs.
- Add an endpoint that returns a downscoped, short-lived GCS Bearer token plus object paths for cached skeleton H5 files.
- Introduce `MAX_BULK_CACHED_SKELETONS = 500` and wire the new API routes to the service layer.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 8 comments.
| File | Description |
|---|---|
| skeletonservice/datasets/service.py | Implements cached-bulk retrieval logic and downscoped GCS token generation, plus new bulk limit constant. |
| skeletonservice/datasets/api.py | Exposes new POST routes for cached-bulk retrieval and token issuance, with rate-limiting and auth decorators. |
```python
datastack_name_remapped = DATASTACK_NAME_REMAPPING[datastack_name] if datastack_name in DATASTACK_NAME_REMAPPING else datastack_name
skvn_prefix = f"{bucket_extra_prefix}{datastack_name_remapped}/{HIGHEST_SKELETON_VERSION}/"
```
@copilot apply changes based on this feedback
```diff
 from skeletonservice.datasets import limiter
 from skeletonservice.datasets.limiter import *
-from skeletonservice.datasets.service import NEUROGLANCER_SKELETON_VERSION, SKELETON_DEFAULT_VERSION_PARAMS, SKELETON_VERSION_PARAMS, SkeletonService
+from skeletonservice.datasets.service import NEUROGLANCER_SKELETON_VERSION, SKELETON_DEFAULT_VERSION_PARAMS, SKELETON_VERSION_PARAMS, SkeletonService, MAX_BULK_CACHED_SKELETONS
```
@copilot please enforce the limit at the server side
```python
@staticmethod
def get_cached_skeletons_bulk_by_datastack_and_rids(
    datastack_name: str,
    rids: List,
    bucket: str,
    root_resolution: List,
    collapse_soma: bool,
    collapse_radius: int,
    skeleton_version: int = 0,
    output_format: str = "flatdict",
    generate_missing_skeletons: bool = False,
```
@copilot apply changes based on this feedback
```python
skeleton = SkeletonService.get_skeleton_by_datastack_and_rid(
    datastack_name,
    rid,
    output_format,
    bucket,
    root_resolution,
    collapse_soma,
    collapse_radius,
```
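The per-RID call above is invoked inside the bulk retrieval loop. As a minimal sketch of how that loop might sort RIDs into the three response buckets described in this PR, assuming hypothetical `cache_lookup` and `queue_async_generation` stand-ins for the service's real cache and messaging calls:

```python
from typing import Callable, Dict, List, Optional

def get_cached_skeletons_bulk(
    rids: List[int],
    cache_lookup: Callable[[int], Optional[dict]],
    generate_missing: bool = False,
    queue_async_generation: Callable[[int], None] = lambda rid: None,
) -> Dict[str, object]:
    """Sketch of the cached-bulk flow: no per-RID chunkedgraph validation,
    just a cache probe per RID and optional async queueing of misses."""
    skeletons: Dict[int, dict] = {}
    missing: List[int] = []
    async_queued: List[int] = []
    for rid in rids:
        skel = cache_lookup(rid)  # fast path: cache only, no generation
        if skel is not None:
            skeletons[rid] = skel
        else:
            missing.append(rid)
            if generate_missing:
                queue_async_generation(rid)  # hypothetical messaging hook
                async_queued.append(rid)
    return {"skeletons": skeletons, "missing": missing, "async_queued": async_queued}
```

The real service method takes many more parameters (bucket, resolution, collapse options); this sketch keeps only the control flow that distinguishes the new endpoint from the existing one.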
…n validation, RID limits, fast-path, remove unused param, add tests Co-authored-by: fcollman <782341+fcollman@users.noreply.github.com>
Co-authored-by: fcollman <782341+fcollman@users.noreply.github.com>
Co-authored-by: fcollman <782341+fcollman@users.noreply.github.com>
Enforce MAX_BULK_CACHED_SKELETONS limit at the API layer
Fix hard-coded HIGHEST_SKELETON_VERSION in get_skeleton_token_by_datastack
Co-authored-by: fcollman <782341+fcollman@users.noreply.github.com>
Add unit tests for bulk cached skeleton and token endpoints
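One commit above enforces the bulk limit at the API layer, as requested in review. A framework-free sketch of what that check might look like (the real Flask route would presumably call `abort(400, ...)` instead of raising `ValueError`):

```python
MAX_BULK_CACHED_SKELETONS = 500  # limit introduced in this PR

def validate_bulk_rids(rids):
    """Reject empty or oversized RID lists before any cache work begins."""
    if not rids:
        raise ValueError("At least one root id is required")
    if len(rids) > MAX_BULK_CACHED_SKELETONS:
        raise ValueError(
            f"Too many root ids: {len(rids)} > {MAX_BULK_CACHED_SKELETONS}"
        )
    return rids
```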
Addresses user reports of hitting the 10-skeleton limit in get_bulk_skeletons() even when all requested skeletons already exist in the GCS cache. The existing limit was designed to prevent blocking on skeleton generation, but incorrectly also throttled retrieval of pre-existing cached skeletons. This PR adds two new endpoints that separate those concerns.
Changes
`POST //bulk/get_cached_skeletons//<output_format>` — retrieves up to 500 already-cached skeletons per call. Skips per-RID `is_valid_nodes()` validation against the chunkedgraph (the main bottleneck of the existing endpoint). Returns a structured dict with three keys: `skeletons` (data for found RIDs), `missing` (not in cache), and `async_queued` (queued for async generation if `generate_missing=true`). Rate-limited by the new `get_cached_skeletons_bulk` category.
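A client working with more than 500 RIDs would need to batch its calls and merge the three-key responses. A small client-side sketch (the HTTP POST itself is omitted so the helpers stay runnable offline; response field names follow the description above):

```python
def chunk_rids(rids, limit=500):
    """Split a RID list into batches that fit under the per-call limit."""
    return [rids[i:i + limit] for i in range(0, len(rids), limit)]

def merge_responses(responses):
    """Combine the skeletons/missing/async_queued dicts from several calls."""
    merged = {"skeletons": {}, "missing": [], "async_queued": []}
    for r in responses:
        merged["skeletons"].update(r.get("skeletons", {}))
        merged["missing"].extend(r.get("missing", []))
        merged["async_queued"].extend(r.get("async_queued", []))
    return merged
```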
`POST //bulk/get_skeleton_token/` — generates a short-lived, downscoped GCS OAuth2 Bearer token scoped to read-only access on the skeleton bucket prefix for the given datastack and version. The client can use this token to download skeleton H5 files directly from GCS without routing through this service, which is significantly faster for bulk access. Returns the token, expiry, bucket name, GCS object paths for each cached RID, and a list of missing RIDs.
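On the client side, the token response can be turned into direct GCS media downloads. A sketch assuming the response carries `token`, `bucket`, and `paths` fields (the exact field names are an assumption; the URL shape is the standard GCS JSON API media endpoint):

```python
from urllib.parse import quote

def build_download_requests(token_response):
    """Build the Authorization header and per-object media URLs for direct
    GCS downloads using the downscoped Bearer token."""
    headers = {"Authorization": f"Bearer {token_response['token']}"}
    bucket = token_response["bucket"]
    urls = [
        # Object names must be fully URL-encoded in the JSON API path.
        f"https://storage.googleapis.com/storage/v1/b/{bucket}/o/"
        f"{quote(path, safe='')}?alt=media"
        for path in token_response["paths"]
    ]
    return headers, urls
```

Each URL can then be fetched with any HTTP client; the downscoped token only grants read access under the skeleton prefix, so a leaked token cannot touch other objects.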
New constant: `MAX_BULK_CACHED_SKELETONS = 500`
New dependency: `google.auth.downscoped` (already available via `google-auth`)
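For reference, `google.auth.downscoped` works by attaching a Credential Access Boundary to the token exchange. A sketch of the boundary structure that restricts a token to read-only access under one bucket prefix (bucket and prefix names here are illustrative, not taken from this PR's code):

```python
def build_access_boundary(bucket: str, prefix: str) -> dict:
    """Construct a Credential Access Boundary dict: objectViewer role on one
    bucket, further restricted by a CEL condition to a single object prefix."""
    return {
        "accessBoundary": {
            "accessBoundaryRules": [
                {
                    "availableResource": (
                        f"//storage.googleapis.com/projects/_/buckets/{bucket}"
                    ),
                    "availablePermissions": ["inRole:roles/storage.objectViewer"],
                    "availabilityCondition": {
                        "expression": (
                            "resource.name.startsWith("
                            f"'projects/_/buckets/{bucket}/objects/{prefix}')"
                        )
                    },
                }
            ]
        }
    }
```

In `google-auth` this same structure is expressed with `downscoped.AccessBoundaryRule` and `downscoped.CredentialAccessBoundary` objects rather than a raw dict; the dict form shows what the downscoped token is actually allowed to do.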