Skip to content

Serverless GPU lab with APIM and SGLang#297

Open
nourshaker-msft wants to merge 3 commits intomainfrom
serverless-gpu
Open

Serverless GPU lab with APIM and SGLang#297
nourshaker-msft wants to merge 3 commits intomainfrom
serverless-gpu

Conversation

@nourshaker-msft
Copy link
Collaborator

Purpose

Showcase the capabilities of serverless GPU to host custom models that can scale down to Zero to run models that would otherwise not have access to in certain regions.

Does this introduce a breaking change?

[ ] Yes
[x] No

Pull Request Type

What kind of change does this Pull Request introduce?

[ ] Bugfix
[x] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

How to Test

  • Get the code

  • Run the serverless-gpu lab

  • CLEAN UP to avoid excessive costs

Other Information

@nourshaker-msft nourshaker-msft requested a review from vieiraae March 3, 2026 16:51
@github-actions
Copy link
Contributor

github-actions bot commented Mar 3, 2026

📋 Labs Config Preview

The following labs will be updated in docs/labs-config.json when this PR is merged:

  • serverless-gpu

The config will be updated on both main and gh-pages branches.

@nourshaker-msft nourshaker-msft marked this pull request as ready for review March 3, 2026 16:52
@github-actions
Copy link
Contributor

github-actions bot commented Mar 3, 2026

📋 Labs Config Preview

The following labs will be updated in docs/labs-config.json when this PR is merged:

  • serverless-gpu

The config will be updated on both main and gh-pages branches.

@github-actions
Copy link
Contributor

github-actions bot commented Mar 4, 2026

📋 Labs Config Preview

The following labs will be updated in docs/labs-config.json when this PR is merged:

  • serverless-gpu

The config will be updated on both main and gh-pages branches.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant