Slash Commands
vLLM Ascend supports slash commands in pull request comments to trigger CI workflows. See the Permission section for who can trigger each command.
Available Commands
/e2e
Run specific E2E tests under tests/e2e/pull_request/. Tests are automatically routed to the appropriate NPU runner based on the test path.
Examples:
# Run a single test on the default runner (a2 single card)
/e2e tests/e2e/pull_request/one_card/test_attention.py
# Run multiple tests across different runners
/e2e tests/e2e/pull_request/one_card/test_attention.py tests/e2e/pull_request/two_card/test_parallel.py
# Run tests on 310P
/e2e tests/e2e/pull_request/one_card/_310p/test_310p_ops.py
Routing rules (matched in order):
| Test path contains | Runner |
|---|---|
|  | 310P 4-card |
|  | 310P single card |
|  | A3 4-card |
|  | A3 2-card |
| Others | A2 single card |
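"Matched in order" means the first rule that applies wins, so a more specific pattern has to be listed before a broader one. The following is a minimal Python sketch of that first-match behaviour. The patterns and runner labels are made-up placeholders for illustration, not the project's actual routing table, and `route` is a hypothetical helper name.

```python
from typing import Sequence, Tuple

# Hypothetical placeholder rules: (substring to look for, runner label).
# They only illustrate ordering; the real patterns are defined in the CI setup.
EXAMPLE_RULES: Sequence[Tuple[str, str]] = (
    ("special/four_card", "runner-special-4"),  # most specific rule first
    ("special", "runner-special-1"),
    ("four_card", "runner-general-4"),
)
DEFAULT_RUNNER = "runner-default"


def route(test_path: str,
          rules: Sequence[Tuple[str, str]] = EXAMPLE_RULES,
          default: str = DEFAULT_RUNNER) -> str:
    """Return the runner of the first rule whose substring occurs in test_path."""
    for needle, runner in rules:
        if needle in test_path:
            return runner
    # No rule matched: fall back to the default runner.
    return default
```

If the second and first rules were swapped, any path containing `special/four_card` would be caught by the broader `special` rule, which is why the order of the table matters.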
Only test paths under tests/e2e/pull_request/ are supported. Tests in tests/e2e/nightly/, tests/e2e/models/, or tests/e2e/doctests/ are not accepted by /e2e. Use /nightly for nightly tests.
Tests are run against both the community vLLM version and the latest release.
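The path restriction for /e2e can be pictured as a simple prefix check on each argument. This is a sketch under stated assumptions, not the workflow's actual code: `split_e2e_paths` is a hypothetical helper, and normalizing the path first (so that `..` segments cannot escape the allowed directory) is an extra precaution added here.

```python
import posixpath
from typing import List, Tuple

ALLOWED_PREFIX = "tests/e2e/pull_request/"


def split_e2e_paths(comment: str) -> Tuple[List[str], List[str]]:
    """Split the arguments of an '/e2e ...' comment into (accepted, rejected)."""
    tokens = comment.split()
    if not tokens or tokens[0] != "/e2e":
        raise ValueError("not an /e2e command")
    accepted: List[str] = []
    rejected: List[str] = []
    for raw in tokens[1:]:
        # Collapse '.' and '..' segments before checking the prefix.
        path = posixpath.normpath(raw)
        if path.startswith(ALLOWED_PREFIX):
            accepted.append(path)
        else:
            rejected.append(raw)
    return accepted, rejected
```

For example, a comment that mixes a tests/e2e/pull_request/ path with a tests/e2e/nightly/ path would yield one accepted and one rejected entry.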
/nightly
Trigger specific nightly test cases on A2 and A3. Supports both PR and issue comments. Test case names correspond to the test_config.name entries defined in schedule_nightly_test_a2.yaml and schedule_nightly_test_a3.yaml.
Usage:
| Syntax | Scope |
|---|---|
| /nightly <test cases> | Runs on main |
| /nightly <test cases> --branch <name> | Runs on the specified branch |
Use --branch <name> to specify a target branch. Without --branch, all arguments are treated as test cases (separated by commas or spaces) and the branch defaults to main.
Note: When commenting on a PR, the tests run on the PR branch automatically in the triggered workflow; the --branch flag is primarily used in issue comments.
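The argument rules above (optional `--branch <name>`, test cases separated by commas or spaces, branch defaulting to main) can be sketched as a small parser. This is an illustrative reading of those rules, not the parser the workflow actually uses; `parse_nightly` is a hypothetical name.

```python
import re
from typing import List, Tuple


def parse_nightly(comment: str) -> Tuple[List[str], str]:
    """Parse '/nightly <cases> [--branch <name>]' into (test_cases, branch)."""
    tokens = comment.strip().split()
    if not tokens or tokens[0] != "/nightly":
        raise ValueError("not a /nightly command")
    args = tokens[1:]
    branch = "main"  # default when --branch is absent
    if "--branch" in args:
        i = args.index("--branch")
        if i + 1 >= len(args):
            raise ValueError("--branch requires a branch name")
        branch = args[i + 1]
        # Remove the flag and its value; everything else is a test case.
        args = args[:i] + args[i + 2:]
    # Test cases may be separated by commas, spaces, or both.
    cases = [c for c in re.split(r"[,\s]+", " ".join(args)) if c]
    return cases, branch
```

With this sketch, `/nightly test_custom_op,accuracy-group` and `/nightly test_custom_op accuracy-group` parse to the same two test cases on main.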
Common test case names (A2):
test_custom_op, test_custom_op_multi_card, qwen3-vl-32b-instruct-w8a8, qwen3-32b-int8, MiniMax-M2.5-w8a8-QuaRot-A2, Qwen3.5-27B-w8a8-A2, Qwen3.5-397B-A17B-w4a8-mtp, accuracy-group
Common test case names (A3):
multi-node-deepseek-v3.2-W8A8-EP, mtpx-deepseek-r1-0528-w8a8, deepseek-r1-0528-w8a8, kimi-k2-thinking, qwen3-vl-235b-a22b-instruct-w8a8, custom-multi-ops, …
Examples:
# Run a single test case on main branch
/nightly qwen3-vl-32b-instruct-w8a8
# Run on a specific release branch
/nightly qwen3-vl-32b-instruct-w8a8 --branch releases/v0.21.0
# Run all tests on a specific branch
/nightly all --branch my-feature-branch
# Run multiple test cases (comma-separated)
/nightly test_custom_op,multi-node-deepseek-v3.2-W8A8-EP
# Run multiple test cases (space-separated, also works)
/nightly test_custom_op accuracy-group
# Run accuracy group tests (branch defaults to main)
/nightly accuracy-group
This triggers workflow_dispatch on both schedule_nightly_test_a2.yaml and schedule_nightly_test_a3.yaml.
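For readers unfamiliar with workflow_dispatch, the sketch below builds (but does not send) the two GitHub REST API requests such a trigger would correspond to, one per workflow file. The endpoint shape (`POST /repos/{owner}/{repo}/actions/workflows/{workflow_id}/dispatches` with a `ref` and optional `inputs`) is GitHub's public API; the input key `test_cases` and the helper name `build_dispatches` are assumptions made for illustration, since the workflows' real input names are not shown here.

```python
from typing import Dict, List

WORKFLOW_FILES = (
    "schedule_nightly_test_a2.yaml",
    "schedule_nightly_test_a3.yaml",
)


def build_dispatches(owner: str, repo: str, ref: str,
                     test_cases: List[str]) -> List[Dict]:
    """Describe one workflow_dispatch request per nightly workflow file."""
    requests = []
    for workflow in WORKFLOW_FILES:
        requests.append({
            "method": "POST",
            "url": (f"https://api.github.com/repos/{owner}/{repo}"
                    f"/actions/workflows/{workflow}/dispatches"),
            "json": {
                "ref": ref,  # branch or tag the workflow runs on
                # 'test_cases' is a hypothetical input name.
                "inputs": {"test_cases": ",".join(test_cases)},
            },
        })
    return requests
```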
/rerun
Re-run all failed workflow runs on the current PR commit. Useful when CI jobs failed due to infrastructure issues.
Examples:
# Re-run all failed CI workflows on this PR
/rerun
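One way to picture what /rerun selects: from the workflow runs of the repository, keep those that belong to the PR's current commit and finished with a failure. The sketch below filters run objects shaped like GitHub's workflow-run API responses (`id`, `head_sha`, `status`, `conclusion`) and builds the URL of GitHub's "re-run failed jobs" endpoint. Whether the actual implementation uses this endpoint or a full re-run is an assumption, and both helper names are hypothetical.

```python
from typing import Dict, Iterable, List


def failed_run_ids(runs: Iterable[Dict], head_sha: str) -> List[int]:
    """Return IDs of completed, failed runs that belong to the given commit."""
    return [
        run["id"]
        for run in runs
        if run.get("head_sha") == head_sha
        and run.get("status") == "completed"
        and run.get("conclusion") == "failure"
    ]


def rerun_url(owner: str, repo: str, run_id: int) -> str:
    """URL of the GitHub endpoint that re-runs the failed jobs of one run."""
    return (f"https://api.github.com/repos/{owner}/{repo}"
            f"/actions/runs/{run_id}/rerun-failed-jobs")
```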
Behavior
When you post a slash command as a comment, a 👀 reaction is added to the comment to indicate that it has been received.
The corresponding CI workflow is then triggered asynchronously.
Upon completion, a 🎉 reaction and a summary comment are added.
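In GitHub's REST API, the 👀 and 🎉 reactions correspond to the content values `eyes` and `hooray`. As a rough sketch of the acknowledgement steps above, the helper below builds the reaction request for a PR or issue comment; the stage names and the function name `reaction_request` are hypothetical, and the request is only described, not sent.

```python
from typing import Dict

# Reaction content values defined by the GitHub reactions API.
REACTION_FOR_STAGE = {
    "received": "eyes",     # 👀 command was picked up
    "completed": "hooray",  # 🎉 workflow finished
}


def reaction_request(owner: str, repo: str, comment_id: int, stage: str) -> Dict:
    """Describe the request that adds the reaction for a given stage."""
    if stage not in REACTION_FOR_STAGE:
        raise ValueError(f"unknown stage: {stage}")
    return {
        "method": "POST",
        "url": (f"https://api.github.com/repos/{owner}/{repo}"
                f"/issues/comments/{comment_id}/reactions"),
        "json": {"content": REACTION_FOR_STAGE[stage]},
    }
```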
Scope
| Command | PR comments | Issue comments |
|---|---|---|
| /e2e | ✅ | ❌ |
| /rerun | ✅ | ❌ |
| /nightly | ✅ | ✅ |
Permission
| Command | Who can trigger |
|---|---|
| /e2e | PR author, or users with triage+ permission on the repository |
| /rerun | PR author, or users with triage+ permission on the repository |
| /nightly | Users with triage+ permission on the repository only |
Permission is verified via the GitHub API (repos/{owner}/{repo}/collaborators/{user}/permission).
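A minimal sketch of how a "triage+" check could be evaluated from that endpoint's JSON response. It assumes the response's `role_name` field is used (GitHub's legacy `permission` field does not distinguish triage from read); the helper names and the `author_allowed` switch are hypothetical and stand in for the per-command rule in the table.

```python
from typing import Dict, Optional

# Roles at or above triage in GitHub's built-in repository role hierarchy.
TRIAGE_OR_HIGHER = {"triage", "write", "maintain", "admin"}


def has_triage_plus(permission_response: Dict) -> bool:
    """True if the collaborator-permission response grants triage or higher."""
    return permission_response.get("role_name") in TRIAGE_OR_HIGHER


def may_trigger(commenter: str,
                permission_response: Dict,
                pr_author: Optional[str] = None,
                author_allowed: bool = True) -> bool:
    """Apply the rule 'PR author (if allowed), or triage+ on the repository'."""
    if author_allowed and pr_author is not None and commenter == pr_author:
        return True
    return has_triage_plus(permission_response)
```

For a command restricted to triage+ only, `author_allowed=False` makes the PR author subject to the same role check as everyone else.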