fix: add Databricks models databricks-qwen35-122b-a10b and databricks-qwen3-next-80b-a3b-instruct #654
Open
github-actions[bot] wants to merge 1 commit into
…-qwen3-next-80b-a3b-instruct
Closes #644
Source issue: #644
Summary
- databricks-qwen35-122b-a10b
- databricks-qwen3-next-80b-a3b-instruct
Verified metadata
Verification notes
Verification
Official sources and fields verified
- https://docs.databricks.com/aws/en/machine-learning/foundation-model-apis/supported-models (Databricks Supported Models page)
  - databricks-qwen35-122b-a10b: model ID, 256K context window, 8K max output tokens, Public Preview status, text-only input, "reasoning cannot be disabled"
  - databricks-qwen3-next-80b-a3b-instruct: model ID, Public Preview status, text-only input. Token limits NOT published on this page.
- https://docs.databricks.com/aws/en/machine-learning/model-serving/foundation-model-overview (Databricks Foundation Model Overview)
- https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Instruct (upstream Qwen model card)
  - Token limits for databricks-qwen3-next-80b-a3b-instruct, since Databricks does not publish specific token limits
- https://huggingface.co/Qwen/Qwen3.5-122B-A10B (upstream Qwen model card)
  - <think> tokens
sync_models (LiteLLM) cross-check
Neither databricks-qwen35-122b-a10b nor databricks-qwen3-next-80b-a3b-instruct appears in the LiteLLM model_prices_and_context_window_backup.json catalog. There are no numeric fields to cross-check: both models are absent from sync_models entirely.
Fields not published or not applicable
- input_cost_per_mil_tokens, output_cost_per_mil_tokens: Databricks does not publish stable per-model pricing for Foundation Model API models. Omitted.
- input_cache_read_cost_per_mil_tokens, input_cache_write_cost_per_mil_tokens: Not applicable for Databricks. Omitted.
- multimodal: Both models are text-only per Databricks docs. Omitted (defaults to falsy).
- parent: Neither model is a dated snapshot or location-scoped variant of a stable alias. Omitted.
- deprecation_date: Both are active (Public Preview). Omitted.
- supported_regions: Not applicable, since these are Databricks models, not Vertex models.
- locations: Not required for Databricks provider.
Caveats
- databricks-qwen3-next-80b-a3b-instruct token limits (262,144 input / 16,384 output) are sourced from the upstream Qwen HuggingFace model card, not from Databricks documentation. Databricks may impose different limits at the API level. The output limit of 16,384 is described as "recommended" in the model card, not a hard architectural limit.
- databricks-qwen35-122b-a10b upstream model supports 262,144 context and 32,768 output, but Databricks restricts to 256,000 input / 8,000 output per their supported-models page.
sync_models vs proposed update
sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.
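For reviewers, the cross-check described above can be sketched roughly as follows. This is an illustrative sketch, not the project's actual sync_models code: the `cross_check` helper and the `max_input_tokens` / `max_output_tokens` key names are assumptions, and the empty dict stands in for a reference catalog that lists neither model. The numeric values are the ones quoted in the caveats.

```python
# Proposed token limits, as stated in the PR description.
# NOTE: the key names max_input_tokens / max_output_tokens are assumed
# for illustration; the PR text does not show the real schema keys.
PROPOSED = {
    "databricks-qwen35-122b-a10b": {
        "max_input_tokens": 256_000,   # Databricks supported-models page
        "max_output_tokens": 8_000,    # Databricks supported-models page
    },
    "databricks-qwen3-next-80b-a3b-instruct": {
        "max_input_tokens": 262_144,   # upstream Qwen model card
        "max_output_tokens": 16_384,   # "recommended" value in the model card
    },
}


def cross_check(proposed, reference):
    """Hypothetical helper: compare proposed entries with a reference catalog.

    Returns a tuple (missing, diffs):
      missing - sorted model IDs that the reference catalog does not contain
      diffs   - {model: {field: (proposed_value, reference_value)}} for
                fields whose values disagree
    """
    missing = sorted(model for model in proposed if model not in reference)
    diffs = {}
    for model, fields in proposed.items():
        if model not in reference:
            continue
        changed = {
            key: (value, reference[model].get(key))
            for key, value in fields.items()
            if reference[model].get(key) != value
        }
        if changed:
            diffs[model] = changed
    return missing, diffs


# An empty reference catalog models the situation reported above,
# where both models are absent and no numeric fields can be compared.
missing, diffs = cross_check(PROPOSED, {})
print("absent from reference:", missing)
print("numeric differences:", diffs)
```

With both models absent from the reference, every discrepancy is a "missing entry" rather than a numeric mismatch, which matches the cross-check result reported in this PR.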