docs: Document min_cuda_version parameter for Flash GPU endpoints#593
Draft
promptless[bot] wants to merge 1 commit intomainfrom
Draft
docs: Document min_cuda_version parameter for Flash GPU endpoints#593promptless[bot] wants to merge 1 commit intomainfrom
promptless[bot] wants to merge 1 commit intomainfrom
Conversation
Contributor
Author
There was a problem hiding this comment.
(Line 474)
Citation: New min_cuda_version parameter added to Endpoint class in src/runpod_flash/endpoint.py. Default value of "12.8" set in src/runpod_flash/core/resources/serverless.py. CPU endpoints clear this value via _sync_cpu_fields() in serverless_cpu.py.
View source
Contributor
Author
There was a problem hiding this comment.
(Line 500)
Citation: Valid CUDA versions are validated against the CudaVersion enum via validate_min_cuda_version() in serverless.py. The error message format and validation logic are defined in the PR.
View source
|
Preview deployment for your docs. Learn more about Mintlify Previews.
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Open this suggestion in Promptless to view citations and reasoning process
Documents the new
min_cuda_versionparameter for Flash endpoints. GPU endpoints now default to CUDA 12.8 to ensure workers run on hosts with recent drivers. Users can override this value to allow older hosts if needed. CPU endpoints are unaffected.Trigger Events
runpod/flash PR #277: feat: default GPU endpoints to minCudaVersion 12.8
Promptless Research (5 files, 1 GitHub PR)
.long_term_context/product_knowledge/product_overview.md.long_term_context/doc_workflow/client_instructions.md.long_term_context/style/client_style_guide.mdflash/configuration/parameters.mdxflash/create-endpoints.mdxAgent Response
Tip: Add or adjust Promptless's style guide in Agent Knowledge Base ✍️