When generating completion responses, the generator currently uses all available tokens.
To increase realism, it would be useful to be able to control the number of tokens generated via config, e.g. to enable generating a range of response sizes.
TODO: determine what the configuration should look like
- specify the mean tokens to use as a percentage of the max_tokens specified in the request (with a config for the default if not present in the request) and an associated std dev?
- should this be a single configuration option? Per model? Per deployment?
(from stuartleeks/aoai-simulated-api#35)
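One possible shape for the first option above is a minimal sketch, assuming hypothetical config names (DEFAULT_MAX_TOKENS, MEAN_TOKENS_RATIO, TOKENS_STD_DEV_RATIO — none of these exist in the simulator yet): sample the response size from a normal distribution centred on a percentage of max_tokens, clipped to the valid range.

```python
import random

# Hypothetical config values -- names are illustrative, not part of the simulator.
DEFAULT_MAX_TOKENS = 512      # fallback when the request omits max_tokens
MEAN_TOKENS_RATIO = 0.6       # mean response size as a fraction of max_tokens
TOKENS_STD_DEV_RATIO = 0.15   # std dev as a fraction of max_tokens


def sample_completion_tokens(request_max_tokens=None):
    """Sample a response size from a normal distribution, clipped to [1, max_tokens]."""
    max_tokens = request_max_tokens or DEFAULT_MAX_TOKENS
    mean = MEAN_TOKENS_RATIO * max_tokens
    std_dev = TOKENS_STD_DEV_RATIO * max_tokens
    sampled = int(random.gauss(mean, std_dev))
    # Clip so we never exceed the request limit or return an empty response
    return max(1, min(sampled, max_tokens))
```

If the per-model or per-deployment question is answered with "yes", the three constants could become a lookup keyed by deployment name, with a global default.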