NAME
AI::Ollama::GenerateCompletionRequest -
SYNOPSIS
my $obj = AI::Ollama::GenerateCompletionRequest->new();
...
PROPERTIES
context
The context parameter returned from a previous request to [generateCompletion], this can be used to keep a short conversational memory.
format
The format to return a response in. Currently the only accepted value is json.
Enable JSON mode by setting the format parameter to json. This will structure the response as valid JSON.
Note: it's important to instruct the model to use JSON in the prompt. Otherwise, the model may generate large amounts whitespace.
images
(optional) a list of Base64-encoded images to include in the message (for multimodal models such as llava)
keep_alive
How long (in minutes) to keep the model loaded in memory.
- If set to a positive duration (e.g. 20), the model will stay loaded for the provided duration. - If set to a negative duration (e.g. -1), the model will stay loaded indefinitely. - If set to 0, the model will be unloaded immediately once finished. - If not set, the model will stay loaded for 5 minutes by default
model
The model name.
Model names follow a `model:tag` format. Some examples are `orca-mini:3b-q4_1` and `llama2:70b`. The tag is optional and, if not provided, will default to `latest`. The tag is used to identify a specific version.
options
Additional model parameters listed in the documentation for the Modelfile such as `temperature`.
prompt
The prompt to generate a response.
raw
If `true` no formatting will be applied to the prompt and no context will be returned.
You may choose to use the `raw` parameter if you are specifying a full templated prompt in your request to the API, and are managing history yourself.
stream
If `false` the response will be returned as a single response object, otherwise the response will be streamed as a series of objects.
system
The system prompt to (overrides what is defined in the Modelfile).
template
The full prompt or prompt template (overrides what is defined in the Modelfile).