NAME

App::Greple::xlate::gpt5 - GPT-5.5 translation engine for greple xlate module

SYNOPSIS

greple -Mxlate --xlate-engine=gpt5 --xlate=ja file.txt

DESCRIPTION

This module provides GPT-5.5 translation support for the App::Greple::xlate module. GPT-5.5 is OpenAI's latest language model, offering enhanced reasoning capabilities and improved translation quality. The engine name remains gpt5 for backward compatibility, while the default model is gpt-5.5.

GPT-5.5 API SPECIFICATIONS

Model

  • gpt-5.5 - Full model with maximum capabilities (snapshot: gpt-5.5-2026-04-23)

Unlike the GPT-5 series, GPT-5.5 is offered as a single model with no mini/nano variants.

Token Limits

  • Context window: 1,050,000 tokens

  • Output limit: 128,000 tokens (including reasoning tokens)

Prompts exceeding 272,000 input tokens are billed at 2x input and 1.5x output rates for the full session.

Input/Output Support

  • Input: Text and images

  • Output: Text only

New API Parameters

The GPT-5 family introduces several new parameters for fine-grained control:

reasoning_effort

Controls the model's thinking time and reasoning depth:

  • none - Reasoning effectively disabled; the model behaves like a non-reasoning model for the fastest latency, suitable for deterministic tasks such as translation

  • low - Low reasoning effort, prioritizes speed while keeping some planning

  • medium - Balanced reasoning (default)

  • high - High reasoning effort, prioritizes quality

  • xhigh - Maximum reasoning effort

Note: GPT-5.5 replaces the GPT-5 minimal level with none, and adds the xhigh level. The default is medium. This engine uses none for translation to favor speed and cost.

verbosity

Controls the length and detail of responses:

  • low - Minimal, terse responses

  • medium - Balanced detail level

  • high - Comprehensive, verbose responses

max_completion_tokens

Specifies the maximum number of completion tokens in the response. Unlike the legacy max_tokens parameter, this specifically controls output tokens and is the recommended approach for GPT-5.5.

Enhanced Features

Reduced Hallucinations

The GPT-5 family is significantly less likely to hallucinate compared to previous generations of models, which improves the factual reliability of translations.

Custom Tools Support

The GPT-5 family supports custom tools that can receive plaintext payloads instead of JSON, enabling more flexible integration with external systems.

Context-Free Grammar (CFG)

Allows strict output constraints to match predefined syntax rules, useful for ensuring valid format generation.

Pricing

  • Input: $5.00/1M tokens

  • Cached input: $0.50/1M tokens

  • Output: $30.00/1M tokens

Note: prompts exceeding 272,000 input tokens are billed at 2x input and 1.5x output rates for the full session (applies to standard, batch, and flex).

CONFIGURATION

This module uses the following default parameters:

  • engine: gpt-5.5

  • temperature: 1 (fixed for GPT-5.5)

  • max_length: 3000 characters per batch

  • reasoning_effort: none (for translation tasks; fastest)

  • max_completion_tokens: 16000

ENVIRONMENT VARIABLES

  • OPENAI_API_KEY - Required OpenAI API key, read by the gpty command

RELATED OPTIONS

Batching and debugging are controlled through the standard App::Greple::xlate command-line options, not environment variables:

  • --xlate-maxlen=chars - Maximum characters sent per request (defaults to this engine's value of 3000 when unset)

  • --xlate-maxline=n - Maximum lines sent per request (default 0 = unlimited); useful as a safety valve if a large batch causes a response element-count mismatch

  • --xlate-debug - Dump the gpty command and parameters

DEPENDENCIES

SEE ALSO

AUTHOR

Kazumasa Utashiro

LICENSE

Copyright 2024-2026 Kazumasa Utashiro.

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.