NAME
Langertha::Engine::vLLM - vLLM inference server
VERSION
version 0.303
SYNOPSIS
    use Langertha::Engine::vLLM;

    my $vllm = Langertha::Engine::vLLM->new(
      url => 'http://localhost:8000/v1',
      system_prompt => 'You are a helpful assistant',
    );

    print $vllm->simple_chat('Say something nice');
    # MCP tool calling (requires the server to be started with a tool-call parser)
    use Future::AsyncAwait;

    my $vllm = Langertha::Engine::vLLM->new(
      url => 'http://localhost:8000/v1',
      model => 'Qwen/Qwen2.5-3B-Instruct',
      mcp_servers => [$mcp],
    );

    # await must be used inside an async sub; in plain blocking code,
    # call ->get on the returned Future instead
    my $response = await $vllm->chat_with_tools_f('Add 7 and 15');
DESCRIPTION
Provides access to vLLM, a high-throughput inference engine for large language models. Composes Langertha::Role::OpenAICompatible since vLLM exposes an OpenAI-compatible API.
Only url is required. The URL must include the /v1 path prefix (e.g., http://localhost:8000/v1). Since vLLM serves exactly one model (configured at server startup), no model name or API key is needed.
MCP tool calling requires the vLLM server to be started with --enable-auto-tool-choice and --tool-call-parser matching the model (hermes for Qwen2.5/Hermes, llama3 for Llama, mistral for Mistral).
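As a sketch, a server for the Qwen example in the SYNOPSIS might be launched as follows; the model name and port are illustrative and should match your own setup:

```shell
# Launch vLLM with automatic tool choice enabled.
# The --tool-call-parser value must match the model family
# (hermes for Qwen2.5/Hermes, llama3 for Llama, mistral for Mistral).
vllm serve Qwen/Qwen2.5-3B-Instruct \
  --port 8000 \
  --enable-auto-tool-choice \
  --tool-call-parser hermes
```

The engine's url would then be http://localhost:8000/v1, since vLLM mounts its OpenAI-compatible API under the /v1 prefix.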
See https://docs.vllm.ai/ for installation and configuration details.
THIS API IS WORK IN PROGRESS
SEE ALSO
https://docs.vllm.ai/ - vLLM documentation
Langertha::Role::OpenAICompatible - OpenAI API format role
Langertha::Role::Tools - MCP tool calling interface
Langertha::Engine::OllamaOpenAI - Another self-hosted OpenAI-compatible engine
SUPPORT
Issues
Please report bugs and feature requests on GitHub at https://github.com/Getty/langertha/issues.
CONTRIBUTING
Contributions are welcome! Please fork the repository and submit a pull request.
AUTHOR
Torsten Raudssus <torsten@raudssus.de> https://raudss.us/
COPYRIGHT AND LICENSE
This software is copyright (c) 2026 by Torsten Raudssus.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.