LLM / ChatGPT support

The DialoX platform has builtin support for ChatGPT and other large language models (LLMs).

Prompt example¶

By creating a script of the LLM Prompt, it is possible to define one or more prompts that can be used at runtime in the bot.

A prompt file looks at minimum like this:

prompts:
  - id: rhyme
    text: |
      make a sentence that rhymes with: {{ text }}

In bubblescript this exposes a constant called @prompts.rhyme, that can then be used like this:

dialog main do
  ask "Enter a sentence and I will make it rhyme for you"

  _result = LLM.complete(@prompts.rhyme, text: answer.text)
  say _result.text
end

resulting in a conversation like this:

bot:  Enter a sentence and I will make it rhyme for you
user: I want to fly away!
bot:  Today is Sunday Funday, let's go play!

Individual Prompt Files¶

As an alternative to managing all prompts in a single prompts.yaml file, the platform supports creating individual prompt files. This approach is particularly useful for bots with many prompts, as it provides better organization and makes it easier to manage prompts individually.

Individual prompt files are stored as markdown with YAML front matter and are configured through the CMS system using the type: prompt content definition. See Individual Prompt Files (type: prompt) in the CMS documentation for detailed configuration instructions.

Both approaches can coexist in the same bot:

Prompts defined in prompts.yaml are available as @prompts.id
Prompts defined as individual files are also available as @prompts.id
All prompts are merged into the same @prompts constant

This means you can use both methods in the same bot, and reference them the same way in your Bubblescript code. Just ensure that prompt IDs are unique across both approaches to avoid conflicts.

Prompt parameter documentation¶

A full specification of a prompt yaml is this:

prompts:
  - # Unique identifier for the prompt, exposed in bubblescript as @prompts.[id]
    id: summarize

    # Human-readable label, used when the prompt is included in the CMS or Inbox widget
    label: Summarize

    # LLM provider: openai, microsoft_openai, or google_ai (gemini)
    provider: openai

    # The LLM model used. It is dependent on the provider.
    # For Google AI, use gemini-2.0-flash for instance.
    # Recommended to leave empty do default to the platform choice per provider.
    model: gpt-4o-mini

    # The actual text of the prompts. Can be a simple string or a $i18n structure for
    # translation. The prompt text is a Liquid template, so `{{ }}` bindings can be
    # specified which need to be passed in when calling `LLM.complete()`.
    text:
      $i18n: true
      nl: |
        system: Gegeven de volgende tekst, maak een korte en bondige samenvatting die
        alleen de meest noodzakelijke punten teruggeeft. Gebruik hooguit 50
        woorden:

        user: {{text}}
      en: |
        system: Given the following text, create a short summary that only highlights the
        most relevant parts of the text. Use at most 50 words:

        user: {{text}}

    # Additional request parameters passed to the API endpoint
    endpoint_params:
      some_extra_param: 1

    # Expected format of the response: text, json_object, or json_schema
    response_format: text

    # a SUBSET of JSON schema for structured responses. (ONLY when response_format is json_schema)
    # See https://platform.openai.com/docs/guides/structured-outputs#supported-schemas for more details.
    response_json_schema:
      name: "My schema"
      schema:
        type: object
        properties:
          summary:
            type: string
            description: "A concise summary of the input text"

    # Whether to return the response log probability for each generated token
    logprobs: false

    # Maximum number of tokens for the completion
    max_completion_tokens: 100

    # Number of alternative completions to generate
    candidate_count: 1

    # Penalty for token frequency (between -2.0 and 2.0)
    frequency_penalty: 0.0

    # Penalty for token presence (between -2.0 and 2.0)
    presence_penalty: 0.0

    # Seed for deterministic completions (optional)
    seed: 42

    # Randomness of the output (0.0 to 2.0)
    temperature: 1.0

    # Nucleus sampling parameter
    top_p: 1.0

    # List of sequences where the API should stop generating (optional)
    stop:
      - "END"

    # Request timeout in seconds (optional)
    # When a timeout occurs, finish_reason will be set to "timeout"
    request_timeout: 30

    # List of tools available to the model (optional)
    tools:
      - type: function
        function:
          name: get_current_weather
          description: Get the current weather in a given location
          parameters:
            type: object
            properties:
              location:
                type: string
                description: The city and state, e.g. San Francisco, CA
              unit:
                type: string
                enum: [celsius, fahrenheit]
            required: [location]

This YAML structure defines all possible fields for a prompt, including advanced options like response schemas, completion parameters, and tool definitions. Not all fields are required for every prompt, and the specific fields used may depend on the provider and use case.

Executing prompts¶

By executing the LLM.complete(prompt, bindings) function, a call to the LLM API is done with the given prompt and its bindings. The prompt argument typically comes from a constant defined in a prompt YAML file, for instance @prompts.summarize.

The bindings is a map or keyword list that needs to contain the bindings that the prompt needs; in the summarize example only one binding is created named text. So a call to that prompt would be done like this:

  _result = LLM.complete(@prompts.summarize, text: "this is a long article ...")

The full result of the LLM.complete call is a map array which contains the following:

text - The output text that LLM produced
json - A JSON deserialized version of the text; the runtime detects whether JSON is available in the result and, if so, parses it. The JSON message itself can be padded with arbitrary other texts.
usage - The total tokens that were used for this API call
request_time - The nr of milliseconds this request took
raw - The raw OpenAPI response

User / bot / assistant roles¶

The prompt text can contain user:, assistant: or system: strings, which will be used for determining the different parts of the prompt (e.g. constructing the messages part of the OpenAPI request payload).

Automatic bindings¶

Some prompt bindings are done automatically.

In the case of Bubblescript LLM.complete calls, the following bindings are filled automatically:

locale - The conversation's locale
transcript - The last 5 turns of the bot / user. This is typically used to make a generic chatbot that responds to the previous conversation in a natural way.
full_transcript - The full transcript of the conversation. Both transcript and full_transcript are in a format that can be used direclty in the OpenAPI-compatible request payload format, including the role field and tool call messages.
bot - The metadata of the bot, for instance {{ bot.title }} is exposed.
conversation - Some metadata of the conversation, like addr, tags, frontend.
user - The contact of this converstaion.
persona - The persona as filled in on the "AI" > "Persona" page in a bot. You can also manually construct the persona using the bot binding (bot.purpose, bot.extended_purpose, and bot.guardrails).
constants - All constants (for example, @foo "bar") from the bot.
prompt_constants - Same as the constants, but these can be rendered as sub-templates for the prompt. For example: {{ prompt_constants.prompt_customizations }}.

The transcript and full_transcript bindings are array bindings and needs to be specified as [[ transcript ]] or [[ full_transcript ]] so with square brackets, and on a line by itself!

Any variables in the Liquid template that were not passed explicitly in the LLM.complete call will be lookup up automatically in the globals of the conversation.

Charging¶

For every LLM.complete call, a charge event (of type llm.complete) is created and is taken into account in the customer's billing cycle.