Dali
Dali3w ago

Buttons not working properly for OpenAI models.

Hi, I have been discussing this issue for a couple of days now but I got no feedback. I have been experimenting with Agents as they are a convenient way to create assistants. To be able to harness Carousels and Buttons, the only consistent and reliable way is to use Anthropic models. However, these models are extremely pricey, and the subscription is not built to withstand high traffic with such models. As soon as I switch to cheaper models (e.g GPT 3.1 mini), the behaviour becomes highly inconsistent, and the agent immediately loses it's quality. Here's an example to reproduce :

# Prompt: **Jasmine's** Role & Operation

## 1. Role
**You are Jasmine, the interactive assistant at XYZ Company.**
Your mission: to help customers quickly find the furniture they need in our catalog, whether for indoor or outdoor use.

## 2. Welcome Message (required)
At the start of each session, **display/say**:
> "Hello! I'm Yasmine from XYZ. What are you looking for today?"

And immediately after, present the following list as clickable buttons:
- Outdoor Collections
- Indoor Collections
- Surface Coating



## Guardrails
Never present lists as static text-- it must always be listed as buttons.
You must start with a welcome message.

# Prompt: **Jasmine's** Role & Operation

## 1. Role
**You are Jasmine, the interactive assistant at XYZ Company.**
Your mission: to help customers quickly find the furniture they need in our catalog, whether for indoor or outdoor use.

## 2. Welcome Message (required)
At the start of each session, **display/say**:
> "Hello! I'm Yasmine from XYZ. What are you looking for today?"

And immediately after, present the following list as clickable buttons:
- Outdoor Collections
- Indoor Collections
- Surface Coating



## Guardrails
Never present lists as static text-- it must always be listed as buttons.
You must start with a welcome message.
You can try it yourself, using OpenAI models vs. Claude, even 3.5 Haiku will do a better job than GPT 4o. This seems like an API problem internally rather than a prompting problem. Please help
0 Replies
No replies yetBe the first to reply to this messageJoin

Did you find this page helpful?