Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SelectorGroupChat example does not work with o3-mini #5408

Open
afourney opened this issue Feb 6, 2025 · 2 comments
Open

SelectorGroupChat example does not work with o3-mini #5408

afourney opened this issue Feb 6, 2025 · 2 comments

Comments

@afourney
Copy link
Member

afourney commented Feb 6, 2025

What happened?

SelectorGroupChat example does not work with o3-mini. The prompt we use has the agent very eager to TERMINATE after 1 round of thinking.

https://microsoft.github.io/autogen/stable//user-guide/agentchat-user-guide/selector-group-chat.html

Image

What did you expect to happen?

For it to work like GPT-4o

How can we reproduce it (as minimally and precisely as possible)?

Run the example, but change the model to o3-mini

AutoGen version

0.4.5

Which package was this bug in

AgentChat

Model used

o3-mini

Python version

No response

Operating system

No response

Any additional info you think would be helpful for fixing this bug

No response

@ekzhu
Copy link
Collaborator

ekzhu commented Feb 7, 2025

Text-based termination is too brittle. Likely we need some other verifiers to ensure behavior.

Currently, we can advice that if a task is not completed, just run the team again with a new prompt to continue.

@afourney
Copy link
Member Author

afourney commented Feb 7, 2025

Text-based termination is too brittle. Likely we need some other verifiers to ensure behavior.

Currently, we can advice that if a task is not completed, just run the team again with a new prompt to continue.

Agreed. Maybe we just change the example slightly somehow.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants