Set default params for Watsonx chat & text models #260

claudiosv · 2025-01-15T00:34:25Z

As of LiteLLM 1.52.1, the model prefix watsonx/ points to the /chat endpoint. See change here. This endpoint accepts a different set of parameters and has saner defaults. It supports json_object as a response_format, and it also supports tool calling (future work for PDL). Rather than handle different LiteLLM dependency versions pointing to different APIs, this PR also bumps the minimum required LiteLLM version. One thing to note: set_default_granite_model_parameters is only called for Granite models. This means that with Granite the default is greedy decoding (temp = 0), while for any other model the default is temp = 1. That could be a surprising difference in behavior.
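To make the Granite vs. non-Granite difference concrete, here is a minimal sketch of the behavior described above. It is not the code in this PR: the helpers `is_granite` and `apply_default_params`, the substring check, and the model ids are assumptions made up for illustration, and the real set_default_granite_model_parameters may set more parameters or use a different signature.

```python
from typing import Any, Dict, Optional


def is_granite(model_id: str) -> bool:
    """Hypothetical check: treat any id containing 'granite' as a Granite model."""
    return "granite" in model_id.lower()


def apply_default_params(
    model_id: str, params: Optional[Dict[str, Any]] = None
) -> Dict[str, Any]:
    """Return a copy of params with illustrative defaults filled in.

    Only Granite models get a default temperature of 0 (greedy decoding).
    Other models are left untouched, so the endpoint's own default applies
    (temp = 1 according to the PR description).
    """
    merged = dict(params or {})
    if is_granite(model_id):
        # setdefault keeps any temperature the caller set explicitly.
        merged.setdefault("temperature", 0)
    return merged


if __name__ == "__main__":
    # Illustrative model ids, not taken from the PR.
    print(apply_default_params("watsonx/ibm/granite-3-8b-instruct"))
    print(apply_default_params("watsonx/meta-llama/llama-3-1-70b-instruct"))
    print(apply_default_params("watsonx/ibm/granite-3-8b-instruct", {"temperature": 0.7}))
```

Under these assumptions, the same program yields deterministic output with Granite but sampled output with any other model unless the user sets temperature explicitly, which is the surprising difference mentioned above.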

vazirim · 2025-01-15T13:32:35Z

pyproject.toml

We just removed the upper bound on LiteLLM at a user's request. See the latest in main, including the changes to the watsonx examples. Please merge that.

Set default params for watsonx chat and text gen

07c1789

vazirim reviewed Jan 15, 2025


Merge remote-tracking branch 'origin/main' into default-params

5186ad3
