We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
when we serve topK model for a session-based model on Triton it only returns scores, but it also needs to serve topK ids together with the scores.
This issue is related to model signatures, the topK model output from model signature is only one, but it should be two.
Run this gist first. Check out the ensemble model output. you will see it has only one output noT two.
To check the Triton restponse, then do the following steps:
Launch triton on terminal and load the model with tritonserver --model-repository={OUTPUT_DATA_DIR}/<name of ensemble folder>/
tritonserver --model-repository={OUTPUT_DATA_DIR}/<name of ensemble folder>/
prepare input data and send a request
from merlin.systems.triton import convert_df_to_triton_input import tritonclient.grpc as grpcclient validation_data = pd.read_parquet('/workspace/data/interactions_merged_df.parquet') inputs = convert_df_to_triton_input(wf.input_schema, validation_data.iloc[:100]) with grpcclient.InferenceServerClient("localhost:8001") as client: response = client.infer('executor_model', inputs) output = response.as_numpy('item_id-list/categorical_output')
The text was updated successfully, but these errors were encountered:
marcromeyn
sararb
Successfully merging a pull request may close this issue.
Bug description
when we serve topK model for a session-based model on Triton it only returns scores, but it also needs to serve topK ids together with the scores.
This issue is related to model signatures, the topK model output from model signature is only one, but it should be two.
Steps/Code to reproduce bug
Run this gist first. Check out the ensemble model output. you will see it has only one output noT two.
To check the Triton restponse, then do the following steps:
Launch triton on terminal and load the model with
tritonserver --model-repository={OUTPUT_DATA_DIR}/<name of ensemble folder>/
prepare input data and send a request
Expected behavior
Environment details
Additional context
The text was updated successfully, but these errors were encountered: