Releases · deepjavalibrary/djl-serving

What's Changed

[docs] Update javadoc link by @xyang16 in #254
Adds restart feature to management console by @c007456 in #255
[docs] Update links by @xyang16 in #258
[Docker][DeepSpeed] use accelerate to reduce memory cost by @lanking520 in #261
[Docker] bump up torch version by @lanking520 in #264
[docker] Fix pytorch-cu113 docker file by @frankfliu in #265
[Docker] support integration test on release images by @lanking520 in #266
[Docker] make aarch defaults in pytorch by @lanking520 in #267
Add management console document by @c007456 in #262
[Docker] fix arg in code by @lanking520 in #268
[aarch64] Remove unsupported engine by @frankfliu in #269
[Docker] mkdirs for Inferentia for bug fixing by @lanking520 in #270
[docker] Fixed pytorch-cu113 docker build by @frankfliu in #271
[Docker] change copy script location by @lanking520 in #272
[Docker] upgrade deepspeed and transformers by @lanking520 in #273
Fix the problem that the returned image of input text cannot be parsed by @c007456 in #276
update huggingface version by @lanking520 in #277
[serving] clean cache directory on model unload by @frankfliu in #278
[python] Fixes logging issue by @frankfliu in #279
[Docker] switch base image to devel by @lanking520 in #280
[Docker] Add INT8 support on Large model by @lanking520 in #281
apply CVE patches into the docker by @lanking520 in #284
[Docker] minor fix by @lanking520 in #285
Clean up ModelServerTest with request helper function by @zachgk in #283
[Docker] add label patching for DLC by @lanking520 in #286
[serving] Fixes install XGBoost engine bug by @frankfliu in #287
[serving] Improve logging message by @frankfliu in #289
[serving] Loads model from root of model_store directory by @frankfliu in #288
[python] Adds large model inference support with MPI mode by @frankfliu in #291
[python] Adds built-in DeepSpeed handler by @frankfliu in #292
[python] Remove DsEngineProvider alias by @frankfliu in #296
[Docker] add tagging labels by @lanking520 in #297
[doc] Add some DJL Serving docs by @xyang16 in #299
Reorganize docs by @zachgk in #298
[doc] Update DJL Serving doc by @xyang16 in #300
[Docker] rename parallelformers engine name to transformers by @lanking520 in #301
[Integration] make 6 min timeout per model by @lanking520 in #304
[Docker] Fix Accelerate version by @lanking520 in #306
[serving] Sets default ONNXRuntime OMP threads to 1 by @frankfliu in #303
start supporting multi-gpu in python mode by @lanking520 in #302
Create an ensemble workflow by @zachgk in #282
[Docker][DLC] upgrade for next release by @lanking520 in #307
[ci] Upgrade deprecated github actions by @frankfliu in #309
[benchmark] Adds HuggingFace model zoo to djl-bench by @frankfliu in #308
[doc] Add DJL Serving packaging doc by @xyang16 in #310
[serving] Adds huggingface tokenizer as default dependency by @frankfliu in #313
[djl-bench] Update README to upgrade Java version to 11 by @frankfliu in #314
fix typo in Pymodel mpi log by @siddvenk in #315
[Docker][DLC] add deepspeed 0.7.5 by @lanking520 in #316
[DLC] update docker with s5cmd by @lanking520 in #317
fix shell script by @lanking520 in #319
fix tar file unzip location by @lanking520 in #320
[doc] Update serving docs by @xyang16 in #312
[central] workaround webpack-cli 5.0.0 build issue by @frankfliu in #324
[benchmark] Make warmup iteration configurable by @frankfliu in #323
[benchmark] Update benchmark README by @frankfliu in #325
support download model from s3 by @lanking520 in #322
[doc] Update configurations document by @frankfliu in #327
[Docker] build dlc telemetry by @lanking520 in #326
fix s5cmd by @lanking520 in #328
Upgrade dependencies version by @frankfliu in #331
[serving] Uses Engine.getDjlVersion() for consistency by @frankfliu in #330
[serving] Refactor NeuronUtils by @frankfliu in #333
add telemetry collection testing by @lanking520 in #332
[Docker][G5] add test artifacts by @lanking520 in #329
fix the telemetry inaccessible issues by @lanking520 in #335
fall back to imds v1 by @lanking520 in #338
G5 test patch fixes by @lanking520 in #336
[G5][Docker] add gptj model by @lanking520 in #339
[Docker] upgrade cu117 by @lanking520 in #340
[Docker][G5] add bloom 7b1 support by @lanking520 in #341
update telemetry to follow DLC standard by @lanking520 in #343
[Docker] add paddlepaddle docker build script by @lanking520 in #342
fix on the tag by @lanking520 in #344
[Docker] fix regex by @lanking520 in #345
final regex fix by @lanking520 in #346
[HF] add more information to HF Accelerate by @lanking520 in #318
[ci] Upgrade dependencies version by @frankfliu in #347
[docs] Update serving configurations document by @frankfliu in #348
[Docker][G5] add huggingface tests by @lanking520 in #349
[Handler] fix python grammar by @lanking520 in #350
[Python] fix potential None on TP degree by @lanking520 in #351
[serving] Fixes tensor_parallel_degree parsing bug by @frankfliu in #352
fix telemetry by @lanking520 in #353
fix mapping issues by @lanking520 in #354
[G5] final fixes by @lanking520 in #355
[serving] Support tensor_parallel_degree for commandline by @frankfliu in #356
use default serve command by @lanking520 in #357
[Docker] upgrade inferentia docker image by @lanking520 in https://github.com/deepjavalibrar...


v0.20.0
