Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 819 Bytes

README.md

File metadata and controls

19 lines (13 loc) · 819 Bytes

serving-dinov2

Contents

  • Serving dinov2 onnx model with triton. (Artifact throughput: 48.927 infer/sec)
  • Gradio Demo.
  • Docker Compose.
  • K8s Setting(Triton, Traefik, Promtail, Loki, Prometheus, Grafana).
  • Serving dinov2 TensorRT Model. (Artifact throughput: 222.66 infer/sec)
  • Serving dinov2 onnx model with Fastertransformer (fastertransformer_backend don't support vit yet.)

Docker Compose

Check docker-compose

Kubernetes

image Check Kubernetes