Kubeflow

  • By Kubeflow Charmers | bundle
  • Cloud

This guide presents an overview of the Charmed Kubeflow (CKF) charms that provide Prometheus monitoring metrics.

All metrics can be accessed using the Prometheus or Grafana User Interface (UI). See Integrate with COS for more information.

Argo controller

See the argo-controller upstream documentation for more information on provided metrics.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="argo-controller"}
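The same selector can also be sent to the Prometheus HTTP API rather than typed into the UI. The following is a minimal Python sketch, assuming Prometheus is reachable at the hypothetical address `http://prometheus.example:9090` (substitute the endpoint of your own deployment). The helper names `charm_selector` and `instant_query_url` are illustrative and not part of any CKF tooling.

```python
from urllib.parse import urlencode

# Hypothetical address; replace with the Prometheus endpoint of your deployment.
PROMETHEUS_URL = "http://prometheus.example:9090"


def charm_selector(charm: str) -> str:
    """Return the PromQL selector that matches every series of one charm."""
    return f'{{juju_charm="{charm}"}}'


def instant_query_url(base_url: str, query: str) -> str:
    """Build the URL of an instant query against the /api/v1/query endpoint."""
    # urlencode percent-encodes the braces, quotes and equals sign in the query.
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': query})}"


if __name__ == "__main__":
    print(instant_query_url(PROMETHEUS_URL, charm_selector("argo-controller")))
```

Fetching the printed URL with any HTTP client should return a JSON document listing the matching series, provided the endpoint is reachable from where the script runs.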

Dex auth

The dex-auth charm provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="dex-auth"}

Envoy

The envoy charm provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="envoy"}

Istio pilot

See the istio-pilot upstream documentation for more information on provided metrics.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="istio-pilot"}

Istio gateway

See the istio-gateway upstream documentation for more information on provided metrics.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="istio-gateway"}

Jupyter controller

The jupyter-controller provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="jupyter-controller"}

Katib controller

The katib-controller provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="katib-controller"}

Kfp api

The kfp-api provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="kfp-api"}

Knative eventing

The knative-eventing metrics come from the knative-operator charm that deploys otel-collector. See its upstream documentation for more details.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="knative-operator", namespace_name="knative-eventing"}

Knative serving

The knative-serving metrics come from the knative-operator charm that deploys otel-collector. See its upstream documentation for more details.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="knative-operator", namespace_name="knative-serving"}
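The Knative eventing and serving queries differ only in the value of the `namespace_name` label. As a rough illustration, a multi-label selector like these can be assembled from label and value pairs. The `selector` helper below is a hypothetical name used for this sketch and handles exact-match (`=`) matchers only.

```python
def selector(**labels: str) -> str:
    """Build a PromQL selector from label=value pairs (exact matches only)."""

    def escape(value: str) -> str:
        # Backslashes and double quotes must be escaped inside a PromQL string.
        return value.replace("\\", "\\\\").replace('"', '\\"')

    # Keyword arguments keep their call order, so the output is deterministic.
    pairs = ", ".join(f'{name}="{escape(value)}"' for name, value in labels.items())
    return "{" + pairs + "}"


if __name__ == "__main__":
    for namespace in ("knative-eventing", "knative-serving"):
        print(selector(juju_charm="knative-operator", namespace_name=namespace))
```

Dropping the `namespace_name` argument gives the broader selector that matches every series exposed by the knative-operator charm.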

Knative operator

See the knative-operator upstream documentation for more information on provided metrics.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="knative-operator"}

Metacontroller operator

The metacontroller-operator provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="metacontroller-operator"}

Minio

See the minio upstream documentation for more information on provided metrics.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="minio"}

Seldon controller manager

See the seldon-controller-manager upstream documentation for more information on provided metrics.

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="seldon-controller-manager"}

Training operator

The training-operator provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="training-operator"}

Pvcviewer operator

The pvcviewer-operator provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="pvcviewer-operator"}

Kserve controller

The kserve-controller provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="kserve-controller"}

Kubeflow profiles

The kubeflow-profiles charm manages two Pebble services:

  • profile-controller
  • kfam

Profile controller

The profile-controller provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="kubeflow-profiles"}

Kfam

The kfam service provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="kubeflow-profiles"}
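Because the profile-controller and kfam queries use the same `juju_charm="kubeflow-profiles"` selector, both return the same set of series. One way to make the result easier to read is to group the returned series by metric name. The sketch below assumes a JSON body in the shape returned by an instant query to the Prometheus HTTP API. The metric names, the `handler` label and the values in the sample are invented placeholders, not real kubeflow-profiles metrics.

```python
from collections import defaultdict

# Invented sample shaped like an instant-query response; every metric name,
# label value and sample value here is a placeholder.
SAMPLE_RESPONSE = {
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {
                "metric": {"__name__": "example_metric_a", "juju_charm": "kubeflow-profiles", "handler": "x"},
                "value": [1700000000, "12"],
            },
            {
                "metric": {"__name__": "example_metric_a", "juju_charm": "kubeflow-profiles", "handler": "y"},
                "value": [1700000000, "3"],
            },
            {
                "metric": {"__name__": "example_metric_b", "juju_charm": "kubeflow-profiles"},
                "value": [1700000000, "1"],
            },
        ],
    },
}


def series_by_metric_name(response: dict) -> dict:
    """Group the series of an instant-query response by their metric name."""
    grouped = defaultdict(list)
    for series in response["data"]["result"]:
        # The metric name is carried in the reserved __name__ label.
        name = series["metric"].get("__name__", "<unnamed>")
        grouped[name].append(series)
    return dict(grouped)


if __name__ == "__main__":
    for name, series in sorted(series_by_metric_name(SAMPLE_RESPONSE).items()):
        print(name, len(series))
```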

Tensorboard controller

The tensorboard-controller provides the following metrics:

You can check its metrics through the Prometheus or Grafana UI using the following query:

{juju_charm="tensorboard-controller"}
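The per-charm selectors in this guide can also be combined into a single query, for example to check which charms currently expose any series at all. The following sketch builds a regular-expression matcher (`=~`) over several charm names and wraps it in a `count by (juju_charm)` aggregation. The function names are hypothetical, and the name check assumes charm names consist only of lowercase letters, digits and hyphens, as the names in this guide do.

```python
import re

# Assumption: charm names contain only lowercase letters, digits and hyphens,
# so they can be joined into a regular expression without escaping.
_CHARM_NAME = re.compile(r"[a-z0-9-]+")


def any_charm_selector(charms: list) -> str:
    """Return a selector matching series that belong to any of the charms."""
    if not charms:
        raise ValueError("at least one charm name is required")
    for charm in charms:
        if not _CHARM_NAME.fullmatch(charm):
            raise ValueError(f"unexpected charm name: {charm!r}")
    # PromQL regex matchers are fully anchored, so "a|b" matches exactly a or b.
    return '{juju_charm=~"' + "|".join(charms) + '"}'


def series_count_query(charms: list) -> str:
    """Return a query that counts the series exposed by each listed charm."""
    return f"count by (juju_charm) ({any_charm_selector(charms)})"


if __name__ == "__main__":
    print(series_count_query(["argo-controller", "minio", "tensorboard-controller"]))
```

A charm that is missing from the result of the printed query has no series matching the selector at query time, which can be a quick hint that its metrics are not being scraped.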