Kubeflow
- Kubeflow Charmers | bundle
- Cloud
Channel | Revision | Published |
---|---|---|
latest/candidate | 294 | 24 Jan 2022 |
latest/beta | 430 | 30 Aug 2024 |
latest/edge | 423 | 26 Jul 2024 |
1.9/stable | 432 | 03 Dec 2024 |
1.9/beta | 420 | 19 Jul 2024 |
1.9/edge | 431 | 03 Dec 2024 |
1.8/stable | 414 | 22 Nov 2023 |
1.8/beta | 411 | 22 Nov 2023 |
1.8/edge | 413 | 22 Nov 2023 |
1.7/stable | 409 | 27 Oct 2023 |
1.7/beta | 408 | 27 Oct 2023 |
1.7/edge | 407 | 27 Oct 2023 |
juju deploy kubeflow --channel latest/edge
Deploy Kubernetes operators easily with Juju, the Universal Operator Lifecycle Manager. Need a Kubernetes cluster? Install MicroK8s to create a full CNCF-certified Kubernetes system in under 60 seconds.
Platform:
This guide outlines some common issues, general methods to find out their cause, and their solutions.
Troubleshoot with Juju
Juju tracks the state of all applications it deploys. Any issues detected by the applications is picked up by Juju. You can check their status as follows:
juju status
See Troubleshoot your Juju deployment for more details.
Troubleshoot with kubectl
The kubectl
command provides information about the state of pods and services running on the cluster. To restrict the output to your kubeflow deployment, use the desired namespace, which is the name of the Juju model, kubeflow
in this case. For example:
kubectl get pods -n kubeflow
See Debug running pods for more details.
Common issues
Deployment
-
If some pods are stuck in ‘pending’ state, the most common cause is lacking storage capacity. Check that enough storage is allocated to the cluster and examine the persistent volume claims made by the pods.
-
If your Jupyter notebook server is stuck in ‘creating’ state, the most common cause is insufficient CPU available in your cluster. By default, notebook servers are created using 0.5 CPU. If this is the case, you can set the CPU share to 0 when you create the notebook server instead.
Dashboard
- In case of forgotten password, you can check the dex-auth user and password details as follows:
juju config dex-auth static-username
juju config dex-auth static-password
You can also set a new username and password bu running this:
juju config dex-auth static-username=admin
juju config dex-auth static-password=AxWiJjk2hu4fFga7
- If you are using dynamic hostname resolution, using a hostname ending with
nip.io
, to evaluate Charmed Kubeflow, you may encounter issues with DNS caching. By default, Ubuntu Server usessystemd resolved
for DNS caching. You can change its behaviour with the following commands:
sudo apt install -y resolvconf
sudo systemctl enable --now resolvconf.service
echo "nameserver 8.8.8.8" | sudo tee -a /etc/resolvconf/resolv.conf.d/head
sudo resolvconf -u
Other
- In some cases, Istio Pilot and Istio Gateway pods may not start as expected. This might be caused by Internet connectivity issues. Verify your connection is stable and has enough bandwidth.