Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Quickstart regressions #5247

Open
NillsF opened this issue Nov 7, 2024 · 1 comment
Open

Quickstart regressions #5247

NillsF opened this issue Nov 7, 2024 · 1 comment
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@NillsF
Copy link

NillsF commented Nov 7, 2024

/kind bug

What steps did you take and what happened:

  • Followed quickstart instructions
  • Issues with:
    • Node image for v 1.31.0 -- could only get images up to version 1.30.3
    • Calico image not found -- using version 1.36 from the template
    • Default template includes userAssignedIdentity to the AzureMachineTemplate for the control plate. That UserAssignedIdentity could not be found.

What did you expect to happen:

  1. Either the quickstart should use an older node image version - or newer images should be made available quicker.
  2. Calico version should be defined in the quickstart to avoid issues.
  3. I was not expecting references to resources that are not created as part of the quickstart.

Anything else you would like to add:
Logs:

kubectl logs -n capz-system capz-controller-manager-59544898c6-vz267
"Reconciler error" err="failed to init machine scope cache: failed to get default image: no VM image found for publisher \"cncf-upstream\" offer \"capi\" sku \"ubuntu-2204-gen1\" with Kubernetes version \"v1.30.5\""

Calico:

kubectl describe pods -n tigera-operator --kubeconfig=./capi-quickstart.kubeconfig
Name:             tigera-operator-5c8c9965d9-k4mxf
Namespace:        tigera-operator
Priority:         0
Service Account:  tigera-operator
Node:             capi-quickstart1-control-plane-w678g/10.0.0.4
Start Time:       Thu, 07 Nov 2024 16:25:50 +0000
Labels:           k8s-app=tigera-operator
                  name=tigera-operator
                  pod-template-hash=5c8c9965d9
Annotations:      <none>
Status:           Pending
IP:               10.0.0.4
IPs:
  IP:           10.0.0.4
Controlled By:  ReplicaSet/tigera-operator-5c8c9965d9
Containers:
  tigera-operator:
    Container ID:  
    Image:         mcr.microsoft.com/oss/tigera/operator:v1.36.0
    Image ID:      
    Port:          <none>
    Host Port:     <none>
    Command:
      operator
    State:          Waiting
      Reason:       ImagePullBackOff
    Ready:          False
    Restart Count:  0
    Environment Variables from:
      kubernetes-services-endpoint  ConfigMap  Optional: true
    Environment:
      WATCH_NAMESPACE:                     
      POD_NAME:                            tigera-operator-5c8c9965d9-k4mxf (v1:metadata.name)
      OPERATOR_NAME:                       tigera-operator
      TIGERA_OPERATOR_INIT_IMAGE_VERSION:  v1.36.0
    Mounts:
      /var/lib/calico from var-lib-calico (ro)
      /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-2j2dz (ro)
Conditions:
  Type                        Status
  PodReadyToStartContainers   True 
  Initialized                 True 
  Ready                       False 
  ContainersReady             False 
  PodScheduled                True 
Volumes:
  var-lib-calico:
    Type:          HostPath (bare host directory volume)
    Path:          /var/lib/calico
    HostPathType:  
  kube-api-access-2j2dz:
    Type:                    Projected (a volume that contains injected data from multiple sources)
    TokenExpirationSeconds:  3607
    ConfigMapName:           kube-root-ca.crt
    ConfigMapOptional:       <nil>
    DownwardAPI:             true
QoS Class:                   BestEffort
Node-Selectors:              kubernetes.io/os=linux
Tolerations:                 :NoExecute op=Exists
                             :NoSchedule op=Exists
Events:
  Type     Reason     Age                From               Message
  ----     ------     ----               ----               -------
  Normal   Scheduled  25s                default-scheduler  Successfully assigned tigera-operator/tigera-operator-5c8c9965d9-k4mxf to capi-quickstart1-control-plane-w678g
  Normal   BackOff    24s                kubelet            Back-off pulling image "mcr.microsoft.com/oss/tigera/operator:v1.36.0"
  Warning  Failed     24s                kubelet            Error: ImagePullBackOff
  Normal   Pulling    13s (x2 over 24s)  kubelet            Pulling image "mcr.microsoft.com/oss/tigera/operator:v1.36.0"
  Warning  Failed     13s (x2 over 24s)  kubelet            Failed to pull image "mcr.microsoft.com/oss/tigera/operator:v1.36.0": rpc error: code = NotFound desc = failed to pull and unpack image "mcr.microsoft.com/oss/tigera/operator:v1.36.0": failed to resolve reference "mcr.microsoft.com/oss/tigera/operator:v1.36.0": mcr.microsoft.com/oss/tigera/operator:v1.36.0: not found
  Warning  Failed     13s (x2 over 24s)  kubelet            Error: ErrImagePull

Managed identity issue:

E1107 16:14:30.868875       1 controller.go:324] "Reconciler error" err=<
        failed to get user-assigned identity ClientID: GET https://management.azure.com/subscriptions/39914bf7-25ef-4594-8d3b-d10ec7d4611b/resourceGroups/capz-ci/providers/Microsoft.ManagedIdentity/userAssignedIdentities/cloud-provider-user-identity
        --------------------------------------------------------------------------------
        RESPONSE 404: 404 Not Found
        ERROR CODE: ResourceGroupNotFound
        --------------------------------------------------------------------------------
        {
          "error": {
            "code": "ResourceGroupNotFound",
            "message": "Resource group 'capz-ci' could not be found."
          }
        }
        --------------------------------------------------------------------------------

Environment:

  • cluster-api-provider-azure version:
clusterctl version: &version.Info{Major:"1", Minor:"8", GitVersion:"v1.8.4", GitCommit:"3cce0d973682f11ab0f0ba1c2522eba66dac2d91", GitTreeState:"clean", BuildDate:"2024-10-08T15:37:26Z", GoVersion:"go1.22.7", Compiler:"gc", Platform:"linux/amd64"}
  • Kubernetes version: (use kubectl version):
Client Version: v1.31.1
Kustomize Version: v5.4.2
Server Version: v1.31.0
  • OS (e.g. from /etc/os-release):
NAME="Ubuntu"
VERSION="20.04.6 LTS (Focal Fossa)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 20.04.6 LTS"
VERSION_ID="20.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=focal
UBUNTU_CODENAME=focal
@k8s-ci-robot k8s-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label Nov 7, 2024
@junkiebev
Copy link

junkiebev commented Nov 11, 2024

This issue with Calico is that mcr.microsoft.com/oss/tigera/operator:v1.36.0 doesn't exist - the latest tag is v1.35.0 in the mcr repository.

appending --set tigeraOperator.version=v1.35.0 allows the CNI to roll-out

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug.
Projects
Status: Todo
Development

No branches or pull requests

3 participants