update containerd to v2.0.2 #3828
Conversation
/retest
I plan on doing some more local testing, but this looks good to me and CI is happy. /lgtm
```go
}
var snapshotter string
switch configVersion {
case 2: // Introduced in containerd v1.3. Still supported in containerd v2.
```
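A sketch of how a switch like the one in the diff might select the CRI plugin table per config schema version; the version-3 plugin ID (`io.containerd.cri.v1.images`) is an assumption based on containerd's CRI config-versions doc (where the CRI plugin is split into separate images and runtime plugins), not kind's actual code:

```go
package main

import "fmt"

// criPluginID returns the TOML table name under [plugins] that holds
// CRI image settings (such as the snapshotter) for a given containerd
// config schema version. The version-3 ID is an assumption based on
// containerd's CRI config docs, not taken from kind's code.
func criPluginID(configVersion int) (string, error) {
	switch configVersion {
	case 2: // Introduced in containerd v1.3. Still supported in containerd v2.
		return "io.containerd.grpc.v1.cri", nil
	case 3: // Introduced in containerd v2.0; CRI split into images/runtime plugins.
		return "io.containerd.cri.v1.images", nil
	default:
		return "", fmt.Errorf("unsupported containerd config version: %d", configVersion)
	}
}

func main() {
	for _, v := range []int{2, 3} {
		id, err := criPluginID(v)
		if err != nil {
			panic(err)
		}
		fmt.Printf("version %d -> plugins.%q\n", v, id)
	}
}
```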
we probably need to allow user-supplied config patches to similarly target 2 vs 3 separately, though that doesn't need to be done in this PR
do we depend on the version of the config?
kind/pkg/cluster/internal/create/actions/config/config.go, lines 119 to 146 at 90d9439:
```go
if len(ctx.Config.ContainerdConfigPatches) > 0 || len(ctx.Config.ContainerdConfigPatchesJSON6902) > 0 {
	fns := make([]func() error, len(kubeNodes))
	for i, node := range kubeNodes {
		node := node // capture loop variable
		fns[i] = func() error {
			// read and patch the config
			const containerdConfigPath = "/etc/containerd/config.toml"
			var buff bytes.Buffer
			if err := node.Command("cat", containerdConfigPath).SetStdout(&buff).Run(); err != nil {
				return errors.Wrap(err, "failed to read containerd config from node")
			}
			patched, err := patch.TOML(buff.String(), ctx.Config.ContainerdConfigPatches, ctx.Config.ContainerdConfigPatchesJSON6902)
			if err != nil {
				return errors.Wrap(err, "failed to patch containerd config")
			}
			if err := nodeutils.WriteFile(node, containerdConfigPath, patched); err != nil {
				return errors.Wrap(err, "failed to write patched containerd config")
			}
			// restart containerd now that we've re-configured it
			// skip if containerd is not running
			if err := node.Command("bash", "-c", `! pgrep --exact containerd || systemctl restart containerd`).Run(); err != nil {
				return errors.Wrap(err, "failed to restart containerd after patching config")
			}
			return nil
		}
	}
	if err := errors.UntilErrorConcurrent(fns); err != nil {
		return err
```
do we depend on the version of the config?
The user's patches will not work as intended: they will wind up merging in keys that are not valid for v3, because all the keys have new namespacing again.
(and if they simply switch to v3 keys, it will not work with older kind images)
Ah, so you're referring to the "existing" user patches. I think it's a matter of how you see it, then ... for me those are containerd-specific patches, and yes, users need to adapt them to containerd 2.0 unless there is some containerd feature that allows migrating between formats.
for me those are containerd-specific patches, and yes, users need to adapt them to containerd 2.0 unless there is some containerd feature that allows migrating between formats
But then their config won't work for older kind releases/images.
The way we do this with kubeadm config patches is that you can have multiple patches targeting different kubeadm API versions. So far that's been irrelevant here because for nearly all of our history we've only had the version 2 containerd config format.
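A hypothetical sketch of version-targeted containerd patches in that kubeadm style: route each user patch by the config schema `version` it declares. The helper names, the regex-based detection, and the default-to-2 behavior are all assumptions for illustration, not kind's API:

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

var versionRe = regexp.MustCompile(`(?m)^\s*version\s*=\s*(\d+)\s*$`)

// patchVersion extracts the declared containerd config schema version
// from a user-supplied TOML patch, defaulting to 2 when absent
// (an assumed default for this sketch).
func patchVersion(patch string) int {
	if m := versionRe.FindStringSubmatch(patch); m != nil {
		v, _ := strconv.Atoi(m[1]) // regex guarantees digits
		return v
	}
	return 2
}

// selectPatches keeps only the patches targeting the node's config version,
// so v2 patches and v3 patches can coexist in one cluster config.
func selectPatches(patches []string, nodeVersion int) []string {
	var out []string
	for _, p := range patches {
		if patchVersion(p) == nodeVersion {
			out = append(out, p)
		}
	}
	return out
}

func main() {
	patches := []string{
		"version = 2\n[plugins.\"io.containerd.grpc.v1.cri\"]\n  sandbox_image = \"example/pause:3.9\"",
		"version = 3\n[plugins.\"io.containerd.cri.v1.images\"]\n  snapshotter = \"overlayfs\"",
	}
	fmt.Println(len(selectPatches(patches, 2)), len(selectPatches(patches, 3)))
}
```

With a scheme like this, a user's existing v2 patches would keep working on older kind images while new v3 patches target newer ones.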
But actually, that is for a future PR: because in this PR we're still using a v2.0 config (images/base/files/etc/containerd/config.toml), this part of the diff seems to be technically unnecessary for now.
I see, I got it now: so we keep using the containerd version 2 config format and let containerd deal with the translation to version 3 internally: https://github.com/containerd/containerd/blob/main/docs/cri/config.md#config-versions
Yeah, before we switch our config to v3 we should give users a way to deal with it smoothly, but we're not actually switching to v3 in this PR, even though this PR adds partial support for v3.
/lgtm I'm positively surprised the diff is so small ... IIRC we already had issues with
I don't think this addresses all of the issues, and since CRI only has v1 and not v1alpha2 in containerd v2, we break Kubernetes < 1.26 (which is probably OK, but worth noting: we have not actively broken an old k8s version for some time now).
CRI v1 seems to have been introduced in Kubernetes 1.20: kubernetes/kubernetes@9fcede9
v1 was added in 1.20 kubernetes/kubernetes#96387
ACK, definitely no problem with that. We will still need to make sure to note this for the release, and we should consider whether this is sufficient excuse to drop other < 1.20 bits from the project. We've been very lenient about dropping support outright so far. And re: other parts: #3828 (comment)
At some point < 1.19 will be widely unusable because of cgroups v2 anyhow.
Fix issue 3768 Signed-off-by: Akihiro Suda <[email protected]>
/retest
/lgtm Thanks
/lgtm Note: we have automated base image builds after merge, so most of the CI here is still using the current base image. After this merges we will send another PR to pick up the image; it's possible we'll find issues and revert. This is fine, just pointing it out in case. Sometimes we preemptively test by temporarily adding a commit using a locally built & pushed image, but only to test. We don't need to do that here, but we will get more signal when we adopt a built image.
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: AkihiroSuda, BenTheElder The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
Thanks for working on this and the upstream bug fixes, as always!
filed #3848 after the build job completed so we can keep this moving along
I did find an issue: #3848 (comment). We can mitigate, but the underlying cause may be worth investigating. Previously we did not wait for containerd to be ready because it was fully booted in ~0.3s even when cross-architecture, so we never had any issue; now that takes approximately 1s, so image pulls fail because the socket is not available yet. Mitigating that would be pretty trivial (we can either retry image pulls, which we should consider anyhow, watch for containerd to be ready before doing any pulls, or both), but that seems like a pretty big regression that will impact startup time at runtime as well (both initially and anytime we need to restart). Not sure if that's something the containerd project would track or just accept. cc @samuelkarp
(These times are v1.7.24 vs v2.0.2, arm64 builds on an amd64 GCE machine, more details in #3848 (comment))
Update: this is probably not worth discussing upstream, because 2.0.2 is still < 0.07s with amd64 on an amd64 host. It is, however, consistently longer than 1.7.24. Something with arm64 qemu must be even more pathological.
Moving further discussion to #3848 |
Fix #3768