fix: handle users/groups with no names in push_path #1082

tonyandrewmeyer · 2023-11-28T06:36:03Z

Ned Batchelder noticed that the unit tests failed on his machine because the temporary folder returned by TemporaryDirectory() belonged to a group with no name.

In theory, this could happen outside of the tests as well: someone could call push_path() on a path containing files whose owner or group has no name (e.g. because the user or group has been removed). This is presumably quite uncommon (it seems unlikely that the charm container would often have a user or group removed), but it's also simple to fix, so we might as well do that.
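For illustration, the failure mode described above can be sketched as follows. This is a minimal standalone example, not the code in this PR: `owner_names` is a hypothetical helper, and it assumes a Unix platform where the standard `pwd` and `grp` modules are available. Both `pwd.getpwuid` and `grp.getgrgid` raise `KeyError` when the ID has no entry, so the sketch falls back to `None`.

```python
import grp
import os
import pwd
from typing import Optional, Tuple


def owner_names(path: str) -> Tuple[Optional[str], Optional[str]]:
    """Return (user_name, group_name) for path, or None where no name exists.

    Hypothetical helper for illustration only; not part of ops.
    """
    info = os.lstat(path)
    try:
        user = pwd.getpwuid(info.st_uid).pw_name
    except KeyError:
        # The uid has no passwd entry (e.g. the user was removed).
        user = None
    try:
        group = grp.getgrgid(info.st_gid).gr_name
    except KeyError:
        # The gid has no group entry (e.g. the group was removed).
        group = None
    return user, group
```

The numeric IDs from `os.lstat` remain available either way, so only the names need a fallback.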

benhoyt

A couple of nit suggestions and one question about the testing.

test/test_model.py

benhoyt · 2023-11-28T23:51:46Z


+            with push_path.open('w') as f:
+                f.write('hello')
+            c.push_path(push_path, "/")
+        assert c.exists("/src.txt"), 'push_path failed: file "src.txt" missing at destination'


It's a bit weird that we're not actually testing that the user/group attributes are None. In fact, this test succeeds even if you comment out the patching. I guess there's not an easy way to test this, because in this code path we're not actually using the user/group attributes. We could test the logs generated, but probably not worth it.

It does seem strange that we're calling these functions on every file and then throwing away the data (_list_recursive only uses the type attributes). But I'm happy to keep it as is for now to avoid code churn.

Thoughts?

tonyandrewmeyer · Nov 29, 2023

> It's a bit weird that we're not actually testing that the user/group attributes are None. In fact, this test succeeds even if you comment out the patching. I guess there's not an easy way to test this, because in this code path we're not actually using the user/group attributes.

Yes, that was the issue I had 😞. The test does fail (error) without the model.py change (as long as the patching is there), because a KeyError will be raised, but it indeed doesn't validate what the alternate value is. If you comment out the patching, then it's just testing that the regular case (whatever user/group you're defaulting to) works.
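As a rough sketch of why the patching matters (not the actual test in this PR): with `pwd.getpwuid` and `grp.getgrgid` patched to raise `KeyError`, any lookup that does not guard against it errors out. `unguarded_owner_names` and `lookup_fails_when_names_missing` are hypothetical names, and the example assumes a Unix platform.

```python
import grp
import os
import pwd
import tempfile
from unittest import mock


def unguarded_owner_names(path):
    """Hypothetical unguarded lookup: lets KeyError escape to the caller."""
    info = os.lstat(path)
    return pwd.getpwuid(info.st_uid).pw_name, grp.getgrgid(info.st_gid).gr_name


def lookup_fails_when_names_missing() -> bool:
    """Return True if the unguarded lookup raises KeyError under patching."""
    with tempfile.TemporaryDirectory() as tmp:
        # Simulate an owner and group that have IDs but no names.
        with mock.patch("pwd.getpwuid", side_effect=KeyError), \
                mock.patch("grp.getgrgid", side_effect=KeyError):
            try:
                unguarded_owner_names(tmp)
            except KeyError:
                return True
    return False
```

Without the patching, the same call only exercises whatever user and group own the temporary directory on the machine running the tests.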

> We could test the logs generated, but probably not worth it.

Agreed.

> It does seem strange that we're calling these functions on every file and then throwing away the data (_list_recursive only uses the type attributes). But I'm happy to keep it as is for now to avoid code churn.
>
> Thoughts?

I found it odd too. At first I thought it was intending to set the user/group on the workload container (which seems dubious using the ID), but noticed that indeed the majority of the data is thrown away.

However, there's bleeding over into harness, because the test pebble's list_files uses the same method so in tests people get back all the info. (I could actually put it in a test there that checks that the names are None).

It does seem like it would be cleaner if _TestingPebbleClient had the _build_fileinfo that Container has in the current PR (or one that started with Container's and built on it like the current PR), and Container had something like:

    @staticmethod
    def _build_fileinfo(path: Union[str, Path]) -> pebble.FileInfo:
        """Constructs a FileInfo object by stat'ing a local path."""
        path = Path(path)
        if path.is_symlink():
            ftype = pebble.FileType.SYMLINK
        elif path.is_dir():
            ftype = pebble.FileType.DIRECTORY
        elif path.is_file():
            ftype = pebble.FileType.FILE
        else:
            ftype = pebble.FileType.UNKNOWN
        return pebble.FileInfo(
            path=str(path),
            name=path.name,
            type=ftype)

... except that FileInfo has last_modified and permissions as required fields. A stat is pretty cheap, but it's also being completely wasted here.

Container's _build_fileinfo doesn't have to return a FileInfo (although the name would need to change if it didn't). I guess it could return a TypedDict private to model.py that only had the path and type.

This would extend the value of this PR to actually making a (presumably imperceptible) performance improvement. But it also significantly increases the number of lines touched for what was originally a (I assume) fairly obscure bug.
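The TypedDict idea mentioned above could look roughly like the sketch below. Everything here is an assumption for illustration: `_PathInfo` and `build_path_info` are hypothetical names, and `FileType` is a local stand-in for `pebble.FileType` so the example runs without ops installed.

```python
import enum
from pathlib import Path
from typing import TypedDict, Union


class FileType(enum.Enum):
    """Stand-in for pebble.FileType (assumption, not the real enum)."""
    FILE = "file"
    DIRECTORY = "directory"
    SYMLINK = "symlink"
    UNKNOWN = "unknown"


class _PathInfo(TypedDict):
    """Hypothetical private type holding only the path and its type."""
    path: str
    type: FileType


def build_path_info(path: Union[str, Path]) -> _PathInfo:
    """Classify a local path without looking up owner or group names."""
    path = Path(path)
    # Check for a symlink first, since is_dir/is_file follow symlinks.
    if path.is_symlink():
        ftype = FileType.SYMLINK
    elif path.is_dir():
        ftype = FileType.DIRECTORY
    elif path.is_file():
        ftype = FileType.FILE
    else:
        ftype = FileType.UNKNOWN
    return {"path": str(path), "type": ftype}
```

A structure like this sidesteps the required last_modified and permissions fields on FileInfo, at the cost of a second, narrower type to maintain.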

benhoyt · Nov 29, 2023

Thanks for the write-up. Okay, I think let's leave this PR as just the bug-fix: it's definitely an improvement over what we have now. We can always consider the simplification/refactor later separately for performance reasons.

Co-authored-by: Ben Hoyt <[email protected]>

benhoyt

Looks good to me now, though linting is failing.

tonyandrewmeyer · 2023-11-29T01:01:18Z

> Looks good to me now, though linting is failing.

Fixed now. Do you have a suggestion for a second reviewer?

benhoyt · 2023-11-29T02:20:21Z

I'm fine without a second reviewer on this small bug fix. I'll merge as soon as the tests are passing.

Handle users/groups with no names.

aacee63

tonyandrewmeyer added the bug Something isn't working label Nov 28, 2023

tonyandrewmeyer requested a review from benhoyt November 28, 2023 06:36

benhoyt requested changes Nov 28, 2023


tonyandrewmeyer and others added 3 commits November 29, 2023 13:13

Fix parameter order.

a8b1642

Co-authored-by: Ben Hoyt <[email protected]>

Adjustments as per review.

7c0de8c

Add a testing test for this case also.

2ce6ca1

benhoyt approved these changes Nov 29, 2023


Shorten line.

554ff19

Merge branch 'main' into groups-with-no-name

165f02f

benhoyt merged commit 18abc17 into canonical:main Nov 29, 2023
19 checks passed

tonyandrewmeyer deleted the groups-with-no-name branch November 29, 2023 09:29
