Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tidy watcher #114209

Open
wants to merge 6 commits into
base: master
Choose a base branch
from
Open

tidy watcher #114209

wants to merge 6 commits into from

Conversation

klensy
Copy link
Contributor

@klensy klensy commented Jul 29, 2023

MVP of tidy watcher:

Allows to check that changes to some pieces of text synced in different files (or at least warn about it). Should resolve situations like: "Oh no, I've changed that constant here and here, but forgot to update it in other 3 places."

Current state is very MVP'ish, so it simply works and nothing more, posted to get some feedback.

Usage: wrap text that need to be synced with arbitrarily chosen tags:

some.rs:

//  tidy-keep-sync-with=tidy-ticket-foo
const FOO: usize = 42;
//  tidy-keep-sync-with=tidy-ticket-foo

some.sh:

#  tidy-keep-sync-with=tidy-ticket-foo
export FOO=42
#  tidy-keep-sync-with=tidy-ticket-foo

and add them to watcher.rs in tidy, adding into one group:

    add_group!(
        ("src/tools/opt-dist/src/main.rs", "728c2783154a52a30bdb1d66f8ea1f2a", "tidy-ticket-perf-commit"),
        ("src/ci/docker/host-x86_64/dist-x86_64-linux/Dockerfile", "76c8d9783e38e25a461355f82fcd7955", "tidy-ticket-perf-commit")
    ),

If hashes differs, actual and expected ones will be printed:

tidy error: The code blocks tagged with tidy watcher has changed.
It's likely that code blocks with the following tags need to be changed too. Check src/tools/tidy/src/watcher.rs, find t
ag/hash in TIDY_WATCH_LIST list and verify that sources for provided group of tags in sync. Once that done, run tidy aga
in and update hashes in TIDY_WATCH_LIST with provided actual hashes.
tidy error: hash for tag `tidy-ticket-sess-time-item_types_checking` in path `compiler/rustc_hir_analysis/src/lib.rs` mi
smatch:
  actual: `842e23fb65caf3a96681686131093316`, expected: `842e23fb65caf3a96681686131093317`
  Verify that tags `["tidy-ticket-sess-time-item_types_checking", "tidy-ticket-sess-time-item_types_checking"]` in sync.

Added few sync me commented code as examples.

--bless implemented, but i don't like current json config, it's huge.

r? @ghost

@rustbot rustbot added A-testsuite Area: The testsuite used to check the correctness of rustc S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-bootstrap Relevant to the bootstrap subteam: Rust's build system (x.py and src/bootstrap) T-infra Relevant to the infrastructure team, which will review and decide on the PR/issue. labels Jul 29, 2023
@rust-log-analyzer

This comment has been minimized.

@klensy klensy force-pushed the better-than-remembrall branch from 8b8f8f8 to 434286f Compare July 29, 2023 16:44
@rustbot rustbot added the WG-trait-system-refactor The Rustc Trait System Refactor Initiative (-Znext-solver) label Jul 30, 2023
@klensy
Copy link
Contributor Author

klensy commented Jul 30, 2023

Added few more entries a49ed53, but i don't know if all of them actually sync as of now, this should be reviewed.

@klensy klensy force-pushed the better-than-remembrall branch from a49ed53 to ca2cbb3 Compare July 31, 2023 09:48
@bors
Copy link
Contributor

bors commented Jul 31, 2023

☔ The latest upstream changes (presumably #114294) made this pull request unmergeable. Please resolve the merge conflicts.

@klensy klensy force-pushed the better-than-remembrall branch from ca2cbb3 to 3f2a374 Compare September 7, 2023 16:19
@rustbot rustbot added the O-unix Operating system: Unix-like label Sep 7, 2023
@rust-log-analyzer

This comment has been minimized.

@bors
Copy link
Contributor

bors commented Sep 18, 2023

☔ The latest upstream changes (presumably #115795) made this pull request unmergeable. Please resolve the merge conflicts.

@klensy klensy force-pushed the better-than-remembrall branch from 3f2a374 to e4c6a7c Compare October 2, 2023 17:22
@klensy klensy marked this pull request as ready for review October 2, 2023 17:55
@rustbot
Copy link
Collaborator

rustbot commented Oct 2, 2023

These commits modify the Cargo.lock file. Unintentional changes to Cargo.lock can be introduced when switching branches and rebasing PRs.

If this was unintentional then you should revert the changes before this PR is merged.
Otherwise, you can ignore this comment.

Some changes occurred to the CTFE / Miri engine

cc @rust-lang/miri

Some changes might have occurred in exhaustiveness checking

cc @Nadrieril

Some changes occurred to the core trait solver

cc @rust-lang/initiative-trait-system-refactor

@klensy
Copy link
Contributor Author

klensy commented Oct 2, 2023

Okay. Implementation is trivial, but most of asserted pieces of code in compiler, so r? compiler. Didn't checked if current state is sync, as i don't always understand whats functions do.

@rustbot rustbot added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Oct 2, 2023
@@ -149,6 +149,7 @@ where
None => {
// For unsized types with an extern type tail we perform no adjustments.
// NOTE: keep this in sync with `PlaceRef::project_field` in the codegen backend.
// FIXME: that one? compiler/rustc_codegen_ssa/src/mir/place.rs
Copy link
Member

@RalfJung RalfJung Oct 2, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Specifically this:

_ if self.llextra.is_none() => {
debug!(
"unsized field `{}`, of `{:?}` has no metadata for adjustment",
ix, self.llval
);
return simple();
}

I don't see how an automatic checker can help here. This is about keeping the semantics in sync.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I just run over codebase, found few comments like "keep in sync with" and wrapped them in tags. Of course this check can't say if two places is valid, only ping author to give it chance to check things manually.

@RalfJung
Copy link
Member

RalfJung commented Oct 2, 2023

If hashes differs, actual and expected ones will be printed:

This needs to give very clear guidance on what I should do now, otherwise it will be very frustrating.

@klensy
Copy link
Contributor Author

klensy commented Oct 2, 2023

If hashes differs, actual and expected ones will be printed:

This needs to give very clear guidance on what I should do now, otherwise it will be very frustrating.

It slightly more descriptive, forgot to update first message, but can be improved.

@@ -1144,6 +1150,7 @@ impl<'p, 'tcx> Fields<'p, 'tcx> {
})
}

// tidy-ticket-wildcards
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would have expected that we reuse the tag for all the places that need to be in sync. E.g. if I modify wildcards() here and not arity(), it would ping me "you should also check arity".

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added grouping for tags, should write

tidy error: hash for tag `tidy-ticket-ast-from_token` in path `compiler/rustc_ast/src/token.rs` mismatch:
  actual: `eecacc0fe4c615f7c71409c3ebfcff80`, expected: `70666de80ab0194a67524deeda3c01b8`
  Verify that tags `["tidy-ticket-ast-from_token", "tidy-ticket-ast-can_begin_literal_maybe_minus", "tidy-ticket-rustc_p
arse-can_begin_literal_maybe_minus"]` in sync.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice! Could you make it explain what it's about for those who don't know? Something like: "the code block tagged with tidy-ticket-ast-from_token has changed. It's likely that code blocks with the following tags need to be changed too. Once that is done, run tidy again with the --blessed flag to update the hashes. The related tags are ["tidy-ticket-ast-from_token", "tidy-ticket-ast-can_begin_literal_maybe_minus", "tidy-ticket-rustc_parse-can_begin_literal_maybe_minus"]"

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added tidy-keep-sync-with= and updated error message with parts from your suggestion.

@Nadrieril
Copy link
Member

Nadrieril commented Oct 2, 2023

The tag should have a name that explains what it's for if we encounter it in code. Something like tidy-keep-sync-with=<tag>.

@klensy klensy force-pushed the better-than-remembrall branch from e4c6a7c to fa7b8ae Compare October 4, 2023 15:46
@klensy
Copy link
Contributor Author

klensy commented Oct 4, 2023

The tag should have a name that explains what it's for if we encounter it in code. Something like tidy-keep-sync-with=<tag>.

Don't want to implement smarter parsing for tags (at least for now), can just make it longer, like tidy-keep-sync-try_visit_primitive tidy-keep-sync-visit_value?

Anyway, accidental removing of tag from code will enrage tidy, which will say something special about it.

@Nadrieril
Copy link
Member

Nadrieril commented Oct 4, 2023

You don't need to do any parsing, just allow "=" in the list of allowed symbols in a tag. I just think it's more legible

@Nadrieril
Copy link
Member

You might want to wait and see if compiler contributors actually want this before investing more time in it though. As far as I am concerned this adds unnecessary friction; I'm waiting to hear from ppl who work on files where this would be helpful.

@alex-semenyuk

This comment was marked as resolved.

@alex-semenyuk

This comment was marked as resolved.

@jieyouxu
Copy link
Member

jieyouxu commented Nov 11, 2024

Re-nominating this PR for T-compiler triage meeting to gauge team interest.


Notes for T-compiler triage meeting

This experimental PR proposes a tidy watcher mechanism which is intended to check if pieces of code between multiple locations are kept in sync properly by comparing their hashes.

Yes, this is for code that will not emit errors; for example git hash for rustc-perf in dockerfile and opt-dist, or code commented in style "please keep me in sync", which will not immediately cause errors, but will silently do wrong things; or assert pieces in rust, bash, python code simultaneously (which will hard to sync other way).

How does the team feel about such a mechanism:

  • Do we have the interest for such a mechanism?
  • Do we have concerns about the mechanism design (see below)?
  • Do we have the resources available to support the PR author in finishing the (initial) implementation?

Known implementation limitations and concerns (as of writing this comment)

  • It currently does not have a --bless mechanism in place to update the hashes if the changes are intentional, e.g. from a larger refactor PR, cf. tidy watcher #114209 (comment).
  • It's not super clear how to check "keep in sync" (semantic in-sync?), cf. tidy watcher #114209 (comment).
  • It's not clear if team members and compiler contributors actually want this mechanism because it can add friction, cf. tidy watcher #114209 (comment)
    • We can propose an MCP for this, but I wanted to check for some initial preferences in the triage meeting.

@rustbot label +I-compiler-nominated +S-waiting-on-team -S-waiting-on-author

Also cc @rust-lang/bootstrap because this concerns tidy which we (T-bootstrap) maintain.

@rustbot rustbot added I-compiler-nominated Nominated for discussion during a compiler team meeting. S-waiting-on-team Status: Awaiting decision from the relevant subteam (see the T-<team> label). and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Nov 11, 2024
@klensy
Copy link
Contributor Author

klensy commented Nov 11, 2024

This PR was already discussed: https://rust-lang.zulipchat.com/#narrow/stream/238009-t-compiler.2Fmeetings/topic/.5Bweekly.5D.202023-10-26/near/398708149, so i need rebase this stuff and reread it again.

@apiraino
Copy link
Contributor

removing nomination, postponing a possible further discussion after the rebase (this summary will be useful anyway). Thanks @klensy for your patience.

@rustbot label -I-compiler-nominated

@rustbot rustbot removed the I-compiler-nominated Nominated for discussion during a compiler team meeting. label Nov 13, 2024
@apiraino apiraino added the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Nov 20, 2024
@apiraino apiraino removed the S-waiting-on-team Status: Awaiting decision from the relevant subteam (see the T-<team> label). label Dec 5, 2024
@klensy klensy force-pushed the better-than-remembrall branch from 0b311d0 to 429ca32 Compare January 17, 2025 11:21
@rustbot rustbot added the A-tidy Area: The tidy tool label Jan 17, 2025
@rustbot
Copy link
Collaborator

rustbot commented Jan 17, 2025

Some changes occurred in cfg and check-cfg configuration

cc @Urgau

Some changes occurred to the CTFE machinery

cc @rust-lang/wg-const-eval

Some changes occurred to MIR optimizations

cc @rust-lang/wg-mir-opt

Some changes occurred in exhaustiveness checking

cc @Nadrieril

Some changes occurred in coverage instrumentation.

cc @Zalathar

@RalfJung
Copy link
Member

Once that done, run tidy aga
in and update hashes in TIDY_WATCH_LIST with provided actual hashes.

Is there any way this could be automated? Currently this all seems somewhat tedious to use.

@klensy
Copy link
Contributor Author

klensy commented Jan 17, 2025

Okay, rebased.

Current state: rebased with almost mindlessly reapplied checks (so need to review if linked pieces of code actually sync) and added more fixme's to sync more codepieces.

Once that done, run tidy aga
in and update hashes in TIDY_WATCH_LIST with provided actual hashes.

Is there any way this could be automated? Currently this all seems somewhat tedious to use.

Yes, i'm thinking about way to implement bless #114209 (comment)

@klensy
Copy link
Contributor Author

klensy commented Jan 17, 2025

There exist https://github.com/stepchowfun/tagref with somehow similar idea, but looks like it can only check existence of tag links, not content.

@@ -109,6 +109,7 @@ fn coverage_ids_info<'tcx>(
// to any particular point in the control-flow graph.
// (Keep this in sync with the injection of `ExpressionUsed`
// statements in the `InstrumentCoverage` MIR pass.)
// FIXME: sync me
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This one is a semantic sync rather than a data sync, and violating it would cause a bunch of tests to break, so it's probably not worth the hassle of adding sync-check tags here.

@klensy
Copy link
Contributor Author

klensy commented Jan 17, 2025

Okay, bless works; but saving state into json still ugly.

@rust-log-analyzer

This comment has been minimized.

@@ -176,6 +176,7 @@ pub(crate) fn default_configuration(sess: &Session) -> Cfg {
//
// NOTE: These insertions should be kept in sync with
// `CheckCfg::fill_well_known` below.
// FIXME: sync me
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

default_configuration and CheckCfg::fill_well_known are loosely synced, there are some cfgs in default_configuration like emscripten_wasm_eh that shouldn't be in CheckCfg::fill_well_known and the inverse is also for some cfgs (miri, rustfmt, ...).

Is it possible to still apply this tidy annotation in that case ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This pr/feature can only assert that some spans unchanged, nothing more. It's up to author/reviewer check actual content. Should i wrap entire default_configuration/fill_well_known fns?

@rustbot
Copy link
Collaborator

rustbot commented Jan 18, 2025

This PR modifies src/bootstrap/src/core/config.

If appropriate, please update CONFIG_CHANGE_HISTORY in src/bootstrap/src/utils/change_tracker.rs.

Comment on lines +338 to +343
// tidy-keep-sync-with=tidy-ticket-sess-time-item_types_checking
// NOTE: These are copy/pasted from typeck/lib.rs and should be kept in sync with those changes.
let _ = tcx.sess.time("wf_checking", || {
tcx.hir().try_par_for_each_module(|module| tcx.ensure().check_mod_type_wf(module))
});
// tidy-keep-sync-with=tidy-ticket-sess-time-item_types_checking
Copy link
Contributor Author

@klensy klensy Jan 18, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks clearly desynced (or not), only similar thing is

tcx.sess.time("coherence_checking", || {
tcx.hir().par_for_each_module(|module| {
let _ = tcx.ensure().check_mod_type_wf(module);
});
for &trait_def_id in tcx.all_local_trait_impls(()).keys() {
let _ = tcx.ensure().coherent_trait(trait_def_id);
}
// these queries are executed for side-effects (error reporting):
let _ = tcx.ensure().crate_inherent_impls_validity_check(());
let _ = tcx.ensure().crate_inherent_impls_overlap_check(());
});

@bors
Copy link
Contributor

bors commented Jan 18, 2025

☔ The latest upstream changes (presumably #135678) made this pull request unmergeable. Please resolve the merge conflicts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-testsuite Area: The testsuite used to check the correctness of rustc A-tidy Area: The tidy tool O-unix Operating system: Unix-like S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. T-bootstrap Relevant to the bootstrap subteam: Rust's build system (x.py and src/bootstrap) T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-infra Relevant to the infrastructure team, which will review and decide on the PR/issue. WG-trait-system-refactor The Rustc Trait System Refactor Initiative (-Znext-solver)
Projects
None yet
Development

Successfully merging this pull request may close these issues.