Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change match_empty_frames to empty_is_annotated #8888

Merged
merged 8 commits into from
Jan 8, 2025

Conversation

zhiltsov-max
Copy link
Contributor

@zhiltsov-max zhiltsov-max commented Dec 29, 2024

Motivation and context

Improves the implementation of the match_empty_frames quality setting to work in cases when only GT or DS annotations are empty. This allows to use precision as the primary metric in cases when all frames are required to have at least 1 annotation.

The previous implementation only affected matching empty annotations. The updated variant also counts any empty frames in ds_count and gt_count so that mismatched empty annotations also included in denominators of metrics.

Example:

annotations per frame old accuracy new accuracy old precision new precision
ds: [empty, 1 valid]; gt: [empty, 1 valid] 2/2 2/2 2/2 2/2
ds: [1 extra, 1 valid]; gt: [empty, 1 valid] 1/2 (only matches) 1/3 (empty != extra) 1/2 1/2
ds: [empty, 1 valid]; gt: [1 miss, 1 valid] 1/2 (only matches) 1/3 (empty != miss) 1/1 (only matches) 1/2

So, it allowed undesirable situations in which 1 and the only correct annotation in a job could be counted as 100% of precision. The updated option prevents this.

How has this been tested?

Checklist

  • I submit my changes into the develop branch
  • I have created a changelog fragment
  • I have updated the documentation accordingly
  • I have added tests to cover my changes
  • I have linked related issues (see GitHub docs)
  • I have increased versions of npm packages if it is necessary
    (cvat-canvas,
    cvat-core,
    cvat-data and
    cvat-ui)

License

  • I submit my code changes under the same MIT License that covers the project.
    Feel free to contact the maintainers if that's a concern.

Summary by CodeRabbit

  • Refactor

    • Renamed match_empty_frames to empty_is_annotated across multiple components and files
    • Updated terminology for handling empty frames in quality settings
  • Documentation

    • Updated API schema and documentation to reflect new parameter name
    • Clarified description of empty frame handling in quality reports
  • Tests

    • Updated test cases to use new parameter name
    • Verified metrics calculation with updated quality settings

Copy link
Contributor

coderabbitai bot commented Dec 29, 2024

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

The pull request introduces a comprehensive renaming of a property related to frame matching across multiple files in the CVAT (Computer Vision Annotation Tool) project. The change involves replacing match_empty_frames with empty_is_annotated in various components, including core TypeScript files, Django models, serializers, UI components, and test assets. This modification spans frontend, backend, and testing layers, ensuring consistent terminology for handling empty frame annotations.

Changes

File Path Change Summary
cvat-core/src/quality-settings.ts Renamed private member and methods from #matchEmptyFrames to #emptyIsAnnotated
cvat-core/src/server-response-types.ts Updated SerializedQualitySettingsData interface to use empty_is_annotated
cvat-ui/src/components/quality-control/ Updated property names and labels in quality control components
cvat/apps/quality_control/migrations/ Added migration script to rename database field
cvat/apps/quality_control/models.py Renamed model field from match_empty_frames to empty_is_annotated
cvat/apps/quality_control/quality_reports.py Updated attribute and documentation for empty frame handling
cvat/schema.yml Updated OpenAPI schema with new property name
tests/ Updated test files and assets with new property name

Poem

🐰 A Rabbit's Ode to Renaming 🔤

Empty frames, once matched with care,
Now annotated with flair so rare
From old to new, a simple twist
Code evolves, no frames dismissed!
Hop along, semantic delight! 🌟


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR. (Beta)
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (2)
tests/python/shared/assets/cvat_db/data.json (1)

18176-18176: Consider adding test cases with empty_is_annotated: true.

All current test configurations set empty_is_annotated to false. Given that the PR aims to enhance functionality for scenarios where either GT or DS annotations are empty, it would be beneficial to include test cases with empty_is_annotated: true to ensure both behaviors are properly tested.

Would you like me to help create additional test configurations with empty_is_annotated: true?

Also applies to: 18200-18200, 18224-18224, 18248-18248, 18272-18272, 18296-18296, 18320-18320, 18344-18344, 18368-18368, 18392-18392, 18416-18416, 18440-18440, 18464-18464, 18488-18488, 18512-18512, 18536-18536, 18560-18560, 18584-18584, 18608-18608, 18632-18632, 18656-18656, 18680-18680, 18704-18704, 18728-18728

cvat/schema.yml (1)

9778-9782: API schema changes look good but consider versioning strategy.

The addition of the empty_is_annotated property to quality settings is well-documented and the implementation is consistent across schema definitions. The property serves a clear purpose in controlling how empty frames affect quality metrics.

Since this is a breaking change to the API schema, consider:

  1. Implementing API versioning (e.g., using URL versioning like /api/v1/ vs /api/v2/)
  2. Adding version compatibility notes in the schema description
  3. Documenting migration steps for API clients

Also applies to: 10285-10289

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 26f08ce and 17bc920.

📒 Files selected for processing (12)
  • cvat-core/src/quality-settings.ts (4 hunks)
  • cvat-core/src/server-response-types.ts (1 hunks)
  • cvat-ui/src/components/quality-control/quality-control-page.tsx (1 hunks)
  • cvat-ui/src/components/quality-control/task-quality/quality-settings-form.tsx (3 hunks)
  • cvat/apps/quality_control/migrations/0006_rename_match_empty_frames_qualitysettings_empty_is_annotated.py (1 hunks)
  • cvat/apps/quality_control/models.py (1 hunks)
  • cvat/apps/quality_control/quality_reports.py (5 hunks)
  • cvat/apps/quality_control/serializers.py (2 hunks)
  • cvat/schema.yml (2 hunks)
  • tests/python/rest_api/test_quality_control.py (2 hunks)
  • tests/python/shared/assets/cvat_db/data.json (24 hunks)
  • tests/python/shared/assets/quality_settings.json (24 hunks)
🔥 Files not summarized due to errors (1)
  • tests/python/shared/assets/cvat_db/data.json: Error: Server error: no LLM provider could handle the message
✅ Files skipped from review due to trivial changes (2)
  • cvat/apps/quality_control/migrations/0006_rename_match_empty_frames_qualitysettings_empty_is_annotated.py
  • tests/python/shared/assets/quality_settings.json
🔇 Additional comments (25)
tests/python/shared/assets/cvat_db/data.json (1)

18176-18176: LGTM! Property renamed consistently across test data.

The renaming from match_empty_frames to empty_is_annotated has been applied consistently across all test configurations, maintaining the same default value of false.

Also applies to: 18200-18200, 18224-18224, 18248-18248, 18272-18272, 18296-18296, 18320-18320, 18344-18344, 18368-18368, 18392-18392, 18416-18416, 18440-18440, 18464-18464, 18488-18488, 18512-18512, 18536-18536, 18560-18560, 18584-18584, 18608-18608, 18632-18632, 18656-18656, 18680-18680, 18704-18704, 18728-18728

cvat/apps/quality_control/models.py (1)

238-238: Great name change for clarity!
The new empty_is_annotated field better expresses its purpose compared to the old match_empty_frames.

cvat-core/src/quality-settings.ts (5)

41-41: Addition of the private member field
Declaring #emptyIsAnnotated is consistent with the refactor; it cleanly encapsulates the property within the class.


63-63: Constructor alignment
Initializing #emptyIsAnnotated from initialData.empty_is_annotated ensures the new field is fully integrated with the serialized data structure.


203-204: Getter for emptyIsAnnotated
The getter neatly exposes the property for external usage. Looks good and adheres to the new naming convention.


207-208: Setter for emptyIsAnnotated
The setter properly handles the boolean value. Make sure to test any UI or logic flows that might rely on the old setter name.


239-239: Ensure correct serialization
Replacing match_empty_frames with empty_is_annotated in toJSON() maintains consistency between data models and server representation.

cvat/apps/quality_control/serializers.py (3)

95-95: Renamed field in fields list
Properly lists empty_is_annotated. Confirm that no references to match_empty_frames remain.


103-103: Added default constraint
The default False matches the legacy behavior, safeguarding backward compatibility.


169-170: Updated help text
The revised description accurately reflects the new behavior regarding empty frames. This is crucial for user understanding.

cvat-core/src/server-response-types.ts (1)

261-261: Consistent interface rename
Renaming match_empty_frames to empty_is_annotated aligns with the updated data model across the codebase. Good job!

cvat-ui/src/components/quality-control/quality-control-page.tsx (1)

226-226: Validate all usages of the new property
You have correctly replaced settings.matchEmptyFrames with settings.emptyIsAnnotated. Please ensure that any existing calls or references using the old name throughout the codebase (especially if it was documented or used in external components) are fully aligned with the new naming.

cvat-ui/src/components/quality-control/task-quality/quality-settings-form.tsx (4)

37-37: Consistency of initial data
The new property emptyIsAnnotated is properly integrated into initialValues. Verify that legacy references (if any) to matchEmptyFrames are fully replaced and that the new property is tested for form submission.


84-84: Tooltip clarity
The updated tooltip for "Empty frames are annotated" clearly conveys the new property’s purpose. Great work on ensuring user-friendly messaging.


201-201: Form item name alignment
Renaming the form item to emptyIsAnnotated matches the new property name. This is consistent with the rest of the changes. Excellent.


206-206: Accurate label
The checkbox label "Empty frames are annotated" clearly communicates the effect of the new setting. This is an improvement over the old naming (“Match empty frames”).

tests/python/rest_api/test_quality_control.py (1)

1216-1216: Check param coverage in tests
The introduction of "empty_is_annotated" in the parameter list is consistent with the updated naming. Ensure that you have sufficient test coverage for scenarios when empty_is_annotated is turned on/off.
[approve]

cvat/apps/quality_control/quality_reports.py (8)

218-218: Renaming field reflects the new meaning
Introducing empty_is_annotated: bool = False aligns with the overall refactor. Make sure any references or related logic for the old match_empty_frames field is removed or migrated.


220-222: Documentation clarity
These lines explain precisely how empty frames are interpreted if empty_is_annotated is enabled. The updated docstring is very clear. Good job!


1981-1990: Metric adjustment for simultaneously empty frames
Handling cases where both GT and DS frames have no annotations is critical for accurate metrics. The approach of setting valid counts when both are empty is correct. Confirm that the final reporting logic handles small edge cases (e.g., tasks with no annotations at all).


1992-1996: Validate partial emptiness
When either GT or DS is empty (but not both), these lines ensure the DS or GT shape counts are marked as 1. Verify that no additional side effects are introduced, such as artificially inflating counts where partial data is present in only one side.


2089-2100: Empty frames in confusion matrix
Here, you handle a scenario in which summary.total_count is set to 1 when a frame is entirely empty. This is consistent with the docstring expectations for empty_is_annotated. The logic looks good!


2124-2125: Edge case references
These lines track empty GT or DS frames separately. Ensure that related logic (like partial updates in a multi-job scenario) doesn’t incorrectly classify frames.


2131-2143: Confusion matrix emptiness check
The code properly checks whether the confusion matrix indicates any annotations. This is critical when adjusting metrics. Great attention to detail with these checks.


2156-2162: Accumulate valid counts for empty frames
The logic of adding a valid count for frames that are empty on both sides is well-implemented. This ensures that metrics reflect successful “matches” when no annotations exist.

tests/python/rest_api/test_quality_control.py Outdated Show resolved Hide resolved
@codecov-commenter
Copy link

codecov-commenter commented Dec 30, 2024

Codecov Report

Attention: Patch coverage is 10.52632% with 34 lines in your changes missing coverage. Please review.

Project coverage is 73.92%. Comparing base (9a25291) to head (2f27b90).
Report is 10 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8888      +/-   ##
===========================================
+ Coverage    73.89%   73.92%   +0.02%     
===========================================
  Files          408      411       +3     
  Lines        44131    44246     +115     
  Branches      3986     3993       +7     
===========================================
+ Hits         32611    32708      +97     
- Misses       11520    11538      +18     
Components Coverage Δ
cvat-ui 78.38% <72.13%> (-0.02%) ⬇️
cvat-server 70.13% <9.09%> (+0.06%) ⬆️

@Marishka17 Marishka17 requested review from SpecLad and removed request for Marishka17 January 7, 2025 12:13
# Add virtual annotations for empty frames
valid_labels_count = 1
total_labels_count = 1
if not gt_item.annotations and not ds_item.annotations:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like there's more going on here than just a renaming. Could you describe the behavior change in the changelog entry?

Copy link
Contributor Author

@zhiltsov-max zhiltsov-max Jan 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes. The previous implementation only affected matching empty annotations. The updated variant also counts any empty frames in ds_count and gt_count so that mismatched empty annotations also included in denominators of metrics.

Example:

annotations per frame old accuracy new accuracy old precision new precision
ds: [empty, 1 valid]; gt: [empty, 1 valid] 2/2 2/2 2/2 2/2
ds: [1 extra, 1 valid]; gt: [empty, 1 valid] 1/2 (only matches) 1/3 (empty != extra) 1/2 1/2
ds: [empty, 1 valid]; gt: [1 miss, 1 valid] 1/2 (only matches) 1/3 (empty != miss) 1/1 (only matches) 1/2

So, it allowed undesirable situations in which 1 and the only correct annotation in a job could be counted as 100% of precision. The updated option prevents this.

Added this into the PR description.

cvat/apps/quality_control/quality_reports.py Outdated Show resolved Hide resolved
Copy link

sonarqubecloud bot commented Jan 7, 2025

@SpecLad SpecLad merged commit c5f2272 into develop Jan 8, 2025
35 checks passed
@SpecLad SpecLad deleted the zm/empty-is-annotated branch January 8, 2025 10:21
This was referenced Jan 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants