AbstractAggregator: Aggregation operations are no longer transparent downstream #102

ppanopticon · 2024-08-26T08:00:34Z

@rahelarnold98 and I just ran into an issue in the XReco context. Due to the recent changes to the AbstractAggregator, the "aggregated" Retrievable no longer holds only aggregated content. Instead, it holds the original content plus the aggregated content in a single Retrievable .

The question now is: By what mechanism can we restrict downstream operators to only work with the aggregated content? It's one thing if an operator is built from scratch. In our case, however, we mostly rely on existing operators that are being configured.

Quite frankly, this change broke our complete video extraction pipeline.

The text was updated successfully, but these errors were encountered:

lucaro · 2024-08-26T08:29:27Z

That is the downside of the append-only approach. We do have a mechanism to check the author of a content element via the ContentAuthorAttribute, so at least they are distinguishable.

ppanopticon · 2024-08-26T08:34:19Z

While this may very well be, I currently don't see a way to leverage this in a configurable fashion (i.e., without changing all the operators). Or have I overlooked something?

faberf · 2024-08-26T10:18:05Z

Hey @ppanopticon sorry for just now getting to this. Yes, with the approach we are going with all of the downstream consumers must filter the content using the contentauthorattribute using a configurable value in the extraction configuration. You can check the FES Extractor class for an example.

ppanopticon · 2024-08-26T11:04:44Z

Okay I see - thanks for the hint.

However, I would expect the author(s) of such a breaking change to actually adjust existing operators such that they can work as they did before opening a PR. In its current state, the change breaks the pipeline.

faberf · 2024-08-26T11:14:35Z

Yes, I agree, I actually hadn't considered that this change is breaking before. I will push a fix to dev today, and will edit the wiki.

ppanopticon · 2024-08-30T16:02:49Z

Fixed by PR #103

ppanopticon added the bug Something isn't working label Aug 26, 2024

ppanopticon added this to the Release Candidate #1 milestone Aug 26, 2024

ppanopticon assigned faberf and lucaro Aug 26, 2024

ppanopticon changed the title ~~AbstractAggregator: Aggregation Operations no longer transparent downstream~~ AbstractAggregator: Aggregation operations are no longer transparent downstream Aug 26, 2024

faberf mentioned this issue Aug 26, 2024

bugfix: added content authors to all extractors #103

Merged

ppanopticon closed this as completed Aug 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AbstractAggregator: Aggregation operations are no longer transparent downstream #102

AbstractAggregator: Aggregation operations are no longer transparent downstream #102

ppanopticon commented Aug 26, 2024 •

edited

Loading

lucaro commented Aug 26, 2024

ppanopticon commented Aug 26, 2024

faberf commented Aug 26, 2024

ppanopticon commented Aug 26, 2024

faberf commented Aug 26, 2024

ppanopticon commented Aug 30, 2024

AbstractAggregator: Aggregation operations are no longer transparent downstream #102

AbstractAggregator: Aggregation operations are no longer transparent downstream #102

Comments

ppanopticon commented Aug 26, 2024 • edited Loading

lucaro commented Aug 26, 2024

ppanopticon commented Aug 26, 2024

faberf commented Aug 26, 2024

ppanopticon commented Aug 26, 2024

faberf commented Aug 26, 2024

ppanopticon commented Aug 30, 2024

ppanopticon commented Aug 26, 2024 •

edited

Loading