You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Creating secondary parent/child associations for nodes is a fairly common use case. Alfresco itself uses this feature for workflow folders, for example.
Unfortunately, this does not play well with CascadeTracker. For example, when a folder is "linked" to the companyhome (as Alfresco does for workflows) by adding the companyhome as a secondary parent, CascadeTracker goes through the entire repo again to be added as a list of paths to the index document.
This results in a very large index size that may even exceed the size of the entire repo.
Secondary child associations worked fine in the old solr4 implementation until ASS introduced CascadeTracker. The only option we have today is to disable PATH tracking completely (alfresco.cascade.tracker.enabled=false) or remove all secondary child associations on folders/containers, which may brake use cases and functionality.
As a compromise, we should introduce something like alfresco.cascade.tracksecondaries.enabled to support PATH queries but avoid multiplying index size and tracking time and resources.
The text was updated successfully, but these errors were encountered:
As a proof of concept I removed the ~20 secondary child assocs on folders in a repo having ~40 mio nodes.
After reindexing the index shrinked from 1 TB to < 50 GB.
Creating secondary parent/child associations for nodes is a fairly common use case. Alfresco itself uses this feature for workflow folders, for example.
Unfortunately, this does not play well with CascadeTracker. For example, when a folder is "linked" to the companyhome (as Alfresco does for workflows) by adding the companyhome as a secondary parent, CascadeTracker goes through the entire repo again to be added as a list of paths to the index document.
This results in a very large index size that may even exceed the size of the entire repo.
Secondary child associations worked fine in the old solr4 implementation until ASS introduced CascadeTracker. The only option we have today is to disable PATH tracking completely (
alfresco.cascade.tracker.enabled=false
) or remove all secondary child associations on folders/containers, which may brake use cases and functionality.As a compromise, we should introduce something like
alfresco.cascade.tracksecondaries.enabled
to support PATH queries but avoid multiplying index size and tracking time and resources.The text was updated successfully, but these errors were encountered: