You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Following discussion with @raffazizzi today, I've added more normalization to a new version of the Python script. The normalization function now does two things:
it ignores the content of any element tags during the collation process (so <p/> elements with distinctly different "location markers" in their attributes can still align and be considered invariant),
it lowercases all the input that collateX works with.
The new version of the Python script also ignores most markup. I decided for now to keep <p/> elements as inline-empty because a change in paragraphing is semantically significant. For the same reason I kept div markers, add and del. I'm thinking we might just want to output del, but suppress add.
The new output is coming out in the LessMarkupV2_xmlOutput directory.
I'll run a few more variations on this theme. Question: Is it necessary/useful to have "location flag" attributes appear in the collation output on the <p/> and <lb/> elements?
The text was updated successfully, but these errors were encountered:
@raffazizzi@Rikkm I'm running a fresh collation this morning. As I mentioned in our meeting on Tuesday, this collation will have better alignment because it's properly normalizing ampersands and markup. Also, I've made sure the <lb> elements are present so the text locations are clearly signaled.
Following discussion with @raffazizzi today, I've added more normalization to a new version of the Python script. The normalization function now does two things:
<p/>
elements with distinctly different "location markers" in their attributes can still align and be considered invariant),The new version of the Python script also ignores most markup. I decided for now to keep
<p/>
elements as inline-empty because a change in paragraphing is semantically significant. For the same reason I kept div markers, add and del. I'm thinking we might just want to output del, but suppress add.The new output is coming out in the LessMarkupV2_xmlOutput directory.
I'll run a few more variations on this theme. Question: Is it necessary/useful to have "location flag" attributes appear in the collation output on the
<p/>
and<lb/>
elements?The text was updated successfully, but these errors were encountered: