Interface with TextAnalysis.jl? #10

ericphanson · 2020-12-08T23:46:33Z

Documents could be TextAnalysis.StringDocuments and Corpus could be TextAnalysis.Corpus?

This would mean KeywordSearch complements TextAnalysis by providing fuzzy-matching capabilities for its StringDocument types and Corpuses of StringDocuments.
It would also make it easy to run other kinds of analysis (provided by TextAnalysis) beyond keyword searching on documents loaded into this library, since they would already be TextAnalysis.Corpuses.
Currently, we allow attaching arbitrary NamedTuple metadata to our documents and corpuses, which I use for storing document UUIDs and corpus UUIDs. This could perhaps be upstreamed to TextAnalysis so that the functionality is preserved.
This would also enlarge the dependency tree of KeywordSearch, since we would need to add a dependency on TextAnalysis (https://juliahub.com/ui/Packages/TextAnalysis/5Mwet/0.7.1?t=1).

This would be for the future (not the 0.3 release I want to get out soon).
