Releases: ufal/morphodita
Releases · ufal/morphodita
MorphoDiTa 1.11.2
Changes since 1.11.0:
- Fix a bug in the tokenizers, which did not end a sentence on an empty line if the preceding line ended with whitespace. However, only the REST server and the API was affected; the
run_*
binaries read input a paragraph at a time, and so circumvent the problem. - Fix warnings in Clang 15 by qualifying
std::move
. - Do not assume char is signed; fixes silent errors on aarch64 when compiled from the source (e.g. in Docker on ARM macOS).
MorphoDiTa 1.11.0
Changes since 1.10.1:
- When analyzing numerals, if the form ends with a dot, keep the dot in the lemma (previously the dot was discarded).
- Correctly handle lemma number which is not null-terminated in
generate
. - On Windows, the file paths are now UTF-8 encoded, instead of ANSI. This change affects the API, binary arguments, and program outputs.
- Add ARM64 macOS build.
- The Windows binaries are now compiled with VS 2019, older systems than Windows 7 are no longer supported.
- Add billing information to the REST server.
- Add type information to the Python package.
- The Python wheels are provided for Pythons 3.6-3.11.
MorphoDiTa 1.10.1
Changes since 1.9.2:
- Allow overriding path to Java library in the bindings.
- Document statistical guesser training using train_guesser binary.
- Improve the GRU tokenizer.
- Do not generate empty lemmas in guesser.
- Add explicit GUESSER_UNSPECIFIED guesser mode already used by tagger.
- Fix unaligned pointer access and other UndefinedBehaviourSanitizer findings.
- Fix bug in tag_filter incorrectly matching negates sets of 2+ characters.
- Add several REST server options.
- Add methods running tagset_converter during derivation_formatter.
This allows correct handling of their interactions. - Add binary wheels of the Python package.
MorphoDiTa 1.9.2
Changes since 1.9.1:
- Fix regression in
morpho::generate
caused in 1.9.1. - Add check in
encode_dictionary
that no lemma is repeated.
MorphoDiTa 1.9.1 - contains a regression!
Please use MorphoDiTa 1.9.2 which fixes regression caused by 1.9.1.
Changes since 1.9.0-prerelease:
- Allow loading larger dictionaries by increasing the internal limit of size of search structures created during loading.
- Export
string_piece
using correct namespace insrc_lib_only/morphodita.h
.
MorphoDiTa 1.9.0-prerelease
Changes since 1.3.0:
- Use t2t_docsys http://github.com/ufal/t2t_docsys as documentation system.
- Use C++ Builtem http://github.com/ufal/cpp_builtem as build system.
- Add C# bindings.
- Remove support for shared library build.
- Change license from LGPL to MPL 2.0.
- Use C++ iostreams instead of C stdio.
- Start using Semantic Versioning http://semver.org/.
- Add REST server using MicroRestD http://github.com/ufal/microrestd.
- Add
tagger::tag_analyzed
method. - Add
derivator
andderivation_formatter
classes.
MorphoDiTa 1.3.0
MorphoDiTa 1.2.0
Changes since 1.1.0:
- Add
strip_lemma_comment
andstrip_lemma_id
tag set converters.
MorphoDiTa 1.1.0
Changes since 1.0.0:
- Allow English negative prefix rules to overlap, allowing to use also "non-" as negative prefix in additon to "non".
- Allow passing tagger model instead of morphological model to
run_morpho_analyze
,run_morpho_cli
andrun_morpho_generate
binaries. - Add
--version
option printing current version of binaries. - Clarify stable/development versions.
MorphoDiTa 1.0.0
First public release of MorphoDiTa 1.0.0.