Skip to content

Introducing spacymoji

Compare
Choose a tag to compare
@ines ines released this 12 Oct 21:45
· 25 commits to master since this release

spaCy v2.0 extension and pipeline component for adding emoji meta data to Doc objects. Detects emoji consisting of one or more unicode characters, and can optionally merge multi-char emoji (combined pictures, emoji with skin tone modifiers) into one token. Human-readable emoji descriptions are added as a custom attribute, and an optional lookup table can be provided for your own descriptions.

Disclaimer: This extension only works in spaCy v2.0 (currently in alpha) and is still experimental.