Name normalization for buoys #84

Open
pwoods25443 opened this issue Jan 6, 2019 · 0 comments

Some AIS devices on buoys and gear encode the battery charge state in the shipname. As a result, a single MMSI can have many names, like this:

BOATY 99%
BOATY 72%
BOATY 23%
BOATY 100%
BOATY 99%

In 2018 there were ~15k unique MMSIs with this pattern.

We can strip off the suffix to get a better identity with a regex like this:

REGEXP_EXTRACT(shipname.value, "(.*)[[:space:]][0-9]+%$")

Another ~2200 MMSIs match this pattern:

REGEXP_EXTRACT(shipname.value, "(.*?)[@]+[0-9]+V[0-9]?$")

e.g. NETFISHING@@@@@11V8
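For checking the two patterns outside BigQuery, here is a minimal Python sketch. The helper name `normalize_shipname` is hypothetical and not part of the pipeline. Python's `re` module has no `[[:space:]]` class, so `\s` stands in for it, and the sketch assumes the capture group should be lazy so that the whole run of whitespace or `@` characters is dropped along with the suffix.

```python
import re

# Patterns mirroring the two expressions in this issue (assumed equivalents).
# The lazy (.*?) group leaves the full separator run to the suffix part.
PERCENT_SUFFIX = re.compile(r"^(.*?)\s+[0-9]+%$")        # e.g. "BOATY 99%"
VOLTAGE_SUFFIX = re.compile(r"^(.*?)@+[0-9]+V[0-9]?$")   # e.g. "NETFISHING@@@@@11V8"


def normalize_shipname(name):
    """Return the name without a battery-state suffix, or unchanged if none matches."""
    for pattern in (PERCENT_SUFFIX, VOLTAGE_SUFFIX):
        match = pattern.match(name)
        if match:
            return match.group(1)
    return name


if __name__ == "__main__":
    for raw in ["BOATY 99%", "BOATY 100%", "NETFISHING@@@@@11V8", "BOATY"]:
        print(repr(raw), "->", repr(normalize_shipname(raw)))
```

Names that match neither pattern are returned as-is, so the helper can be applied to every shipname without filtering first.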
