At the moment, it looks like training can only be done by passing direct paths to text files. This makes it awkward to apply custom pre-processing, or to train on text held in a dataset rather than in files on disk.
Being able to train from an iterator would cover these scenarios. The sentencepiece library is one example: it accepts an iterator as the training input.
https://github.com/google/sentencepiece/tree/master/python#training-without-local-filesystem
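For illustration, here is a rough sketch of the workaround that file-only training seems to force today: pre-process the dataset lazily with a generator, stream the result into a temporary file, and hand that path to the trainer. The `preprocess` and `dump_to_tempfile` helpers and the sample `dataset` are hypothetical names made up for this example, and the trainer call itself is left out because it depends on the library's actual API.

```python
import tempfile
from typing import Iterable, Iterator


def preprocess(records: Iterable[dict]) -> Iterator[str]:
    """Yield one cleaned line per record (hypothetical pre-processing step)."""
    for record in records:
        # Collapse runs of whitespace and lowercase the text.
        text = " ".join(record["text"].split()).lower()
        if text:  # skip records that are empty after cleaning
            yield text


def dump_to_tempfile(lines: Iterable[str]) -> str:
    """Stream lines into a temporary text file and return its path."""
    with tempfile.NamedTemporaryFile(
        "w", suffix=".txt", delete=False, encoding="utf-8"
    ) as handle:
        for line in lines:
            handle.write(line + "\n")
        return handle.name


# Hypothetical in-memory dataset standing in for a real one.
dataset = [
    {"text": "Hello   World"},
    {"text": "   "},
    {"text": "Train on\titerators"},
]

# The resulting path can be passed to a trainer that only accepts file paths.
path = dump_to_tempfile(preprocess(dataset))
```

This works, but it writes the whole corpus to disk a second time, which is the overhead that training directly from an iterator would avoid.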