You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello @limhasic , could you please share more detail about the error you are getting?
The model should be able to handle NaN values in the raw dataset and you will also have the option to impute or generate NaN values in the synthetic data as well.
To impute, you just need to pass the token id of the NUMERIC_NA_TOKEN to the sample() method.
from realtabformer.data_utils import NUMERIC_NA_TOKEN
model = <REaLTabFormer Model>
model.fit(...)
data = model.sample(..., suppress_tokens=[model.vocab["decoder"]["token2id"][NUMERIC_NA_TOKEN]])
in paper
[
Missing values
No transformation is done for missing values present in the data.
We let the model learn the distribution of the missing values.
This strategy gives us the flexibility to let the model impute or generate missing values during the sampling process
]
but error occur by missing data
how do i have to?
The text was updated successfully, but these errors were encountered: