Hi @avsolatorio,
I want to generate hundreds of thousands of rows using both the tabular and relational models. However, it is a bit slow because of the auto-regressive nature of transformers. Currently I am generating, e.g., 100k rows with the tabular model and then letting the child model generate the child rows.
Is there something I can do to optimize generation time for either the tabular or the child models? We touched on this a little in the past in "Is it possible to utilize Distributed Training and/or Parallel Sampling with the library?" (#15).
From my understanding, multi-GPU cannot be used for inference the way it can for training. However, instead of running a single
parent_model.sample(n_samples=100_000)
call, could I run multiple parent_model.sample() calls in parallel, each on a different CUDA device? Say, break the job into 10x10_000 and run 10 rtf_model.sample() calls, each on a different GPU card, or at least draw from a pool of GPU cards.
Can I use a single GPU card and run multiple parent_model.sample() calls in multiple threads in parallel? I suspect this would fail because the GPU memory would explode, right?
Is there any special argument I need to pass to parent_model.sample() to support this? Is it the device argument, to specify the CUDA device for each batch generation?
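To make the idea concrete, here is a rough multi-process sketch of what I have in mind. I'm assuming sample() accepts a device argument and that REaLTabFormer.load_from_dir is the way to reload a saved model; the model path is a placeholder, so please correct me if the API differs:

```python
import multiprocessing as mp
from concurrent.futures import ProcessPoolExecutor

import pandas as pd
from realtabformer import REaLTabFormer

MODEL_DIR = "rtf_model/idXXX"  # placeholder path to the saved parent model
N_TOTAL = 100_000
N_GPUS = 10

def sample_on_device(args):
    gpu_id, n_samples = args
    # Each worker loads its own copy of the model, so no state is shared
    # across processes.
    model = REaLTabFormer.load_from_dir(path=MODEL_DIR)
    # Assumption: sample() takes a `device` argument selecting the CUDA
    # device. If it doesn't, calling torch.cuda.set_device(gpu_id) before
    # sampling might be an alternative.
    return model.sample(n_samples=n_samples, device=f"cuda:{gpu_id}")

if __name__ == "__main__":
    chunks = [(i, N_TOTAL // N_GPUS) for i in range(N_GPUS)]
    # "spawn" avoids the CUDA-initialization issues that fork can cause.
    ctx = mp.get_context("spawn")
    with ProcessPoolExecutor(max_workers=N_GPUS, mp_context=ctx) as pool:
        parts = list(pool.map(sample_on_device, chunks))
    parent_df = pd.concat(parts, ignore_index=True)
```

Does that look like a sane pattern, or is there a recommended way to do this within the library?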
How can I partition and batch the rows of a child model? Currently, the relationship cardinality is learned by the model, so I can't specify it. I can estimate it outside of the library (e.g., how many orders a customer could have, how many products an order could have), and maybe I could even build such a model. However, is it possible to tell the child model how many child rows to create for each parent? If so, I could pass that number in from my external estimation.
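For the partitioning part, my current plan is to shard the generated parent rows and run the child model over each shard, potentially dispatching shards to different GPUs with the same pattern as above. A rough sketch, assuming the relational sampling signature from the README (input_unique_ids / input_df / gen_batch) and a placeholder foreign-key column:

```python
import pandas as pd
from realtabformer import REaLTabFormer

CHILD_MODEL_DIR = "rtf_child_model/idXXX"  # placeholder path
JOIN_ON = "customer_id"  # placeholder foreign-key column
N_SHARDS = 10

child_model = REaLTabFormer.load_from_dir(path=CHILD_MODEL_DIR)

def sample_children(parent_shard: pd.DataFrame) -> pd.DataFrame:
    # Assumption: this mirrors the relational sampling call from the
    # README. The model itself decides how many child rows each parent
    # gets, which is exactly the part I'd like to control externally.
    return child_model.sample(
        input_unique_ids=parent_shard[JOIN_ON],
        input_df=parent_shard.drop(JOIN_ON, axis=1),
        gen_batch=64,
    )

# `parent_df` is the concatenated output of the parent sampling above.
shard_size = -(-len(parent_df) // N_SHARDS)  # ceiling division
shards = [
    parent_df.iloc[i : i + shard_size]
    for i in range(0, len(parent_df), shard_size)
]
child_df = pd.concat(
    [sample_children(s) for s in shards], ignore_index=True
)
```

Sharding this way should at least bound memory per call, since each sample() only sees a slice of the parents.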
Thanks!