Upgrade sklearnex #1378

Innixma · 2021-11-11T01:59:34Z

Issue #, if available:

Description of changes:

Upgrade sklearnex version
Add workaround to use sklearnex RF by setting oob_score=True
Refactor LinearRegression -> add time_limit support, speed up inference by 10x, speed up training by 20x+
Enable sklearnex in LR -> 20x+ training speedup
Refactor one-hot encoding (OHE) in LR -> 10x inference speedup
Refactor preprocessing in LR -> 10x inference speedup
Fix boolean feature handling in LR

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

szha · 2021-11-11T05:47:29Z

Job PR-1378-1 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/1/index.html

szha · 2021-11-11T05:53:25Z

Job PR-1378-2 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/2/index.html

napetrov · 2021-11-17T17:33:26Z

tabular/src/autogluon/tabular/models/rf/rf_model.py

                # FIXME: DAAL OOB score is broken, returns biased predictions. Without this optimization, can't compute Efficient OOF.
-                from daal4py.sklearn.ensemble import RandomForestClassifier, RandomForestRegressor
+                from sklearnex.ensemble import RandomForestClassifier, RandomForestRegressor
                logger.log(15, '\tUsing daal4py RF backend...')


Suggested change:

-                logger.log(15, '\tUsing daal4py RF backend...')
+                logger.log(15, '\tUsing sklearnex RF backend...')

Innixma · 2021-11-22

Good suggestion. I have updated this log message for the RF, KNN, and linear models.
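For readers skimming the thread, here is a minimal sketch of the guarded backend choice and the corrected log message discussed above. It is not the code in `rf_model.py`: the function name `pick_rf_backend` and the `is_available` hook are hypothetical, and the fallback warning text is an assumption. Only the `use_daal` flag, the log level 15, and the `sklearnex.ensemble` import path come from the diff.

```python
import importlib.util
import logging

logger = logging.getLogger(__name__)


def pick_rf_backend(use_daal: bool = False, is_available=None) -> str:
    """Return the module path to import RandomForest* estimators from.

    Hypothetical helper: `is_available` exists only so the choice can be
    exercised without installing sklearnex.
    """
    if is_available is None:
        # find_spec on a top-level package name returns None when it is absent.
        def is_available(name: str) -> bool:
            return importlib.util.find_spec(name) is not None

    if use_daal:
        if is_available("sklearnex"):
            logger.log(15, "\tUsing sklearnex RF backend...")
            return "sklearnex.ensemble"
        # Assumed fallback behaviour: stay on stock scikit-learn.
        logger.warning("\tsklearnex is not installed, using scikit-learn RF...")
    return "sklearn.ensemble"
```

A caller would then do something like `importlib.import_module(pick_rf_backend(use_daal=True))` and read `RandomForestClassifier` off the returned module, so the accelerated backend stays opt-in.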

agorshk · 2021-11-17T19:56:35Z

tabular/src/autogluon/tabular/models/rf/rf_model.py

@@ -35,9 +35,8 @@ def _get_model_type(self):
        if self.params_aux.get('use_daal', False):
            # Disabled by default because it appears to degrade performance
            try:
-                # TODO: Use sklearnex instead once a suitable toggle option is provided that won't impact future models
                # FIXME: DAAL OOB score is broken, returns biased predictions. Without this optimization, can't compute Efficient OOF.


The OOB score was fixed in scikit-learn-intelex version 2021.5.

Innixma · 2021-11-22

Thanks! This is very helpful. I added a new in-line comment to try the new version once it is released.

Innixma · 2021-12-31

Hi @agorshk, I tested 2021.5 and it is not yet fixed (train-time oob_score=True works, but not post-fit OOB).

Refer to uxlfoundation/scikit-learn-intelex#933 for more info

agorshk · 2022-01-10

Hi @Innixma, thanks for the report and reproducer. We'll take a look at this problem.
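As background on the term used in this thread, an out-of-bag (OOB) prediction for a row averages only the ensemble members whose bootstrap sample did not contain that row. The sketch below is a toy illustration using the standard library only: the base learner is a plain mean predictor rather than a decision tree, and the function name `bagged_oob_predictions` is hypothetical. It is not how scikit-learn, sklearnex, or AutoGluon compute OOB.

```python
import random


def bagged_oob_predictions(y, n_estimators=50, seed=0):
    """Toy bagging ensemble: each member predicts the mean of its bootstrap sample.

    Returns one OOB prediction per row, or None for a row that happened to be
    in-bag for every member.
    """
    rng = random.Random(seed)
    n = len(y)
    sums = [0.0] * n
    counts = [0] * n
    for _ in range(n_estimators):
        # Bootstrap: draw n row indices with replacement.
        in_bag = [rng.randrange(n) for _ in range(n)]
        member_prediction = sum(y[i] for i in in_bag) / n
        in_bag_rows = set(in_bag)
        # Only rows this member never saw receive its prediction.
        for i in range(n):
            if i not in in_bag_rows:
                sums[i] += member_prediction
                counts[i] += 1
    return [s / c if c else None for s, c in zip(sums, counts)]
```

Because every row is scored only by members that never trained on it, the averaged result behaves like a held-out estimate, which is why a biased OOB computation matters for the "Efficient OOF" optimization mentioned in the FIXME.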

szha · 2021-11-18T03:21:20Z

Job PR-1378-3 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/3/index.html

szha · 2021-11-22T19:45:44Z

Job PR-1378-4 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/4/index.html

szha · 2021-12-02T02:16:25Z

Job PR-1378-5 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/5/index.html

szha · 2021-12-16T04:56:40Z

Job PR-1378-6 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/6/index.html

szha · 2021-12-31T04:49:30Z

Job PR-1378-8 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/8/index.html

szha · 2022-03-04T00:58:52Z

Job PR-1378-14 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/14/index.html

szha · 2022-03-04T02:48:12Z

Job PR-1378-15 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/15/index.html

Innixma · 2022-03-04T22:49:54Z

Updates finalized

Note that RF does not yet use sklearnex by default due to a performance issue on KDDCup09-Upselling: uxlfoundation/scikit-learn-intelex#984

yinweisu

LGTM!

szha · 2022-03-04T23:50:04Z

Job PR-1378-16 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-1378/16/index.html

Innixma changed the title ~~Upgrade sklearnex, default RF use_daal=True~~ Upgrade sklearnex Nov 11, 2021

Innixma requested review from gradientsky and yinweisu November 11, 2021 02:08

napetrov reviewed Nov 17, 2021

agorshk reviewed Nov 17, 2021

Innixma mentioned this pull request Dec 31, 2021

[RF] Incorrect OOB calculation post-fit uxlfoundation/scikit-learn-intelex#933

Closed

Innixma force-pushed the rf_enable_daal branch from 631f1d0 to bd23ce6 Compare December 31, 2021 03:55

Innixma added this to the 0.4 Release milestone Jan 13, 2022

Innixma mentioned this pull request Feb 2, 2022

[v0.4] Cap dependency versions prior to release. #1341

Closed

16 tasks

Innixma added 7 commits March 3, 2022 12:30

Upgrade sklearnex, default RF use_daal=True

8b5908c

disable rf daal until fix to OOB score

7f537fe

addressed comments

bc3fcf5

update to 2021.5

c1786ac

Refactored LinearRegression, speedup inference by 10x, training by 30x, enable sklearnex

fd28b8e

rebase fix

ce20140

enable daal RF by default

f481f3b

Innixma force-pushed the rf_enable_daal branch from afb91b7 to f481f3b Compare March 3, 2022 21:56

Innixma added 2 commits March 3, 2022 15:35

fix unit test

b9ab3e0

minor fix, updated good_quality to be much faster inference

f131eff

disable rf daal by default

72ddd0f

yinweisu approved these changes Mar 4, 2022

Innixma merged commit a7e7fa6 into master Mar 5, 2022

Innixma deleted the rf_enable_daal branch March 10, 2022 02:01
