-
Notifications
You must be signed in to change notification settings - Fork 228
Update to latest scikit-learn release for deprecation and compatibility #53
Comments
I've been working on this and almost have a PR ready. It will be out this upcoming Monday. |
Thank you for the quick attention. Is anything required of me? I see I was CCed on the related issue, but it looks like that was just for info. |
An update making spark-sklearn compatible with sklearn version >= 0.18.1 has been merged. |
I'm just about to adopt this update. Can you mark a new release in github and and update the version in PyPi? I currently rely on pip for the installs in my environments. I was hoping not to have to change to git just for this package. |
@dsackin Did you end up doing the git install? I'm also running into version issues when installing through pip. |
No. I haven't updated yet. I was hoping they would push it into PyPi before I switched to git install. |
Got it. Just an fyi, ended up doing the git install, and it worked. |
Can you please let us know when a new release be marked and push to PyPi would happen. |
@gordontsai @dsackin I am quite new to git install...can you tell me how to perform git install while we wait this to be pushed pypi |
@thunterdb this is more about what it might take to support 0.20. We have a related issue about not setting things like best_params_ at #73, which seems like an easy fix but the simple fix doesn't run. This PR might also contain some of the necessary changes: #74 . This much I haven't looked into yet. |
I see, this is more than pointing to the right package. The 0.20 release is less than 2 months old, so let us focus on the 0.1x releases until there is a more general need for that. What are your thoughts? |
Yeah, certainly more concerned this second with a new release to fix some bugs, and maybe get random search in. If you have a sec to look at #73 you might know the quick answer; that might also be a quick fix relevant to 0.19 |
Using the current head 0.2.0 release of spark-sklearn and the current release of scikit-learn (0.18.1), I'm getting the following deprecation warning:
/.../python3.4/site-packages/sklearn/cross_validation.py:44: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20.
"This module will be removed in 0.20.", DeprecationWarning)
the library needs to be updated to use the new model_selection module and iterator interfaces.
In addition, due to changes in sklearn.model_selection.GridSearchCV, the attributes available on the fitted spark-sklearn.GridSearchCV are out of date.
sklearn.model_selection.GridSearchCV now has:
While spark-sklearn.GridSearchCV has:
The most critical difference is that sklearn added the more comprehensive cv_results_ which adds data that the formerly compatible grid_scores_ is lacking.
The text was updated successfully, but these errors were encountered: