-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-5769] Set params in constructors and in setParams in Python ML pipelines #4564
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #27347 has started for PR 4564 at commit
|
|
Test build #27347 has finished for PR 4564 at commit
|
|
Test FAILed. |
|
Test build #27348 has started for PR 4564 at commit
|
|
Test build #27348 has finished for PR 4564 at commit
|
|
Test PASSed. |
|
Test build #27424 has started for PR 4564 at commit
|
|
Test build #27424 has finished for PR 4564 at commit
|
|
Test FAILed. |
|
@davies I updated the constructor doc signature. Auto-completion works in both iPython and IntelliJ now. The only issue is that the |
|
Test build #27425 has started for PR 4564 at commit
|
python/pyspark/ml/classification.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wait for #4557, then you could user new API for convert RDD into DataFrame:
dataset = rdd.toDF()
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
#4557 is closed, so you change update the tests (examples) to use new API.
|
Test build #27425 has finished for PR 4564 at commit
|
|
Test PASSed. |
|
Test build #27530 has started for PR 4564 at commit
|
|
Test build #27530 has finished for PR 4564 at commit
|
|
Test PASSed. |
|
LGTM, thanks! |
… pipelines This PR allow Python users to set params in constructors and in setParams, where we use decorator `keyword_only` to force keyword arguments. The trade-off is discussed in the design doc of SPARK-4586. Generated doc:  CC: davies rxin Author: Xiangrui Meng <[email protected]> Closes #4564 from mengxr/py-pipeline-kw and squashes the following commits: fedf720 [Xiangrui Meng] use toDF d565f2c [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into py-pipeline-kw cbc15d3 [Xiangrui Meng] fix style 5032097 [Xiangrui Meng] update pipeline signature 950774e [Xiangrui Meng] simplify keyword_only and update constructor/setParams signatures fdde5fc [Xiangrui Meng] fix style c9384b8 [Xiangrui Meng] fix sphinx doc 8e59180 [Xiangrui Meng] add setParams and make constructors take params, where we force keyword args (cherry picked from commit cd4a153) Signed-off-by: Xiangrui Meng <[email protected]>
|
Merged into master and branch-1.3. |
This PR allow Python users to set params in constructors and in setParams, where we use decorator
keyword_onlyto force keyword arguments. The trade-off is discussed in the design doc of SPARK-4586.Generated doc:

CC: @davies @rxin