Skip to content

Params from parent java estimators aren't copied to python mmlspark models #582

@kschelonka

Description

@kschelonka

The java params for mmlspark estimators like LightGBMClassifier, etc. aren't copied over to the python instances.

This is related to this Jira ticket: PySpark ML Models should contain Param values

A temporary fix was added so that the params can be accessed using getOrDefault method. This does make it possible to pull in the params from mmlspark models, like LightGBMClassifier:
Screen Shot 2019-06-07 at 9 20 27 AM

Spark developers are planning on incrementally updating the pyspark API to use the appropriate getter and setter methods, and having the pyspark models define the params within themselves (see SPARK-21812). For example, CountVectorizer was updated in this fashion.

Since it's very useful to be able to access model parameters, I propose updating mmlspark models in a similar fashion. Happy to contribute to this effort.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions