Skip to content

Conversation

@holdenk
Copy link
Contributor

@holdenk holdenk commented Sep 23, 2015

Similar to SPARK-10630 it would be nice if Java users didn't have to parallelize there data explicitly (as Scala users already can skip). Issue came up in http://stackoverflow.com/questions/32613413/apache-spark-machine-learning-cant-get-estimator-example-to-work

@SparkQA
Copy link

SparkQA commented Sep 23, 2015

Test build #42893 has finished for PR 8879 at commit decf0c5.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@holdenk holdenk changed the title [SPARK-10720][SQL][JAVA][WIP] Add a java wrapper to create a dataframe from a local list of java beans [SPARK-10720][SQL][JAVA] Add a java wrapper to create a dataframe from a local list of java beans Sep 23, 2015
@SparkQA
Copy link

SparkQA commented Sep 23, 2015

Test build #42937 has finished for PR 8879 at commit 73f0537.

  • This patch fails MiMa tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@holdenk
Copy link
Contributor Author

holdenk commented Sep 24, 2015

I think thats spurious, jenkins retest this please.

@SparkQA
Copy link

SparkQA commented Sep 24, 2015

Test build #42943 has finished for PR 8879 at commit 73f0537.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can any of this be refactored to share implementation with the RDD version? there's some non-trivial duplication here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Perhaps, I could take the code to work on Iterators of Rows and move that to a shared function they could both use (along with have it pass in bean info since in local mode we don't need to construct the class by name it already exists). I could also just prallelize it and call the RDD implementation, but I figured it would be better to use the LocalRelation as we did with the List of Rows.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Up to your judgment since you know the details better of course. If it's not worth the overhead to factor this out then leave it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll take another look at it with fresh eyes tomorrow.

…is the correct place for this (the other functions in SQLContext are all related to the singleton, but it seemed like the right place. I could put it in a Utils object or similar if this isn't the best place for it).
@holdenk
Copy link
Contributor Author

holdenk commented Sep 25, 2015

@srowen so I refactored the shared code, but for serialization reasons I put it in the companion object. Let me know if this looks good to you :)

@SparkQA
Copy link

SparkQA commented Sep 26, 2015

Test build #43037 has finished for PR 8879 at commit 8a3c124.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen
Copy link
Member

srowen commented Sep 26, 2015

Yeah looks good to me. Let me hold it open a day or so to see anyone has comments

@holdenk
Copy link
Contributor Author

holdenk commented Sep 27, 2015

@srowen just pinging since its been a few days and no extra comments have come in :)

@srowen
Copy link
Member

srowen commented Sep 27, 2015

Fair enough yeah I'm back online merging things now. Time for it.

@srowen
Copy link
Member

srowen commented Sep 27, 2015

Merged to master

@asfgit asfgit closed this in 8ecba3e Sep 27, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants