-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-5922][GraphX]: Add diff(other: RDD[VertexId, VD]) in VertexRDD #4733
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…'leftJoin' from VertexRDD[VD] to RDD[(VertexId, VD)]
|
/cc @ankurdave |
|
Can one of the admins verify this patch? |
|
ok to test |
|
Test build #27863 has started for PR 4733 at commit
|
|
Test build #27863 has finished for PR 4733 at commit
|
|
Test FAILed. |
|
@srowen is there any way to re-run this test without the MiMa tests?? The entire point of this PR is to change the public methods even though they wouldn't affect any calling implementations and I'm, honestly, unsure if that means I need to represent that in any way / shape / form in the code. Thoughts? |
|
It's a public API, right? I don't see that it or the class is PS to answer the question, you can't disable the MiMa checks but would instead add to your PR some new MiMa excludes to tell it this is OK, if you did have to override MiMa. |
|
Though this is source-compatible, MiMa is correct that it breaks binary compatibility. It might be best to add the more general version as an overload without deprecating the more specific version. The code duplication will be small and I think it's worth it to preserve binary compatibility. |
|
Test build #27898 has started for PR 4733 at commit
|
|
Test build #27898 has finished for PR 4733 at commit
|
|
Test FAILed. |
|
retest this please |
|
Test build #27910 has started for PR 4733 at commit
|
|
Test build #27910 has finished for PR 4733 at commit
|
|
Test FAILed. |
|
Any idea why its failing now? Issue is: I would have assumed MiMa would be okay with this since its merely adding a new method, but I might be wrong. A series of Google searches and diving into MiMa isn't helping either. Should I just add the MiMa exclude? That feels wrong in this case. |
|
@brennonyork it has a point; you've added a new |
|
@srowen thanks for that, completely missed that! If I add the MiMa exclude in this case to resolve this issue then shouldn't we deprecate the |
|
Test build #27924 has started for PR 4733 at commit
|
|
Test build #27924 has finished for PR 4733 at commit
|
|
Test PASSed. |
|
The VertexRDD interface is not intended to be implemented by users. I just noticed this isn't explicitly documented, but since VertexRDD references ShippableVertexPartition, which is private[graphx], users should in any case get a compile error when they try to implement it. So LGTM aside from a minor comment: I don't think it makes sense to deprecate the old version, because users don't have the option of switching to the new version unless they add the noise for widening the argument type. |
|
Roger, that makes good sense! Thanks for the update @ankurdave. Since the patch isn't deprecating anything I'd say we're good to merge in then? Just let me know if anything else needs done! |
|
@brennonyork Looks like #4705 introduced a minor merge conflict, so it would be great if you could merge that in. It seems best to use the definition of diff(VertexRDD[VD]) from there, since I just noticed that this PR does some unnecessary work -- calling VertexRDD(other) will always rebuild the vertex partitions even if that's not required. Also, it would be good to remove the remaining comment about deprecation. Thanks! |
|
Test FAILed. |
|
jenkins, retest this please |
|
Test build #28596 has started for PR 4733 at commit
|
|
Test build #28596 has finished for PR 4733 at commit
|
|
Test PASSed. |
|
Test build #28600 has started for PR 4733 at commit
|
|
Test build #28600 has finished for PR 4733 at commit
|
|
Test PASSed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: keep this with other Spark imports
|
Test build #28610 has started for PR 4733 at commit
|
|
Test build #28610 has finished for PR 4733 at commit
|
|
Test PASSed. |
|
Test build #28616 has started for PR 4733 at commit
|
|
Test build #28616 has finished for PR 4733 at commit
|
|
Test FAILed. |
|
jenkins, retest this please |
|
Test build #28620 has started for PR 4733 at commit
|
|
Test build #28620 has finished for PR 4733 at commit
|
|
Test FAILed. |
|
Test build #28631 has started for PR 4733 at commit
|
|
Test build #28631 has finished for PR 4733 at commit
|
|
Test PASSed. |
|
LGTM, thanks @brennonyork! Merging into master. |
Changed method invocation of 'diff' to match that of 'innerJoin' and 'leftJoin' from VertexRDD[VD] to RDD[(VertexId, VD)]. This change maintains backwards compatibility and better unifies the VertexRDD methods to match each other.