lazyman500
Owner

Use globStatus to retrieve all FileStatus objects under the given path(s), and then do the filtering locally.
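
A minimal sketch of that approach, not the code from this pull request: a single `globStatus` call expands the pattern into `FileStatus` objects, and the predicate is then applied on the client. The object name `GlobThenFilter`, the helper `listMatching`, and its parameters are hypothetical, and the Hadoop client library is assumed to be on the classpath.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileStatus, Path}

object GlobThenFilter {
  // Hypothetical helper: expand `pattern` with one globStatus call,
  // then apply `keep` locally instead of passing a filter to the file system.
  def listMatching(pattern: String, conf: Configuration)(keep: FileStatus => Boolean): Seq[FileStatus] = {
    val path = new Path(pattern)
    val fs = path.getFileSystem(conf)
    // globStatus can return null when a non-glob path does not exist,
    // so treat null as "no matches" rather than failing.
    val statuses = Option(fs.globStatus(path)).getOrElse(Array.empty[FileStatus])
    statuses.toSeq.filter(keep)
  }
}
```

A caller could then write, for example, `GlobThenFilter.listMatching(pattern, new Configuration())(s => s.isFile && !s.getPath.getName.startsWith("_"))` to keep only data files, where `pattern` is whatever glob covers the paths of interest.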

lazyman500 added a commit that referenced this pull request Mar 16, 2015
SPARK-5068: fix bug when partition path doesn't exist
@lazyman500 lazyman500 merged commit 13b59cb into master Mar 16, 2015
lazyman500 pushed a commit that referenced this pull request Apr 3, 2015
…ypes (WIP)

Make FPGrowth.run API take generic item types:
`def run[Item: ClassTag, Basket <: Iterable[Item]](data: RDD[Basket]): FPGrowthModel[Item]`
so that users can invoke it as run[String, Seq[String]], run[Int, Seq[Int]], run[Int, List[Int]], etc.
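
To illustrate how a signature of this shape is called, here is a small self-contained sketch. It is not the FPGrowth implementation: it uses a plain `Seq` in place of `RDD`, and the names `GenericRunSketch` and `ItemCounts` are hypothetical stand-ins that only count how many baskets contain each item.

```scala
import scala.reflect.ClassTag

object GenericRunSketch {
  // Hypothetical stand-in for FPGrowthModel[Item]: per-item basket counts
  // plus the distinct items as an Array, which is what needs the ClassTag.
  final case class ItemCounts[Item](counts: Map[Item, Long], items: Array[Item])

  // Same type-parameter shape as the signature above, over Seq instead of RDD.
  def run[Item: ClassTag, Basket <: Iterable[Item]](data: Seq[Basket]): ItemCounts[Item] = {
    // Count each item at most once per basket.
    val counts: Map[Item, Long] = data
      .flatMap(basket => basket.toSeq.distinct)
      .groupBy(identity)
      .map { case (item, occurrences) => item -> occurrences.size.toLong }
    // Building an Array[Item] requires the implicit ClassTag[Item].
    val items: Array[Item] = counts.keys.toArray
    ItemCounts(counts, items)
  }
}
```

With this sketch, `GenericRunSketch.run[String, Seq[String]](Seq(Seq("a", "b"), Seq("a")))` and `GenericRunSketch.run[Int, List[Int]](Seq(List(1, 2), List(2, 3)))` both compile; the type arguments are written out explicitly, matching the invocation style described above.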

The Scala part is done; the Java part is still in progress.

Author: Jacky Li <[email protected]>
Author: Jacky Li <[email protected]>
Author: Xiangrui Meng <[email protected]>

Closes apache#4340 from jackylk/SPARK-5520-WIP and squashes the following commits:

f5acf84 [Jacky Li] Merge pull request #2 from mengxr/SPARK-5520
63073d0 [Xiangrui Meng] update to make generic FPGrowth Java-friendly
737d8bb [Jacky Li] fix scalastyle
793f85c [Jacky Li] add Java test case
7783351 [Jacky Li] add generic support in FPGrowth

(cherry picked from commit e380d2d)
Signed-off-by: Xiangrui Meng <[email protected]>