Skip to content

Commit 55ab179

Browse files
committed
Updated to validate children in newClusterCenters based on comments
1 parent f77904d commit 55ab179

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

mllib/src/main/scala/org/apache/spark/mllib/clustering/BisectingKMeans.scala

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -339,8 +339,9 @@ private object BisectingKMeans extends Serializable {
339339
assignments.map { case (index, v) =>
340340
if (divisibleIndices.contains(index)) {
341341
val children = Seq(leftChildIndex(index), rightChildIndex(index))
342-
if (children.length > 0) {
343-
val selected = children.minBy { child =>
342+
val newClusterChildren = children.filter(newClusterCenters.contains(_))
343+
if (newClusterChildren.nonEmpty) {
344+
val selected = newClusterChildren.minBy { child =>
344345
KMeans.fastSquaredDistance(newClusterCenters(child), v)
345346
}
346347
(selected, v)

0 commit comments

Comments
 (0)