
Conversation

@zer0n (Contributor) commented Jul 11, 2018

No description provided.

@zer0n (Contributor Author) left a comment:

Why don't I see any deletion?

return window


def nab_score(y_true, y_pred):
@zer0n (Contributor Author):

Let's not call it nab score. It's a fairly standard metric.

Collaborator:

I haven't deleted the earlier evaluation (mAP) module. For the new metric, I have created an additional file. I'll change the name from nab_score to weighted_EarlyDetection_score.

@zer0n self-assigned this Jul 18, 2018
dtseng and others added 2 commits July 19, 2018 17:14
@zer0n (Contributor Author) left a comment:

@satyanshukla Please update this

@param window_scale_limit (float): The largest factor by which the windows may be expanded. For example, for
                                    window_scale_limit=2, the new windows will be at most
                                    2 * (current window size).
@param goal_sparsity (int): The goal sparsity of the window after increasing the window size.
@zer0n (Contributor Author):

max_sparsity would be more precise

Collaborator:

Done.

"""

for i, window in enumerate(windows):
    if index <= window[1] and index >= window[0]:
@zer0n (Contributor Author):

window[0] <= index and index <= window[1] is easier to read.

Collaborator:

Done.

if index <= window[1] and index >= window[0]:
    return window
elif index > window[1] and index < windows[i+1][0]:
    return window
@zer0n (Contributor Author):

You return window in both cases? Don't you want to return None here?

@zer0n (Contributor Author):

What do you return at the end of the loop?

Collaborator:

  1. I had these two cases for clarity; I have combined them into one.
  2. The loop will never reach the end; those cases have already been taken care of before this function is called.
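
A minimal sketch of how the combined check could read, keeping the names from the snippet above and making the caller-side assumption explicit:

def getCorrespondingWindow(index, windows):
    # Assumes the caller has already handled points that fall before the
    # first window or after the last one, so a window is always found.
    for i, window in enumerate(windows):
        in_window = window[0] <= index <= window[1]
        in_gap_after_window = index > window[1] and (
            i + 1 == len(windows) or index < windows[i + 1][0])
        if in_window or in_gap_after_window:
            return window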

import numpy as np

def scaledSigmoid(relativePositionInWindow):
    if relativePositionInWindow > 3.0:
@zer0n (Contributor Author):

Will it ever happen? The condition for this function to be called is that the prediction point is properly within the window, right?

Collaborator:

Yes, if the point is predicted right outside the window, the scaled sigmoid is used to calculate its score.
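
For context, NAB's scaled sigmoid is sigma(y) = 2 / (1 + e^(5y)) - 1, where y is the detection's position relative to the window (negative inside the window, 0 at its right edge, positive past it). A minimal sketch assuming that form, with the 3.0 cutoff from the snippet above:

import numpy as np

def scaledSigmoid(relativePositionInWindow):
    # Detections far past the window get the maximum penalty.
    if relativePositionInWindow > 3.0:
        return -1.0
    # Inside the window (negative positions) the value approaches +1 for
    # earlier detections, is 0 at the right edge, and decays toward -1
    # for points just outside the window.
    return 2.0 / (1.0 + np.exp(5.0 * relativePositionInWindow)) - 1.0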

sparsity = sum(y_true)/float(len(y_true))
label_windows = add_buffer_to_label(sparsity, label_windows, 0, len(y_true))

detection_info = {}
@zer0n (Contributor Author):

Use a set instead.
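
For instance, a set of already-detected windows could look like this (a sketch only; it assumes the windows are hashable, e.g. (start, end) tuples):

detected_windows = set()   # replaces detection_info = {}

def first_detection_in(window, detected_windows):
    # True only the first time a prediction falls in this window, so
    # later points in the same window do not add to the score again.
    if window in detected_windows:
        return False
    detected_windows.add(window)
    return True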

tp_score = 0
fp_score = 0
fn_score = 0
for i in range(len(y_pred)):
@zer0n (Contributor Author):

Is i the index of the time series? If so, I would name it t to be clearer.

Collaborator:

Done.

if i < label_windows[0][0]:
    fp_score += -1.0*fp_weight
elif i > label_windows[-1][1]:
    position = abs(label_windows[-1][1]-i)/float(label_windows[-1][1]-label_windows[-1][0])
@zer0n (Contributor Author):

Based on the latest discussion, should this be an fp_weight score?

Collaborator:

Yeah, you cannot give the same weight to fp_weight and fn_weight. For the standard profile, NAB suggests tp_weight = 1.0, fp_weight = 0.11, and fn_weight = 1.0. This compensates for the fact that it is okay to have a couple of false positives if the algorithm catches some true positives.
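
A quick illustration with the weights quoted above (ignoring the sigmoid position scaling, which would make the true-positive term slightly smaller than 1):

tp_weight, fp_weight, fn_weight = 1.0, 0.11, 1.0   # NAB standard profile

# One detected window plus three stray false alarms still scores well,
# while a single missed window cancels a detected one.
score_with_noise = 1 * tp_weight - 3 * fp_weight   # ~0.67
score_with_miss = 1 * tp_weight - 1 * fn_weight    # 0.0
print(score_with_noise, score_with_miss)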


else:
    cWindow = getCorrespondingWindow(i, label_windows)
    if i <= cWindow[1] and i >= cWindow[0] and detection_info[cWindow] == 0:
@zer0n (Contributor Author):

If there are several points falling in the same window, we should not overcount them.

A cleaner implementation would be (see the sketch after this list):

1. Go through the windows.
2. For each window:
   - Find the first prediction index in that window (from window[0] to window[1]).
   - Change the subsequent 1's in the predictions, within the window range, to 0.
   - Compute the window's score; if no anomaly prediction is found, use the false negative weight.
   - Return the window's score.
3. The remaining prediction points are false positives. Add those to the total score.
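
A minimal sketch of that flow (the function name, signature, and the linear earliness factor are placeholders; the metric in this PR scores the first detection with the scaled sigmoid instead):

import numpy as np

def window_first_score(label_windows, y_pred,
                       tp_weight=1.0, fp_weight=0.11, fn_weight=1.0):
    # label_windows: list of inclusive (start, end) index pairs.
    # y_pred: 0/1 array of predictions over the whole series.
    y_pred = np.asarray(y_pred).copy()
    score = 0.0
    for start, end in label_windows:
        hits = np.flatnonzero(y_pred[start:end + 1])
        # Clear every prediction inside the window so whatever remains
        # afterwards is a genuine false positive.
        y_pred[start:end + 1] = 0
        if hits.size == 0:
            score -= fn_weight                      # window missed entirely
        else:
            earliness = 1.0 - hits[0] / float(end - start + 1)
            score += tp_weight * earliness          # only the first hit counts
    # Whatever is still flagged lies outside every labeled window.
    score -= fp_weight * int(y_pred.sum())
    return score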

Collaborator:

For false positives we also need to find the preceding window. If several points fall in the same window, that is handled by the detection_info dictionary: once a window has been detected, its entry turns to 1 and subsequent points falling in that window do not contribute to the score.

@zer0n (Contributor Author):

It's unclear whether you're explaining that the existing code already handles this or whether you're going to submit an update commit.
