Commit a05d24c

Merge branch 'main' into atqy/computer-vision

2 parents bba00ba + ddb08d4 commit a05d24c

File tree

10 files changed: +506 -94 lines changed

introduction_to_amazon_algorithms/jumpstart_image_classification/Amazon_JumpStart_Image_Classification.ipynb

Lines changed: 85 additions & 9 deletions
@@ -41,8 +41,9 @@
4141
"4. [Fine-tune the pre-trained model on a custom dataset](#4.-Fine-tune-the-pre-trained-model-on-a-custome-dataset)\n",
4242
" * [Retrieve JumpStart Training artifacts](#4.1.-Retrieve-JumpStart-Training-artifacts)\n",
4343
" * [Set Training parameters](#4.2.-Set-Training-parameters)\n",
44-
" * [Start Training](#4.3.-Start-Training)\n",
45-
" * [Deploy & run Inference on the fine-tuned model](#4.4.-Deploy-&-run-Inference-on-the-fine-tuned-model)"
44+
" * [Train with Automatic Model Tuning (HPO)](#AMT)\n",
45+
" * [Start Training](#4.4.-Start-Training)\n",
46+
" * [Deploy & run Inference on the fine-tuned model](#4.5.-Deploy-&-run-Inference-on-the-fine-tuned-model)"
4647
]
4748
},
4849
{
@@ -407,7 +408,7 @@
407408
"from sagemaker import image_uris, model_uris, script_uris, hyperparameters\n",
408409
"\n",
409410
"model_id, model_version = dropdown.value, \"*\"\n",
410-
"training_instance_type = \"ml.g4dn.xlarge\"\n",
411+
"training_instance_type = \"ml.p3.2xlarge\"\n",
411412
"\n",
412413
"# Retrieve the docker image\n",
413414
"train_image_uri = image_uris.retrieve(\n",
@@ -491,12 +492,66 @@
491492
"print(hyperparameters)"
492493
]
493494
},
495+
{
496+
"cell_type": "markdown",
497+
"id": "19c3a820",
498+
"metadata": {
499+
"collapsed": false
500+
},
501+
"source": [
502+
"### 4.3. Train with Automatic Model Tuning ([HPO](https://docs.aws.amazon.com/sagemaker/latest/dg/automatic-model-tuning.html)) <a id='AMT'></a>\n",
503+
"***\n",
504+
"Amazon SageMaker automatic model tuning, also known as hyperparameter tuning, finds the best version of a model by running many training jobs on your dataset using the algorithm and ranges of hyperparameters that you specify. It then chooses the hyperparameter values that result in a model that performs the best, as measured by a metric that you choose. We will use a [HyperparameterTuner](https://sagemaker.readthedocs.io/en/stable/api/training/tuner.html) object to interact with Amazon SageMaker hyperparameter tuning APIs.\n",
505+
"***"
506+
]
507+
},
508+
{
509+
"cell_type": "code",
510+
"execution_count": null,
511+
"id": "1684d6c6",
512+
"metadata": {
513+
"collapsed": false,
514+
"pycharm": {
515+
"name": "#%%\n"
516+
}
517+
},
518+
"outputs": [],
519+
"source": [
520+
"from sagemaker.tuner import ContinuousParameter\n",
521+
"\n",
522+
"# Use AMT for tuning and selecting the best model\n",
523+
"use_amt = True\n",
524+
"\n",
525+
"# Define objective metric per framework, based on which the best model will be selected.\n",
526+
"metric_definitions_per_model = {\n",
527+
" \"tensorflow\": {\n",
528+
" \"metrics\": [{\"Name\": \"val_accuracy\", \"Regex\": \"val_accuracy: ([0-9\\\\.]+)\"}],\n",
529+
" \"type\": \"Maximize\",\n",
530+
" },\n",
531+
" \"pytorch\": {\n",
532+
" \"metrics\": [{\"Name\": \"val_accuracy\", \"Regex\": \"val Acc: ([0-9\\\\.]+)\"}],\n",
533+
" \"type\": \"Maximize\",\n",
534+
" },\n",
535+
"}\n",
536+
"\n",
537+
"# You can select from the hyperparameters supported by the model, and configure ranges of values to be searched for training the optimal model.(https://docs.aws.amazon.com/sagemaker/latest/dg/automatic-model-tuning-define-ranges.html)\n",
538+
"hyperparameter_ranges = {\n",
539+
" \"adam-learning-rate\": ContinuousParameter(0.0001, 0.1, scaling_type=\"Logarithmic\")\n",
540+
"}\n",
541+
"\n",
542+
"# Increase the total number of training jobs run by AMT, for increased accuracy (and training time).\n",
543+
"max_jobs = 6\n",
544+
"# Change parallel training jobs run by AMT to reduce total training time, constrained by your account limits.\n",
545+
"# if max_jobs=max_parallel_jobs then Bayesian search turns to Random.\n",
546+
"max_parallel_jobs = 2"
547+
]
548+
},
494549
{
495550
"cell_type": "markdown",
496551
"id": "336871f4",
497552
"metadata": {},
498553
"source": [
499-
"### 4.3. Start Training\n",
554+
"### 4.4. Start Training\n",
500555
"***\n",
501556
"We start by creating the estimator object with all the required assets and then launch the training job.\n",
502557
"***"
@@ -511,6 +566,7 @@
511566
"source": [
512567
"from sagemaker.estimator import Estimator\n",
513568
"from sagemaker.utils import name_from_base\n",
569+
"from sagemaker.tuner import HyperparameterTuner\n",
514570
"\n",
515571
"training_job_name = name_from_base(f\"jumpstart-example-{model_id}-transfer-learning\")\n",
516572
"\n",
@@ -526,18 +582,38 @@
526582
" max_run=360000,\n",
527583
" hyperparameters=hyperparameters,\n",
528584
" output_path=s3_output_location,\n",
585+
" base_job_name=training_job_name,\n",
529586
")\n",
530587
"\n",
531-
"# Launch a SageMaker Training job by passing s3 path of the training data\n",
532-
"ic_estimator.fit({\"training\": training_dataset_s3_path}, logs=True)"
588+
"if use_amt:\n",
589+
" metric_definitions = next(\n",
590+
" value for key, value in metric_definitions_per_model.items() if model_id.startswith(key)\n",
591+
" )\n",
592+
"\n",
593+
" hp_tuner = HyperparameterTuner(\n",
594+
" ic_estimator,\n",
595+
" metric_definitions[\"metrics\"][0][\"Name\"],\n",
596+
" hyperparameter_ranges,\n",
597+
" metric_definitions[\"metrics\"],\n",
598+
" max_jobs=max_jobs,\n",
599+
" max_parallel_jobs=max_parallel_jobs,\n",
600+
" objective_type=metric_definitions[\"type\"],\n",
601+
" base_tuning_job_name=training_job_name,\n",
602+
" )\n",
603+
"\n",
604+
" # Launch a SageMaker Tuning job to search for the best hyperparameters\n",
605+
" hp_tuner.fit({\"training\": training_dataset_s3_path})\n",
606+
"else:\n",
607+
" # Launch a SageMaker Training job by passing s3 path of the training data\n",
608+
" ic_estimator.fit({\"training\": training_dataset_s3_path}, logs=True)"
533609
]
534610
},
535611
{
536612
"cell_type": "markdown",
537613
"id": "cc67f26b",
538614
"metadata": {},
539615
"source": [
540-
"## 4.4. Deploy & run Inference on the fine-tuned model\n",
616+
"## 4.5. Deploy & run Inference on the fine-tuned model\n",
541617
"***\n",
542618
"A trained model does nothing on its own. We now want to use the model to perform inference. For this example, that means predicting the class label of an image. We follow the same steps as in [3. Run inference on the pre-trained model](#3.-Run-inference-on-the-pre-trained-model). We start by retrieving the jumpstart artifacts for deploying an endpoint. However, instead of base_predictor, we deploy the `ic_estimator` that we fine-tuned.\n",
543619
"***"
@@ -569,7 +645,7 @@
569645
"endpoint_name = name_from_base(f\"jumpstart-example-FT-{model_id}-\")\n",
570646
"\n",
571647
"# Use the estimator from the previous step to deploy to a SageMaker endpoint\n",
572-
"finetuned_predictor = ic_estimator.deploy(\n",
648+
"finetuned_predictor = (hp_tuner if use_amt else ic_estimator).deploy(\n",
573649
" initial_instance_count=1,\n",
574650
" instance_type=inference_instance_type,\n",
575651
" entry_point=\"inference.py\",\n",
@@ -695,4 +771,4 @@
695771
},
696772
"nbformat": 4,
697773
"nbformat_minor": 5
698-
}
774+
}
Lines changed: 1 addition & 1 deletion
@@ -1,2 +1,2 @@
11
### SageMaker JumpStart Image classification Training & Deployment
2-
This notebook `Amazon_JumpStart_Image_Classification.ipynb` demos how to fine-tune and deploy a pre-trained image classification model using JumpStart API. It shows how to select a pre-trained image classification model from JumpStart and fine-tune it on an example dataset containing raw .jpg/.png images, while varying training hyperparameters such as learning rate, batch-size and number of epochs. Once the training is complete, the notebook shows how to host the trained model for inference. It also shows how to host the pre-trained model as-it-is without first fine-tuning it.
2+
This notebook `Amazon_JumpStart_Image_Classification.ipynb` demonstrates how to fine-tune and deploy a pre-trained image classification model using the JumpStart API. It shows how to select a pre-trained image classification model from JumpStart and fine-tune it on an example dataset containing raw .jpg/.png images, while varying training hyperparameters such as learning rate, batch size, and number of epochs. Automatic Model Tuning (AMT) is used to search for the best hyperparameters. Once training is complete, the notebook shows how to host the trained model for inference. It also shows how to host the pre-trained model as is, without first fine-tuning it.

introduction_to_amazon_algorithms/jumpstart_object_detection/Amazon_JumpStart_Object_Detection.ipynb

Lines changed: 90 additions & 29 deletions
@@ -44,8 +44,9 @@
4444
"3. [Fine-tune the pre-trained model on a custom dataset](#3.-Fine-tune-the-pre-trained-model-on-a-custom-dataset)\n",
4545
" * [Retrieve Training Artifacts](#3.1.-Retrieve-Training-Artifacts)\n",
4646
" * [Set Training parameters](#3.2.-Set-Training-parameters)\n",
47-
" * [Start Training](#3.3.-Start-Training)\n",
48-
" * [Deploy and run inference on the fine-tuned model](#3.4.-Deploy-and-run-inference-on-the-fine-tuned-model)\n"
47+
" * [Train with Automatic Model Tuning (HPO)](#AMT)\n",
48+
" * [Start Training](#3.4.-Start-Training)\n",
49+
" * [Deploy and run inference on the fine-tuned model](#3.5.-Deploy-and-run-inference-on-the-fine-tuned-model)\n"
4950
]
5051
},
5152
{
@@ -506,11 +507,11 @@
506507
"# Currently, not all the object detection models in jumpstart support finetuning. Thus, we manually select a model\n",
507508
"# which supports finetuning.\n",
508509
"train_model_id, train_model_version, train_scope = (\n",
509-
" \"mxnet-od-ssd-512-vgg16-atrous-coco\",\n",
510+
" \"mxnet-od-ssd-512-vgg16-atrous-coco\", # \"pytorch-od1-fasterrcnn-resnet50-fpn\"\n",
510511
" \"*\",\n",
511512
" \"training\",\n",
512513
")\n",
513-
"training_instance_type = \"ml.p2.xlarge\"\n",
514+
"training_instance_type = \"ml.p3.2xlarge\"\n",
514515
"\n",
515516
"# Retrieve the docker image\n",
516517
"train_image_uri = image_uris.retrieve(\n",
@@ -598,12 +599,66 @@
598599
"print(hyperparameters)"
599600
]
600601
},
602+
{
603+
"cell_type": "markdown",
604+
"id": "8108dfae",
605+
"metadata": {
606+
"collapsed": false
607+
},
608+
"source": [
609+
"### 3.3. Train with Automatic Model Tuning ([HPO](https://docs.aws.amazon.com/sagemaker/latest/dg/automatic-model-tuning.html)) <a id='AMT'></a>\n",
610+
"***\n",
611+
"Amazon SageMaker automatic model tuning, also known as hyperparameter tuning, finds the best version of a model by running many training jobs on your dataset using the algorithm and ranges of hyperparameters that you specify. It then chooses the hyperparameter values that result in a model that performs the best, as measured by a metric that you choose. We will use a [HyperparameterTuner](https://sagemaker.readthedocs.io/en/stable/api/training/tuner.html) object to interact with Amazon SageMaker hyperparameter tuning APIs.\n",
612+
"***"
613+
]
614+
},
615+
{
616+
"cell_type": "code",
617+
"execution_count": null,
618+
"id": "be0a1097",
619+
"metadata": {
620+
"collapsed": false,
621+
"pycharm": {
622+
"name": "#%%\n"
623+
}
624+
},
625+
"outputs": [],
626+
"source": [
627+
"from sagemaker.tuner import ContinuousParameter\n",
628+
"\n",
629+
"# Use AMT for tuning and selecting the best model\n",
630+
"use_amt = True\n",
631+
"\n",
632+
"# Define objective metric per framework, based on which the best model will be selected.\n",
633+
"metric_definitions_per_model = {\n",
634+
" \"mxnet\": {\n",
635+
" \"metrics\": [{\"Name\": \"val_cross_entropy\", \"Regex\": \"Val_CrossEntropy=([0-9\\\\.]+)\"}],\n",
636+
" \"type\": \"Minimize\",\n",
637+
" },\n",
638+
" \"pytorch\": {\n",
639+
" \"metrics\": [{\"Name\": \"val_loss\", \"Regex\": \"val_loss: ([0-9\\\\.]+)\"}],\n",
640+
" \"type\": \"Minimize\",\n",
641+
" },\n",
642+
"}\n",
643+
"\n",
644+
"# You can select from the hyperparameters supported by the model, and configure ranges of values to be searched for training the optimal model.(https://docs.aws.amazon.com/sagemaker/latest/dg/automatic-model-tuning-define-ranges.html)\n",
645+
"hyperparameter_ranges = {\n",
646+
" \"adam-learning-rate\": ContinuousParameter(0.0001, 0.1, scaling_type=\"Logarithmic\")\n",
647+
"}\n",
648+
"\n",
649+
"# Increase the total number of training jobs run by AMT, for increased accuracy (and training time).\n",
650+
"max_jobs = 6\n",
651+
"# Change parallel training jobs run by AMT to reduce total training time, constrained by your account limits.\n",
652+
"# if max_jobs=max_parallel_jobs then Bayesian search turns to Random.\n",
653+
"max_parallel_jobs = 2"
654+
]
655+
},
601656
{
602657
"cell_type": "markdown",
603658
"id": "70a010d7",
604659
"metadata": {},
605660
"source": [
606-
"### 3.3. Start Training"
661+
"### 3.4. Start Training"
607662
]
608663
},
609664
{
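Editor's note: AMT finds these metrics by applying each Regex to the training job's log output and recording capture group 1 as the metric value; with objective type "Minimize", the job with the lowest value wins. A minimal illustration follows; the sample log line is an assumed format implied by the regex, not actual output from this container.

import re

regex = r"Val_CrossEntropy=([0-9\.]+)"
sample_log_line = "Epoch[3] Val_CrossEntropy=0.4821"  # hypothetical log line for illustration
match = re.search(regex, sample_log_line)
if match:
    # This is the value AMT would record as the objective metric for the job.
    print("Captured objective value:", float(match.group(1)))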
@@ -626,6 +681,7 @@
626681
"source": [
627682
"from sagemaker.estimator import Estimator\n",
628683
"from sagemaker.utils import name_from_base\n",
684+
"from sagemaker.tuner import HyperparameterTuner\n",
629685
"\n",
630686
"training_job_name = name_from_base(f\"jumpstart-example-{train_model_id}-transfer-learning\")\n",
631687
"\n",
@@ -641,26 +697,40 @@
641697
" max_run=360000,\n",
642698
" hyperparameters=hyperparameters,\n",
643699
" output_path=s3_output_location,\n",
644-
")"
645-
]
646-
},
647-
{
648-
"cell_type": "code",
649-
"execution_count": null,
650-
"id": "540de4ae",
651-
"metadata": {},
652-
"outputs": [],
653-
"source": [
654-
"# Launch a SageMaker Training job by passing s3 path of the training data\n",
655-
"od_estimator.fit({\"training\": training_dataset_s3_path}, logs=True, job_name=training_job_name)"
700+
" base_job_name=training_job_name,\n",
701+
")\n",
702+
"\n",
703+
"if use_amt:\n",
704+
" metric_definitions = next(\n",
705+
" value\n",
706+
" for key, value in metric_definitions_per_model.items()\n",
707+
" if train_model_id.startswith(key)\n",
708+
" )\n",
709+
"\n",
710+
" hp_tuner = HyperparameterTuner(\n",
711+
" od_estimator,\n",
712+
" metric_definitions[\"metrics\"][0][\"Name\"],\n",
713+
" hyperparameter_ranges,\n",
714+
" metric_definitions[\"metrics\"],\n",
715+
" max_jobs=max_jobs,\n",
716+
" max_parallel_jobs=max_parallel_jobs,\n",
717+
" objective_type=metric_definitions[\"type\"],\n",
718+
" base_tuning_job_name=training_job_name,\n",
719+
" )\n",
720+
"\n",
721+
" # Launch a SageMaker Tuning job to search for the best hyperparameters\n",
722+
" hp_tuner.fit({\"training\": training_dataset_s3_path})\n",
723+
"else:\n",
724+
" # Launch a SageMaker Training job by passing s3 path of the training data\n",
725+
" od_estimator.fit({\"training\": training_dataset_s3_path}, logs=True)"
656726
]
657727
},
658728
{
659729
"cell_type": "markdown",
660730
"id": "99a147f2",
661731
"metadata": {},
662732
"source": [
663-
"### 3.4. Deploy and run inference on the fine-tuned model\n",
733+
"### 3.5. Deploy and run inference on the fine-tuned model\n",
664734
"\n",
665735
"---\n",
666736
"\n",
@@ -695,7 +765,7 @@
695765
"endpoint_name = name_from_base(f\"jumpstart-example-FT-{train_model_id}-\")\n",
696766
"\n",
697767
"# Use the estimator from the previous step to deploy to a SageMaker endpoint\n",
698-
"finetuned_predictor = od_estimator.deploy(\n",
768+
"finetuned_predictor = (hp_tuner if use_amt else od_estimator).deploy(\n",
699769
" initial_instance_count=1,\n",
700770
" instance_type=inference_instance_type,\n",
701771
" entry_point=\"inference.py\", # entry point file in source_dir and present in deploy_source_uri\n",
@@ -800,17 +870,8 @@
800870
"nbconvert_exporter": "python",
801871
"pygments_lexer": "ipython3",
802872
"version": "3.6.13"
803-
},
804-
"pycharm": {
805-
"stem_cell": {
806-
"cell_type": "raw",
807-
"metadata": {
808-
"collapsed": false
809-
},
810-
"source": []
811-
}
812873
}
813874
},
814875
"nbformat": 4,
815876
"nbformat_minor": 5
816-
}
877+
}
