Skip to content

Releases: aws/sagemaker-pytorch-training-toolkit

v2.5.0

08 Jul 23:25
Compare
Choose a tag to compare

Features

  • Add Heterogeneous Cluster support

v2.4.1

08 Jul 22:37
Compare
Choose a tag to compare

Bug Fixes and Other Changes

  • CI

v2.4.0

11 Dec 03:59
Compare
Choose a tag to compare

Features

  • add data parallelism support (#11) (#12)

Bug Fixes and Other Changes

  • use ubuntu 18.04 base image in dlc gpu image
  • remove TODOs in 1.6.0 dlc gpu dockerfile and reduce parameters for data parallel integ test
  • use base cuda 11 image for test dlc gpu image
  • use 1.6.0 for gpu tests and disable horovod tests
  • remove local data parallel integ test
  • use sagemaker-training 3.7.0 and enable data parallel integ tests
  • patch socket call and update flake8 violations

v2.3.0

31 Aug 15:22
Compare
Choose a tag to compare

Features

  • Use MPIRunnerType

Bug Fixes and Other Changes

  • Update main buildspec to only perform CPU integration tests
  • Add GPU and unit test buildspecs
  • Pin SageMaker version to less than v2

Documentation Changes

  • improve training.py doc style

v2.2.1.post2

25 Jun 15:20
Compare
Choose a tag to compare

Testing and Release Infrastructure

  • add issue templates

v2.2.1.post1

16 Jun 20:34
Compare
Choose a tag to compare

Documentation Changes

  • remove confusing information from the Readme.

Testing and Release Infrastructure

  • do not duplicate test dependencies in tox.ini
  • Rename buildspec files.

v2.2.1.post0

05 Jun 00:35
Compare
Choose a tag to compare

Testing and Release Infrastructure

  • Make docker folder read only, remove unused tests, rename test-toolkit/ -> test/.

v2.2.1

12 May 15:21
Compare
Choose a tag to compare

Bug Fixes and Other Changes

  • Bump version of sagemaker-training for typing fix

v2.2.0

07 May 15:21
Compare
Choose a tag to compare

Features

  • add Python 3.7 support

v2.1.1

05 May 15:20
Compare
Choose a tag to compare

Bug Fixes and Other Changes

  • Pin Smdebug to the latest version (0.7.2)