10BC0 feature: add inferentia pytorch inference container config by ashishgupta023 · Pull Request #1915 · aws/sagemaker-python-sdk · GitHub
[go: up one dir, main page]

Skip to content

Conversation

@ashishgupta023
Copy link
Contributor
@ashishgupta023 ashishgupta023 commented Sep 22, 2020

Issue #, if available:

Description of changes: add inferentia pytorch inference container config

Testing done:

Merge Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.

General

  • I have read the CONTRIBUTING doc
  • I used the commit message format described in CONTRIBUTING
  • I have passed the region in to all S3 and STS clients that I've initialized as part of this change.
  • I have updated any necessary documentation, including READMEs and API docs (if appropriate)

Tests

  • I have added tests that prove my fix is effective or that my feature works (if appropriate)
  • I have checked that my tests are not configured for a specific region or account (if appropriate)
  • I have used unique_name_from_base to create resource names in integ tests (if appropriate)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@ashishgupta023 ashishgupta023 changed the title add inferentia pytorch config add inferentia pytorch inference container config Sep 22, 2020
@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-notebook-tests
  • Commit ID: 17ac4b1
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
< 8000 div class="ml-n3 timeline-comment unminimized-comment comment previewable-edit js-task-list-container js-comment timeline-comment--caret" data-body-version="10cb8abfdeacd90b9ea4ec3b79e1f3d890ebbbc64fd5bcb36afd80a3769f2944">
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-local-mode-tests
  • Commit ID: 17ac4b1
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-unit-tests
  • Commit ID: 17ac4b1
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-unit-tests
  • Commit ID: e608acf
  • Result: FAILED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-unit-tests
  • Commit ID: 710d4e9
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-notebook-tests
  • Commit ID: e608acf
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-local-mode-tests
  • Commit ID: e608acf
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator
8000 sagemaker-bot commented Sep 22, 2020

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-notebook-tests
  • Commit ID: 710d4e9
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-local-mode-tests
  • Commit ID: 710d4e9
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@sagemaker-bot
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: sagemaker-python-sdk-pr
  • Commit ID: 710d4e9
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@chuyang-deng chuyang-deng changed the title add inferentia pytorch inference container config feature: add inferentia pytorch inference container config Sep 23, 2020
@chuyang-deng chuyang-deng merged commit 1900524 into aws:master Sep 23, 2020
guoqiao1992 pushed a commit to guoqiao1992/sagemaker-python-sdk that referenced this pull request Oct 26, 2020
* add inferentia pytorch config

* add test

* fix styling

Co-authored-by: Ashish Gupta <guas@amazon.com>
rsareddy0329 added a commit that referenced this pull request Dec 3, 2025
* sagemaker-core update, port sft-trainer and ai registry (#1895)

* sagemaker core update with MC service.json, SFT notebook flow succeeded before modelpackage

* remove workaround, modelpackage issue persists

* remove ai registry port

* feat: AIRegistry SDK Implementation (#1893)

* remove sft trainer port, fix ModelPackage issue (#1901)

* Add/port finetuning interfaces to v3 (#1906)

* Add/port finetuning interfaces to v3

* Add/port finetuning interfaces to v3

---------

Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>

* ported evaluation from v2->v3. pending example notebooks results (#1902)

* v2->v3 porting for eval

* bring back deleted needed files

* unit tests

---------

Co-authored-by: Mohamed Zeidan <zeidmo@amazon.com>

* Mc deployment (#1909)

* Updates for Model Customization

* Model Builder Base change

* Fix

* Support for Model Customization

* New Model Builder changes with Model Customization

* Fix

* Add base_trainer, addition to PR#1906 (#1910)

* Add/port finetuning interfaces to v3

* Add/port finetuning interfaces to v3

* Add base_trainer, addition to PR#1906

---------

Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>

* AIRegistry Integration Tests (#1908)

* feat: Trainer wait with MLFlow metrics (#1907)

* feat: update eval integs (#1912)

Co-authored-by: Mufaddal Rohawala <mufi@amazon.com>

* Update get_all method for Evaluator and Dataset (#1913)

* Update get_all method for Evaluator and Dataset

- Update Dataset.get_all() and added refresh()
- Update Evaluator.get_all() and added refresh()
- Added more tags in condition when importing hub content

* Update dataset, evaluators and examples

* remove hub_content_name

* port bedrock model builder and add integ test (#1911)

* port bedrock model builder and add integ test

* Add unit test and notebook, port new version of bedrock model builder

* add docstring

* Mc deployment integ tests and example notebook (#1914)

* Updates for Model Customization

* Model Builder Base change

* Fix

* Support for Model Customization

* New Model Builder changes with Model Customization

* Fix

* Deployment Integ tests

* Tests

* Deployment NOtebook

* fix

* Update model hub name and encoding method (#1918)

* Update get_all method for Evaluator and Dataset

- Update Dataset.get_all() and added refresh()
- Update Evaluator.get_all() and added refresh()
- Added more tags in condition when importing hub content

* Update dataset, evaluators and examples

* remove hub_content_name

* Update hub name and hash

- Update model hub name to be AIRegistry-{accountId}-{region}
- Update hub name hash

* Add telemetry for model customization (#1920)

* add telemetry to model customization

* add telemetry to model customization 2

* small fix

* INteg tests for nova bedrock  (#1926)

* Bedrock Nova

* Unit tests for bedrock nova

* accept eula always (#1928)

Co-authored-by: Mohamed Zeidan <zeidmo@amazon.com>

* Add studio tags to trainer (#1927)

* Update get_all method for Evaluator and Dataset

- Update Dataset.get_all() and added refresh()
- Update Evaluator.get_all() and added refresh()
- Added more tags in condition when importing hub content

* Update dataset, evaluators and examples

* remove hub_content_name

* Update hub name and hash

- Update model hub name to be AIRegistry-{accountId}-{region}
- Update hub name hash

* Add studio tags to trainer

Tested nu checking training job creation inputs and unit tests

---------

Co-authored-by: rsareddy0329 <rsareddy0329@gmail.com>

* Finetuning classes: add accept_eula support & Add integ tests for sft, rlvr,rlaif, dpo trainers (#1915)

* Add/port finetuning interfaces to v3

* Add/port finetuning interfaces to v3

* Add base_trainer, addition to PR#1906

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Finetuning classes: Add accept_eula support

* Finetuning classes: Add accept_eula support

---------

Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>

* feat: pipeline name fix + mlflow arn fix + jinja dep (#1931)

* changes for pipeline naming

* feat: pipeline name fix + mlflow arn fix + jinja dep

---------

Co-authored-by: Mufaddal Rohawala <mufi@amazon.com>

* AIRegistry: Add Support to user provided session and role (#1932)

* Add/port finetuning interfaces to v3

* Add/port finetuning interfaces to v3

* Add base_trainer, addition to PR#1906

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Finetuning classes: Add accept_eula support

* Finetuning classes: Add accept_eula support

* AIRegistry: Support user provided session and role

* AIRegistry: Support user provided session and role

* AIRegistry: Support user provided session and role

---------

Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>

* format name for ai reg, stripped name of =, added domain-id tags for … (#1930)

* format name for ai reg, stripped name of =, added domain-id tags for datasets and evaluators, prefix changed t use full path including filename

* pulling metadata path since get_current_domain_id didnt work

* rmvd gamma endpoint

---------

Co-authored-by: Mohamed Zeidan <zeidmo@amazon.com>

* fixed train unit tests (#1933)

Co-authored-by: Mohamed Zeidan <zeidmo@amazon.com>

* Updating DataSet versioning pattern for parity with UI (#1934)

* fix dataset arn configuration in eval (#1935)

Co-authored-by: Mufaddal Rohawala <mufi@amazon.com>

* Master model customization v3 (#1936)

* Updating DataSet versioning pattern for parity with UI

* Updating default S3 bucket and prefix for datasets

* fix: custom metrics and pipeline name in lineage (#1937)

* remove mlflow config for base model eval only

* fix unit

* fix custom metrics and pipeline name in lineage

---------

Testing:
1. Added/Ran units.
2. Tested LLMAJ eval.

Co-authored-by: Mufaddal Rohawala <mufi@amazon.com>

* Integ tests for ModelBuilder (#1939)

* Bedrock Nova

* Unit tests for bedrock nova

* Integ test updates

* Test case updates

* fixing AIRegistry integration and unit tests (#1941)

* Update error messages based on feedback (#1938)

* feat: Inline MLFlow metrics with Serverless MLFlow App (#1942)

* Master model customization v3 (#1945)

* Bedrock Nova

* Unit tests for bedrock nova

* Integ test updates

* Test case updates

* Bedrock tests

* fix: update eval integ tests (#1946)

* update eval integ tests

* fix unit tests

---------

Co-authored-by: Mufaddal Rohawala <mufi@amazon.com>

* Finetuning classes: Add integ tests (#1947)

* Add/port finetuning interfaces to v3

* Add/port finetuning interfaces to v3

* Add base_trainer, addition to PR#1906

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Finetuning classes: Add accept_eula support

* Finetuning classes: Add accept_eula support

* AIRegistry: Support user provided session and role

* AIRegistry: Support user provided session and role

* AIRegistry: Support user provided session and role

* Finetuning classes: Add integration tests

* Merge conflicts

* Merge conflicts

---------

Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>

* Workflow change to enable github actions (#1948)

* workflow change and test

* remove test

* Remove beta endpoint info, add region validation and tests  (#1950)

* Add/port finetuning interfaces to v3

* Add/port finetuning interfaces to v3

* Add base_trainer, addition to PR#1906

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Add integ tests for sft, rlvr,rlaif, dpo trainers

* Finetuning classes: Add accept_eula support

* Finetuning classes: Add accept_eula support

* AIRegistry: Support user provided session and role

* AIRegistry: Support user provided session and role

* AIRegistry: Support user provided session and role

* Finetuning classes: Add integration tests

* Merge conflicts

* Merge conflicts

* Remove beta endpoint details in sagemaker-core

* add region validation for models

---------

Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>

---------

Co-authored-by: Molly He <mollyhe@amazon.com>
Co-authored-by: jam-jee <jamjee@amazon.com>
Co-authored-by: rsareddy0329 <rsareddy0329@gmail.com>
Co-authored-by: Roja Reddy Sareddy <rsareddy@amazon.com>
Co-authored-by: Mohamed Zeidan <zeidmo@amazon.com>
Co-authored-by: Gokul Anantha Narayanan <166456257+nargokul@users.noreply.github.com>
Co-authored-by: Mufaddal Rohawala <89424143+mufaddal-rohawala@users.noreply.github.com>
Co-authored-by: Mufaddal Rohawala <mufi@amazon.com>
Co-authored-by: Zhaoqi <52220743+zhaoqizqwang@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants

0