Skip to content

Conversation

@tohtana
Copy link
Collaborator

@tohtana tohtana commented Feb 1, 2026

The current code has the following issues:

  • use_default_specs: false doesn't work
  • Injection by the traditional pattern runs even when custom patterns are set
  • mpu needs to be passed to deepspeed.initialize (HF integration doesn't pass mpu)

This PR fixes AutoTP setup to respect use_default_specs: false and disable the traditional injection path when custom patterns are enabled. Also, when mpu is not passed, we create a TP group in the initialization process.

With these changes, the related tests pass and all AutoTP examples in DeepSpeedExamples work now (PR).

Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>
Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>
Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>
Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants