TorchX is a universal job launcher for PyTorch applications. It is designed for fast iteration during training and research, with support for end-to-end production ML pipelines when you're ready.
TorchX currently supports:
- Kubernetes (EKS, GKE, AKS, etc.)
- Slurm
- AWS Batch
- Docker
- Local
Need a scheduler not listed? Let us know!
See the quickstart guide.
torchx:
Some schedulers have additional, scheduler-specific requirements. See installation for info.
# install torchx sdk and CLI -- minimum dependencies
pip install torchx
# install with all dependencies
pip install "torchx[dev]"
# install torchx Kubernetes / Volcano support
pip install "torchx[kubernetes]"
# install torchx GCP Batch support
pip install "torchx[gcp_batch]"
# install torchx sdk and CLI (nightly build)
pip install "torchx-nightly[dev]"
# clone the repo
git clone https://github.com/meta-pytorch/torchx.git
cd torchx
# install with uv (recommended for development)
uv sync --extra dev
# or with pip
pip install -e ".[dev]"
TorchX provides a Docker container for use as part of a TorchX role.
See: https://github.com/meta-pytorch/torchx/pkgs/container/torchx
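After any of the pip installs above, a quick sanity check can confirm that the package and its CLI are visible in the current environment. The following is a minimal sketch, assuming `python3` and `pip` are available; the `torchx_pkg` and `torchx_cli` variable names are illustrative only and not part of TorchX.

```shell
# Sanity check after installation (illustrative sketch, not part of TorchX).

# Is the torchx package installed in this Python environment?
if python3 -m pip show torchx >/dev/null 2>&1; then
    torchx_pkg="installed"
else
    torchx_pkg="missing"
fi

# Is the torchx CLI entry point on PATH?
if command -v torchx >/dev/null 2>&1; then
    torchx_cli="found"
else
    torchx_cli="not found"
fi

echo "torchx package: ${torchx_pkg}"
echo "torchx CLI: ${torchx_cli}"
```

For a nightly install, query `torchx-nightly` instead of `torchx` in the `pip show` call, since that is the distribution name pip records.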
We welcome PRs! See the CONTRIBUTING file.
TorchX is BSD licensed, as found in the LICENSE file.