TorchElasticEnvironment

class lightning.pytorch.plugins.environments.TorchElasticEnvironment

Bases: lightning.fabric.plugins.environments.cluster_environment.ClusterEnvironment

Environment for fault-tolerant and elastic training with torchelastic.

static detect()

Returns True if the current process was launched using the torchelastic command.

Return type

bool
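A minimal sketch of how such a detection check could work, assuming the launcher exports the TORCHELASTIC_RUN_ID variable that torchrun documents. The helper name looks_like_torchelastic is hypothetical, and the check the library actually performs may differ.

```python
import os
from typing import Mapping, Optional


def looks_like_torchelastic(env: Optional[Mapping[str, str]] = None) -> bool:
    """Heuristic: treat the process as torchelastic-launched if a run ID is set.

    Hypothetical helper for illustration, not the library's implementation.
    """
    # Fall back to the real process environment when no mapping is passed in.
    env = os.environ if env is None else env
    # Assumption: the launcher sets TORCHELASTIC_RUN_ID for every worker.
    return "TORCHELASTIC_RUN_ID" in env


launched = looks_like_torchelastic({"TORCHELASTIC_RUN_ID": "example-run"})
not_launched = looks_like_torchelastic({})
```

Passing the environment as a mapping keeps the check easy to exercise without actually starting a distributed job.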

global_rank()

The rank (index) of the currently running process across all nodes and devices.

Return type

int

local_rank()

The rank (index) of the currently running process within the current node.

Return type

int

node_rank()

The rank (index) of the node on which the current process runs.

Return type

int

world_size()

The number of processes across all devices and nodes.

Return type

int
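The four values above can be illustrated with a small sketch that reads them from the environment variables torchrun documents (RANK, LOCAL_RANK, GROUP_RANK, WORLD_SIZE). The ProcessTopology class and read_topology function are hypothetical names used only for this example, and the mapping to variables is an assumption about how the environment is populated, not a statement of the library's internals.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class ProcessTopology:
    """Hypothetical container mirroring the four rank/size accessors."""

    global_rank: int  # index across all nodes and devices
    local_rank: int   # index within the current node
    node_rank: int    # index of the node itself
    world_size: int   # total number of processes


def read_topology(env: Optional[Mapping[str, str]] = None) -> ProcessTopology:
    # Fall back to the real process environment when no mapping is passed in.
    env = os.environ if env is None else env
    # Assumption: the launcher exports these variables as decimal strings.
    return ProcessTopology(
        global_rank=int(env["RANK"]),
        local_rank=int(env["LOCAL_RANK"]),
        node_rank=int(env["GROUP_RANK"]),
        world_size=int(env["WORLD_SIZE"]),
    )


# Illustrative values: 2 nodes with 4 processes each,
# viewed from the second process on the second node.
example = read_topology(
    {"RANK": "5", "LOCAL_RANK": "1", "GROUP_RANK": "1", "WORLD_SIZE": "8"}
)
```

In this illustrative layout the process has global rank 5 out of 8, while its local rank of 1 only identifies it among the processes on its own node.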

property creates_processes_externally: bool

Whether the environment creates the subprocesses.

Return type

bool

property main_address: str

The main address through which all processes connect and communicate.

Return type

str

property main_port: int

An open and configured port in the main node through which all processes communicate.

Return type

int
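As a rough sketch of how the main address and port fit together, the example below reads them from the MASTER_ADDR and MASTER_PORT variables that torchrun documents and combines them into a tcp:// rendezvous URL. The function names are hypothetical, and sourcing the values from these variables is an assumption made for illustration.

```python
import os
from typing import Mapping, Optional, Tuple


def read_main_endpoint(env: Optional[Mapping[str, str]] = None) -> Tuple[str, int]:
    """Return (address, port) of the main node. Hypothetical helper."""
    # Fall back to the real process environment when no mapping is passed in.
    env = os.environ if env is None else env
    # Assumption: both variables are set by the launcher; the port is a decimal string.
    return env["MASTER_ADDR"], int(env["MASTER_PORT"])


def tcp_init_method(address: str, port: int) -> str:
    # Build a TCP rendezvous URL of the form tcp://host:port.
    return f"tcp://{address}:{port}"


# Illustrative values only.
address, port = read_main_endpoint(
    {"MASTER_ADDR": "10.0.0.1", "MASTER_PORT": "29500"}
)
init_method = tcp_init_method(address, port)
```

Every process in the job would need to see the same address and port for the rendezvous to succeed.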