API Reference

Packages

batch.tensorstack.dev/v1beta1

Package v1beta1 contains API Schema definitions for the batch v1beta1 API group.

Resource Types

ReplicaSpec

ReplicaSpec describes the spec of a replica.

Appears in: TensorFlowTrainingJobSpec

| Field | Description |
| --- | --- |
| type ReplicaType | ReplicaType is the type of the replica, one of “chief”, “worker”, “ps”, or “evaluator”. |
| replicas integer | The desired number of replicas created for the current replica type. If unspecified, defaults to 1. |
| template PodTemplateSpec | Describes the pod that will be created for this replica. Note that RestartPolicy in PodTemplateSpec is always set to Never, because the job controller creates new pods when a restart is required. |
| restartPolicy RestartPolicy | The restart policy for this replica, one of Always, OnFailure, Never, or ExitCode. |
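
As a rough illustration of the fields above, the following Python sketch builds a ReplicaSpec entry as a plain dictionary and checks the enumerated values client-side. The helper `make_replica_spec`, the container name, and the image are hypothetical and not part of the API. The `replicas` default of 1 is applied here only to mirror the documented behavior.

```python
# Allowed values are taken from the ReplicaType and RestartPolicy descriptions.
REPLICA_TYPES = {"chief", "worker", "ps", "evaluator"}
RESTART_POLICIES = {"Always", "OnFailure", "Never", "ExitCode"}


def make_replica_spec(replica_type, template, replicas=None, restart_policy=None):
    """Build one ReplicaSpec entry as a plain dict (hypothetical helper)."""
    if replica_type not in REPLICA_TYPES:
        raise ValueError(f"unknown replica type: {replica_type!r}")
    if restart_policy is not None and restart_policy not in RESTART_POLICIES:
        raise ValueError(f"unknown restart policy: {restart_policy!r}")

    spec = {
        "type": replica_type,
        # The reference states that replicas defaults to 1 when unspecified.
        "replicas": 1 if replicas is None else replicas,
        "template": template,
    }
    if restart_policy is not None:
        spec["restartPolicy"] = restart_policy
    return spec


# Hypothetical pod template; the container name and image are placeholders.
pod_template = {
    "spec": {
        "containers": [{"name": "tensorflow", "image": "example.com/train:latest"}]
    }
}

worker = make_replica_spec("worker", pod_template, replicas=4, restart_policy="OnFailure")
ps = make_replica_spec("ps", pod_template)
print(worker["replicas"], ps["replicas"])
```

The pod template omits its own restart policy on purpose, since the reference says the controller always sets it to Never.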

ReplicaType

Underlying type: string

ReplicaType is the type of the replica, one of “chief”, “worker”, “ps”, or “evaluator”.

Appears in: ReplicaSpec

RestartPolicy

Underlying type: string

RestartPolicy describes how the replicas should be restarted. Can be one of: Always, OnFailure, Never, or ExitCode.

Appears in: ReplicaSpec

RunPolicy

RunPolicy encapsulates various runtime policies of the distributed training job, for example how to clean up resources and how long the job can stay active.

Appears in: TensorFlowTrainingJobSpec

| Field | Description |
| --- | --- |
| activeDeadlineSeconds integer | Specifies the duration in seconds, relative to the startTime, that the job may be active before the system tries to terminate it. The value must be a positive integer. |
| backoffLimit integer | Optional. The number of retries before the job is marked as failed. |
| cleanUpPolicy CleanUpPolicy | Cleans up the tasks after the training job has finished. |
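
The deadline arithmetic implied by activeDeadlineSeconds can be sketched as follows. This is a minimal client-side illustration, not the controller's code. The function names are hypothetical, and treating the deadline as exceeded only when the current time is strictly later than startTime plus the duration is an assumption.

```python
from datetime import datetime, timedelta, timezone


def validate_active_deadline(seconds):
    """Reject values that are not positive integers (bool is excluded)."""
    if isinstance(seconds, bool) or not isinstance(seconds, int) or seconds <= 0:
        raise ValueError("activeDeadlineSeconds must be a positive integer")
    return seconds


def deadline_exceeded(start_time, active_deadline_seconds, now):
    """Return True once `now` is past start_time + activeDeadlineSeconds."""
    seconds = validate_active_deadline(active_deadline_seconds)
    deadline = start_time + timedelta(seconds=seconds)
    # Assumption: the boundary instant itself does not count as exceeded.
    return now > deadline


start = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
run_policy = {"activeDeadlineSeconds": 3600, "backoffLimit": 3}

# 59 minutes after start: still inside the one-hour window.
print(deadline_exceeded(start, run_policy["activeDeadlineSeconds"], start + timedelta(minutes=59)))
# 61 minutes after start: the window has passed.
print(deadline_exceeded(start, run_policy["activeDeadlineSeconds"], start + timedelta(minutes=61)))
```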

TensorFlowTrainingJob

TensorFlowTrainingJob is the Schema for the TensorFlowTrainingJob API.

Appears in: TensorFlowTrainingJobList

| Field | Description |
| --- | --- |
| apiVersion string | batch.tensorstack.dev/v1beta1 |
| kind string | TensorFlowTrainingJob |
| metadata ObjectMeta | Refer to Kubernetes API documentation for fields of metadata. |
| spec TensorFlowTrainingJobSpec | |
| status TensorFlowTrainingJobStatus | |

TensorFlowTrainingJobList

TensorFlowTrainingJobList contains a list of TensorFlowTrainingJob objects.

| Field | Description |
| --- | --- |
| apiVersion string | batch.tensorstack.dev/v1beta1 |
| kind string | TensorFlowTrainingJobList |
| metadata ListMeta | Refer to Kubernetes API documentation for fields of metadata. |
| items TensorFlowTrainingJob array | |

TensorFlowTrainingJobSpec

TensorFlowTrainingJobSpec outlines the intended configuration and execution parameters for a TensorFlowTrainingJob.

Appears in: TensorFlowTrainingJob

| Field | Description |
| --- | --- |
| replicaSpecs ReplicaSpec array | Describes the spec of the replicas of the job. |
| runMode RunMode | The job’s execution behavior. If omitted, defaults to Immediate mode, and tasks are executed immediately upon submission. |
| tensorboardSpec TensorBoardSpec | Describes the TensorBoard to be created for showing training logs. |
| runPolicy RunPolicy | Execution policy configurations governing the behavior of the TensorFlowTrainingJob. |
| scheduler SchedulePolicy | Identifies the preferred scheduler for allocating resources to replicas. Defaults to the cluster’s default scheduler. |
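
To show how the top-level fields and the spec fit together, here is a minimal Python sketch that assembles a TensorFlowTrainingJob manifest as a dictionary and prints it as JSON. The helper `make_training_job`, the job name `mnist-example`, and the container image are hypothetical placeholders. Only `replicaSpecs` and `runPolicy` are set. The other spec fields are left out because their sub-fields are not described in this section.

```python
import json


def make_training_job(name, replica_specs, run_policy=None):
    """Assemble a TensorFlowTrainingJob manifest as a dict (hypothetical helper)."""
    spec = {"replicaSpecs": replica_specs}
    if run_policy is not None:
        spec["runPolicy"] = run_policy
    return {
        "apiVersion": "batch.tensorstack.dev/v1beta1",
        "kind": "TensorFlowTrainingJob",
        "metadata": {"name": name},
        "spec": spec,
    }


# Placeholder container definition; replace with a real training image.
container = {"name": "tensorflow", "image": "example.com/train:latest"}

job = make_training_job(
    "mnist-example",
    replica_specs=[
        {
            "type": "worker",
            "replicas": 2,
            "restartPolicy": "OnFailure",
            "template": {"spec": {"containers": [container]}},
        },
    ],
    run_policy={"activeDeadlineSeconds": 3600, "backoffLimit": 3},
)

# kubectl accepts JSON manifests as well as YAML.
print(json.dumps(job, indent=2))
```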

TensorFlowTrainingJobStatus

TensorFlowTrainingJobStatus defines the observed state of TensorFlowTrainingJob.

Appears in: TensorFlowTrainingJob

| Field | Description |
| --- | --- |
| tasks Tasks array | The statuses of individual tasks. |
| tensorboard DependentStatus | The status of the TensorBoard. |
| backoffCount integer | The number of restarts performed so far. |
| aggregate Aggregate | |
| conditions JobCondition array | Represents the latest available observations of a TensorFlowTrainingJob’s current state. |
| phase JobPhase | Phase is the phase-style status of the TensorFlowTrainingJob. |
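
A client that has already fetched the job object could condense these status fields roughly as below. This is only a sketch. The helper `summarize_status` is hypothetical, the phase value `Running` in the sample is an assumed placeholder because the JobPhase values are not listed here, and treating a missing `backoffCount` as 0 is also an assumption. The contents of individual tasks and conditions are not inspected, since their fields are not described in this section.

```python
def summarize_status(status):
    """Return a one-line summary of a TensorFlowTrainingJobStatus-shaped dict."""
    phase = status.get("phase", "<unknown>")
    # Assumption: an absent backoffCount means no restarts have happened yet.
    backoff = status.get("backoffCount", 0)
    n_tasks = len(status.get("tasks", []))
    n_conditions = len(status.get("conditions", []))
    return f"phase={phase} backoffCount={backoff} tasks={n_tasks} conditions={n_conditions}"


# Hypothetical status payload; the nested objects are left empty on purpose.
sample_status = {
    "phase": "Running",
    "backoffCount": 1,
    "tasks": [{}, {}],
    "conditions": [{}],
}
print(summarize_status(sample_status))
```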