⚠ This page documents a version of Oakestra which is not the latest stable. Please refer to the latest docs for a current version.

Scheduling

How does the scheduling work in Oakestra?

Oakestra’s architecture consists of a two-tier design where resources are organized into clusters. Each cluster represents an aggregation of all its resources. When a job is submitted, it is first scheduled to a cluster. The cluster’s scheduler then determines the target worker node to execute the job.

The scheduler component is implemented as a lightweight Go component. It receives a job description and returns an allocation target. Scheduling in Oakestra operates at two levels:

Root Scheduler: Determines a suitable cluster for the job.
Cluster Scheduler: Selects the appropriate worker node within the chosen cluster.

To abstract the scheduling targets (candidates), Oakestra employs a Resource Abstractor. This service transforms clusters and worker nodes into generic candidates with defined capabilities. This abstraction ensures compatibility between cluster and worker selection algorithms.

Scheduler Architecture

The scheduler operates in the following manner:

It receives requests in the form of service desriptors from the root or cluster orchestrator via an exposed API endpoint
The scheduling jobs are enqueued in a task queue
The scheduler queries the resource abstractor for a list of available candidates and their resources
An implemented scheduling algorithm is called to evaluate the candidates with respect to the service descriptor
The resulting candidate is communicated back to the orchestrator

Scheduling Algorithms

The Scheduler supports linking in different scheduling algorithms through the Go modules system. This allows the scheduler to optimise for different criteria or consider different resources.

Scheduling algorithms typically evaluate the available candidates in two passes:

Filtering Stage: All candidates are filtered with respect to the minimum service requirements and constraints
Evaluation Stage: The remaining candidates are sorted according to an optimisation criterium. The best candidate is returned

Interested Resources

Each scheduling algorithm provides information on which resource types it considers. The interested resources are passed to the resource abstractor when quering the available candidates. The resource abstractor will return the available candiates with all of the canonical resources, and the interested non-canonical resources.

The concept of canonical and non-canonical resources is new to Oakestra since Conga. You can read more under Resource Management

Contraints

Constrains are requirements a service may have, that are not resource demands. Currently only direct mapping constraints are implemented.

Direct mapping positioning

The direct mapping constraint allows developers to explicitly define a list of target clusters and nodes in the deployment description. The scheduling algorithm will then operate only on the active clusters or nodes specified in the list.

For example, the following constraint:

"constraints":[
            {
              "type":"direct",
              "node":"xavier1",
              "cluster":"cluster1"
            }
          ]

limits the deployment to the node xavier1 of the cluster cluster1. While the following constraint:

"constraints":[
            {
              "type":"direct",
              "cluster":"gpu"
            }
          ]

limits the deployment to all worker nodes within the cluster gpu.

Resource Management

Networking

Scheduling

How does the scheduling work in Oakestra?#

Scheduler Architecture#

Scheduling Algorithms#

Interested Resources#

Contraints#

Direct mapping positioning#

How does the scheduling work in Oakestra?

Scheduler Architecture

Scheduling Algorithms

Interested Resources

Contraints

Direct mapping positioning