Skip to content

Canard

Serverless Isaac Sim RL training. Define your sweep, we handle the GPUs.

Canard is a managed platform for robotics reinforcement learning. Submit training runs and hyperparameter sweeps from Python — Canard distributes them across cloud GPUs, handles fault tolerance, and streams results to a live dashboard.


Install

pip install canard-sdk

30-Second Example

from canard import Client

client = Client(api_url="https://api.canard.cloud")

run = client.submit_training_run(
    name="go2-standing",
    task_name="Template-Go2-Standing-Direct-v0",
    code_url="https://github.com/canard-cloud/go2-standing-env",
    num_envs=4096,
    max_iterations=5000,
    gpu_count=2,
)

run.wait_for_completion(show_progress=True)
run.download_best_checkpoint("./checkpoints")

What You Get

Feature How
Training runs submit_training_run() — single run on cloud GPUs
Hyperparameter sweeps submit_training_sweep() — grid search across learning rates, entropy, etc.
Live monitoring Dashboard at app.canard.cloud with reward curves
Fault tolerance Spot preemption? Task re-queues automatically
Cost estimates SDK prints estimated cost before you confirm
Artifact download Checkpoints, videos, TensorBoard logs — all in S3
W&B integration Weights & Biases logging built in

Training Tiers

Tier GPUs Hourly Rate Best For
Standard 1x ~$0.50/hr Prototyping
Fast 2x ~$1.00/hr Research (default)
Turbo 4x ~$2.00/hr Sweeps, deadlines

Next Steps

  • :material-download: Installation — Install the SDK and set up credentials
  • :material-rocket-launch: Quickstart — Submit your first training run
  • :material-tune: Parameter Sweeps — Run hyperparameter searches
  • :material-book-open-variant: API Reference — Full Client and RunHandle docs