Deploying TGI on Salad

Container

Required - Container Gateway Setup

Exec Health Probe

Recommended - Health Probes

Community

Blog

Salad Cloud

Home

Container Engine

Inference Endpoints

Managed Services

Gateway Service

API Reference

Portal

Run TGI (Text Generation Interface) by Hugging Face

Explore our guides and examples to integrate Salad.

Documentation

Start using the powerful network of Salad GPUs in under 5 minutes

Quickstart

Start integrating directly with Salad's robust API

Using the API

List Container Groups

Create a Container Group

Get a Container Group

Delete a Container Group

Update a Container Group

Start a Container Group

Stop a Container Group

Retrieves a list of container group instances

List Container Group Instances

Retrieves the details of a single instance within a container group by instance ID

Get Container Group Instance by instance ID

Remove a node from a workload and reallocate the workload to a different node

Reallocate container group instance to another node

Stops a container, destroys it, creates a new one without requiring the image to be downloaded again on a different node

Recreate container on a node

Restarts a workload on a node without reallocating it

Restart container on a node

Gets a list of inference endpoints that match the paginated query parameters. If no paginated query\nparameters are provided, the full list is returned.

Get an Inference Endpoint

Create a new Job

Returns a job in an inference endpoint

Delete a job from an inference endpoint

List the GPU Classes

List Queues

Create a Queue

Get a Queue

Delete a Queue

Update a Queue

Returns a job in a queue

Delete a job from a queue

Get Quotas

List Recipe Deployments

Create a Recipe Deployment

Gets a Recipe Deployment by its unique name

Get a Recipe Deployment

Delete a Recipe Deployment

Update a Recipe Deployment

Start a Deployed Recipe

Restart a Deployed Recipe

Stop a Deployed Recipe

Retrieves a list of recipe deployment instances

List Recipe Deployment Instances

List Recipes

Get a Recipe

Get workload errors

Salad Inference Endpoints (SIE)

Container Groups

Get your first container up and running on Salad in minutes!

Quickstart - API

Salad Container Engine (SCE)

The Deployment Lifecycle

Managing Deployments

Using Environment Variables

Requesting and using a JWT from Salad

Container Registries

Dockerhub

Amazon Elastic Container Registry (ECR)

Azure Container Registry

Google Container Registry

Quay Container Registry

GitHub Container Registry

External Logging

Axiom

New Relic

Splunk

Datadog

HTTP

Health Probes

Startup Probes

Liveness Probes

Readiness Probes (Preview)

Health Probe Examples

Specifying a command

Disk Space

Workload Logs

Container Logs

Billing & Pricing

Quotas

FAQs

Networking / Container Gateway

Enabling IPv6

Authenticated Requests

WebSockets

Error Pages

Job Queues

Creating a Job Queue

Job Queue Worker

Using Queues

Using Salad Recipes

How to deploy and use Stable Diffusion XL Recipe

Run Ollama

Run a Python App

How to deploy a Node App on Salad

Create Your first "hello world"

Create a container

Save to docker hub

Deploy via portal

Deploy via API

Run JupyterLab

Transcribe YouTube Videos

Deploy SLIP with Cog HTTP Prediction Server

Run Cog Applications on SaladCloud

In this step-by-step guide, we share how to deploy YOLOv8 on Salad's distributed cloud infrastructure for real-time object detection.

Deployment Guide

Batch Processing

In this blog, we showcase training three distinct custom YOLOv8 models on SaladCloud within an hour for just $1.

Training a custom YOLOv8 model

Long-Running Jobs

Salad's Managed Transcription API is available for Alpha customers. We don't have an SLA now, but we expect the service to reliably deliver high-quality transcripts/captions/subtitles at high volumes. We are quickly adding new capabilities and enhancing the scalability of our transcription service. Please use this [booking calendar](https://meetings.hubspot.com/derick-thompson1/salad-managed-ai-services) to schedule a time with our team to discuss your use case. If you want to get started quickly, [Fork this PostMan.com collection.](https://www.postman.com/salad-apis/workspace/salad-transcription-api/collection/34559698-f682b990-8c66-4f8f-a8ce-f2cafe8caaac/fork?origin=request-send)

Transcription API

Learn to create a pet avatar using the Salad Dreambooth API and Comfy UI.

Dreambooth API Tutorial - Pet Avatar

Learn to create an AI Avatar using the Salad Dreambooth API and Comfy UI.

(Video) Dreambooth API Tutorial - AI Avatar

Introduction

Connect to SGS using HTTP CONNECT over TLS and begin using it in production

Connect to SGS using HAProxy's PROXY v2 protocol

Using PROXY v2 Protocol

How to request removal of a Salad Node from your SGS server

Node Removals

SGS is designed for connecting to only specific domains

Approved Domains

Optional feature to increase success rate when connecting to specific domains

Smart Routing

Getting Started

Container Groups

Accessing Containers

Guides & FAQ

Run TGI (Text Generation Interface) by Hugging Face

Deploying TGI on Salad

Container

Required - Container Gateway Setup

Recommended - Health Probes

Exec Health Probe

Getting Started

Container Groups

Accessing Containers

Guides & FAQ

​Deploying TGI on Salad

​Container

​Required - Container Gateway Setup

​Recommended - Health Probes

​Exec Health Probe

Deploying TGI on Salad

Container

Required - Container Gateway Setup

Recommended - Health Probes

Exec Health Probe