Cargo Registry

Browse available cargos

A Cargo is a portable, signed unit of computation that runs on Islands across the Archipelag.io network.

Think of it as a shipping container for code — each one packages a model, runtime, and resource requirements into a single deployable artifact. Consumers submit jobs, the coordinator finds the best Island, and the Cargo executes in a secure sandbox.

Consumer

Cargo

Island

ONNXDockerWASMGGUF

Read the full architecture docs

GGUF

Mistral 7B InstructOfficial

Mistral 7B Instruct v0.2 chat inference using llama.cpp

Mistral 7B InstructQ4_K_M4.4 GBGPU optional

GGUF

Qwen3.5-0.8BOfficial

Ultra-lightweight language model for edge and mobile inference

Qwen3.5-0.8BQ4_K_M0.6 GBGPU optional

GGUF

Qwen3.5-4BOfficial

Compact language model with strong reasoning capabilities

Qwen3.5-4BQ4_K_M2.7 GBGPU optional

GGUF

Qwen3.5-9BOfficial

Mid-size language model for consumer GPU inference

Qwen3.5-9BQ4_K_M5.5 GBGPU optional

GGUF

Qwen3.5-27BOfficial

High-quality language model for demanding reasoning tasks

Qwen3.5-27BQ4_K_M16 GBGPU required

GGUF

Qwen3.5-35B-A3B (MoE)Official

Mixture-of-Experts LLM with 35B total / 3B active parameters

Qwen3.5-35B-A3BMoEQ4_K_M20 GBGPU optional

GGUF

Qwen3-CoderOfficial

Code-specialized LLM for code generation, completion, and explanation

Qwen3-Coder-NextQ4_K_M16 GBGPU required

GGUF

GLM-5Official

Massive MoE LLM for reasoning, coding, and agentic tasks

GLM-5MoE744B/40B activeQ4_K_M42 GBGPU required

GGUF

Llama 3.1 8BOfficial

Meta's general-purpose LLM with large fine-tune ecosystem

Llama-3.1-8B-InstructQ4_K_M4.9 GBGPU optional

GGUF

GPT-OSS 20BOfficial

OpenAI's first open-weight language model

gpt-oss-20bQ4_K_M12 GBGPU required

GGUF

MiniMax M2.5Official

Agentic LLM optimized for complex multi-step workflows

MiniMax-M2.5Q4_K_M16 GBGPU required

GGUF

Nanbeige4.1-3BOfficial

Lightweight multilingual LLM with strong Chinese language support

Nanbeige4.1-3BQ4_K_M2 GBGPU optional

GGUF

Nemotron-3 120B-A12BOfficial

NVIDIA's MoE LLM with 120B total / 12B active parameters

Nemotron-3-120B-A12BMoEQ4_K_M68 GBGPU required

DIFF

Stable DiffusionOfficial

Text-to-image generation using Stable Diffusion 1.5 and SDXL

SD 1.5 · SDXL8 GB VRAMGPU required

DIFF

Music GenOfficial

Text-to-music generation using Meta MusicGen

facebook/musicgen-small4 GB VRAMGPU required

DIFF

FLUX.1-devOfficial

High-quality text-to-image generation with superior prompt adherence

black-forest-labs/FLUX.1-dev12 GB VRAMGPU required

DIFF

FLUX.1-schnellOfficial

Ultra-fast text-to-image generation (4-step distilled)

FLUX.1-schnell4-step8 GB VRAMGPU required

DIFF

Z-Image-TurboOfficial

Ultra-fast text-to-image generation from Alibaba

Tongyi-MAI/Z-Image-Turbo8 GB VRAMGPU required

DIFF

Stable Diffusion 3.5 LargeOfficial

Latest text-to-image from Stability AI with MMDiT architecture

stabilityai/sd-3.5-largeMMDiT12 GB VRAMGPU required

DIFF

Qwen-ImageOfficial

Multimodal image generation from Alibaba's Qwen family

Qwen/Qwen-Image-251212 GB VRAMGPU required

DIFF

Wan2.1-T2V-14BOfficial

High-quality text-to-video generation (14B parameters)

Wan-AI/Wan2.1-T2V-14BVideo24 GB VRAMGPU required

DIFF

Wan2.2-T2V-A14BOfficial

Text-to-video with MoE architecture for efficient inference

Wan2.2-T2V-A14BMoEVideo16 GB VRAMGPU required

DIFF

HunyuanVideo-1.5Official

High-fidelity text-to-video generation from Tencent

tencent/HunyuanVideo-1.5Video24 GB VRAMGPU required

DIFF

Wan2.1-T2V-1.3BOfficial

Lightweight text-to-video for consumer GPUs

Wan2.1-T2V-1.3BVideo8 GB VRAMGPU required

DIFF

CogVideoX-2bOfficial

Efficient text-to-video generation from Z.AI

zai-org/CogVideoX-2bVideo8 GB VRAMGPU required

ONNX

DistilBERT SentimentOfficial

Classifies text sentiment as positive or negative using DistilBERT

distilbert-base-uncased-finetuned-sst-2-english

ONNX

BART-CNN SummarizationOfficial

Summarizes long text using BART-large fine-tuned on CNN/DailyMail

facebook/bart-large-cnn

ONNX

OPUS-MT TranslationOfficial

Translates text between 100+ language pairs using MarianMT

Helsinki-NLP/opus-mt

ONNX

BERT-NEROfficial

Identifies persons, organizations, and locations in text

dbmdz/bert-large-cased-finetuned-conll03-english

ONNX

DistilBERT QAOfficial

Extracts answers from context passages using DistilBERT

distilbert-base-cased-distilled-squad

ONNX

BERT Base Fill-MaskOfficial

Predicts masked tokens in text using BERT

bert-base-uncased

ONNX

BART-MNLI ClassificationOfficial

Classifies text into arbitrary categories using BART

facebook/bart-large-mnli

ONNX

T5 Grammar CorrectionOfficial

Corrects grammar in text using a fine-tuned T5 model

vennify/t5-base-grammar-correction

ONNX

T5 ParaphrasingOfficial

Generates paraphrased versions of input text using T5

Vamsi/T5_Paraphrase_Paws

ONNX

Toxic-BERTOfficial

Detects toxic content in text using a fine-tuned BERT classifier

unitary/toxic-bert

ONNX

KeyBERT ExtractionOfficial

Extracts keywords and keyphrases from text using KeyBERT

sentence-transformers/all-MiniLM-L6-v2

ONNX

MiniLM-L6 EmbeddingsOfficial

Generates text embeddings using sentence-transformers

all-MiniLM-L6-v2 · 384 dims

ONNX

CodeGen-350MOfficial

Generates code completions using CodeGen for Python

Salesforce/codegen-350M-mono

ONNX

YOLOv8 DetectionOfficial

Detects objects in images with bounding boxes using YOLOv8

ultralytics/yolov8n

ONNX

YOLOv8 SegmentationOfficial

Performs instance segmentation on images using YOLOv8-seg

ultralytics/yolov8n-seg

ONNX

BLIP CaptioningOfficial

Generates natural language descriptions of images using BLIP

Salesforce/blip-image-captioning-base

ONNX

DPT-Hybrid DepthOfficial

Estimates depth maps from single images using DPT-Hybrid

Intel/dpt-hybrid-midas

ONNX

MTCNN-FaceNetOfficial

Detects faces and generates facial embeddings using MTCNN + FaceNet

InceptionResnetV1-vggface2GPU required

ONNX

U2-Net Background RemovalOfficial

Removes background from images, returns transparent PNG

U2-Net (rembg)

ONNX

Style TransferOfficial

Applies artistic style transfer to images

fast_neural_styleGPU required

ONNX

Real-ESRGAN UpscalingOfficial

Upscales images 2x or 4x using Real-ESRGAN

RealESRGAN_x4plusGPU required

ONNX

GLM-OCROfficial

Advanced document understanding and OCR using GLM vision model

zai-org/GLM-OCR4 GB VRAMGPU required

ONNX

NuMarkdownOfficial

Converts document images to structured Markdown with reasoning

numind/NuMarkdown-8B-Thinking16 GB VRAMGPU required

ONNX

FireRed-OCROfficial

High-accuracy OCR for documents, handwriting, and scene text

FireRedTeam/FireRed-OCR2 GB VRAMGPU required

ONNX

Whisper BaseOfficial

Transcribes and translates audio with word-level timestamps

openai/whisper-baseGPU required

ONNX

Tacotron2 TTSOfficial

Converts text to speech audio using Coqui TTS

tacotron2-DDCGPU required

ONNX

Whisper Large-v3-TurboOfficial

Fast speech-to-text (2x faster than large-v3, near-identical quality)

openai/whisper-large-v3-turbo2x faster2 GB VRAMGPU required

ONNX

Qwen3-ASROfficial

Multilingual speech recognition from Qwen family

Qwen/Qwen3-ASR-1.7BMultilingual4 GB VRAMGPU required

ONNX

Voxtral-ASROfficial

Realtime streaming speech recognition from Mistral AI

mistralai/Voxtral-Mini-4BRealtime8 GB VRAMGPU required

ONNX

VibeVoice-ASROfficial

High-accuracy speech recognition from Microsoft

microsoft/VibeVoice-ASR2 GB VRAMGPU required

ONNX

Parakeet-ASROfficial

Fast accurate speech recognition from NVIDIA (0.6B parameters)

nvidia/parakeet-tdt-0.6b-v30.6BGPU optional

ONNX

Kokoro TTSOfficial

Ultra-lightweight high-quality text-to-speech (82M parameters)

hexgrad/Kokoro-82M82M paramsCPUGPU optional

ONNX

Qwen3-TTSOfficial

Voice cloning and custom voice text-to-speech from Qwen family

Qwen/Qwen3-TTS-1.7BVoice cloning4 GB VRAMGPU required

ONNX

Chatterbox TTSOfficial

High-quality voice cloning text-to-speech

ResembleAI/chatterboxVoice cloning2 GB VRAMGPU required

ONNX

VibeVoice TTSOfficial

Multi-speaker instruction-following text-to-speech from Microsoft

microsoft/VibeVoice-1.5BMulti-speaker4 GB VRAMGPU required

ONNX

CSM TTSOfficial

Conversational speech model optimized for natural dialogue

sesame/csm-1bConversational2 GB VRAMGPU required

ONNX

IndexTTS-2Official

High-quality multilingual text-to-speech

IndexTeam/IndexTTS-2Multilingual2 GB VRAMGPU required

ONNX

VibeVoice RealtimeOfficial

Realtime streaming text-to-speech optimized for low latency

microsoft/VibeVoice-Realtime-0.5BStreamingGPU optional

ONNX

Fish Audio S2Official

Multi-speaker text-to-speech with instruction-following control

fishaudio/s2-proMulti-speaker4 GB VRAMGPU required

ONNX

Qwen3-Embedding-0.6BOfficial

Lightweight text embeddings from Qwen family

Qwen/Qwen3-Embedding-0.6B0.6BCPU

ONNX

Qwen3-Embedding-8BOfficial

High-quality text embeddings for RAG from Qwen family

Qwen/Qwen3-Embedding-8B8B16 GB VRAMGPU required

ONNX

E5-Large MultilingualOfficial

Battle-tested multilingual text embeddings

intfloat/multilingual-e5-large1024 dimsCPU

ONNX

BGE-SmallOfficial

Fast lightweight English text embeddings

BAAI/bge-small-en-v1.5384 dimsCPU

ONNX

Jina Embeddings v5Official

Latest-generation text embeddings from Jina AI

jinaai/jina-embeddings-v5-text-smallCPU

ONNX

Qwen3-VL-EmbeddingOfficial

Vision+text multimodal embeddings from Qwen family

Qwen/Qwen3-VL-Embedding-2BMultimodal4 GB VRAMGPU required

OCI

Video TranscodeOfficial

Video transcoding and format conversion

OCI

Video CompressOfficial

Video compression and transcoding

OCI

Video ClipOfficial

Extract a time range from a video

OCI

Video ThumbnailOfficial

Extract thumbnails from video

OCI

Video SubtitleOfficial

Video subtitling using Whisper

OCI

GIF MakerOfficial

GIF creation from images or video

OCI

ScreenshotOfficial

Capture screenshots from URLs using headless Chrome

OCI

Audio ConvertOfficial

Audio format conversion using ffmpeg

OCI

Audio MergeOfficial

Merge and concatenate audio files

OCI

Audio SplitOfficial

Split audio into segments at silence boundaries

OCI

Noise RemoveOfficial

Audio noise removal using noisereduce

OCI

Normalize AudioOfficial

Normalize audio volume levels

OCI

OCROfficial

Optical character recognition for image-to-text

OCI

PDF ExtractOfficial

PDF text and content extraction

OCI

PDF MergeOfficial

PDF merging and splitting

OCI

PDF SignOfficial

Add text watermark or signature to PDF pages

OCI

HTML to PDFOfficial

Convert HTML to PDF using WeasyPrint

OCI

Markdown to PDFOfficial

Convert Markdown to styled PDF

OCI

DOCX ExtractOfficial

Extract text and images from Word documents

OCI

Excel ParseOfficial

Parse Excel spreadsheets to JSON

OCI

Batch TemplateOfficial

Generic data processing template for batch jobs

OCI

ResizeOfficial

Image resizing and thumbnail generation

OCI

Compress ImageOfficial

Image compression with quality control

OCI

Convert ImageOfficial

Image format conversion

OCI

Smart CropOfficial

Smart crop using edge-based saliency detection

OCI

Blur FaceOfficial

Blur faces in images for privacy

OCI

WatermarkOfficial

Add text or image watermarks

OCI

ColorizeOfficial

Colorize grayscale images

OCI

Image DiffOfficial

Compare two images and highlight differences

OCI

Image MetadataOfficial

Extract and manipulate image EXIF metadata

OCI

SVG OptimizeOfficial

SVG optimization and minification

OCI

SVG to PNGOfficial

Convert SVG to PNG using CairoSVG

OCI

BarcodeOfficial

Generate and read barcodes

OCI

Format CodeOfficial

Code formatting for multiple languages

OCI

JSON to TypesOfficial

Generate TypeScript or Python types from JSON

OCI

Language DetectOfficial

Language detection using lingua

OCI

LLM Chat (Mock)Official

Mock LLM chat for testing

OCI

Stable Diffusion (Mock)Official

Mock image generation for testing

CML

Qwen3.5-0.8B (CoreML)Official

Ultra-lightweight on-device LLM for iPhone Neural Engine

Qwen3.5-0.8BCoreMLiOS 17+Neural Engine

CML

WhisperKit (CoreML)Official

On-device speech recognition using pre-compiled CoreML Whisper

argmaxinc/whisperkitCoreMLiOS 17+Neural Engine

CML

Kokoro TTS (CoreML)Official

Ultra-lightweight on-device text-to-speech for iOS

Kokoro-82MCoreMLiOS 17+Neural Engine

WASM

HashOfficial

SHA-256, MD5, and Blake3 hash computation

WASM

JSONOfficial

JSON processing with jq-like operations

WASM

CSVOfficial

CSV processing with JSON conversion and filtering

WASM

Base64Official

Base64 encode and decode operations

WASM

CompressOfficial

Gzip compression and decompression

WASM

RegexOfficial

Regex testing and extraction

WASM

MarkdownOfficial

Markdown to HTML conversion (GFM)

WASM

Markdown TOCOfficial

Generate table of contents from Markdown

WASM

UUIDOfficial

UUID generation (v4, v7)

WASM

QR CodeOfficial

QR code generation from text or URLs

WASM

JWTOfficial

JWT decode and inspect

WASM

SanitizeOfficial

HTML sanitization for XSS protection

WASM

YAMLOfficial

YAML to JSON conversion

WASM

TOMLOfficial

TOML to JSON conversion

WASM

URLOfficial

URL parsing and manipulation

WASM

SlugOfficial

URL slug generation

WASM

SemverOfficial

Semantic version parsing and comparison

WASM

DiffOfficial

Text diff generation

WASM

HighlightOfficial

Syntax highlighting with HTML output

WASM

MinifyOfficial

HTML, CSS, and JS minification

WASM

CronOfficial

Cron expression parsing and next-run calculation

WASM

EchoOfficial

Test cargo that echoes JSON input

Submit your own cargo

Package your model, container, or WASM module as a cargo and publish it to the Archipelag.io network. Community cargos start unverified — pass our security review to earn the Certified badge and unlock higher trust levels.

CommunitySubmit a cargo — runs sandboxed, no network access

CertifiedPass security review — higher resource limits, trusted by consumers

OfficialMaintained by Archipelag.io — full network and GPU access

Publishing guide View on GitHub