Nexlab project · local AI worker · AISBF-ready

CoderAI turns your machine into a multimodal AI backend.

CoderAI is an open source local model orchestrator with an OpenAI-compatible API, a Web Studio for generation tasks, GPU-aware model loading, and a broker client that lets AISBF route work to your machine over outbound WebSockets.

Read CoderAI docs Download Docker bundle CoderAI tutorials View CoderAI source View screenshots Connect it to AISBF

Source-backed from the CoderAI repository README, reverse-proxy docs, broker reference, and runtime config defaults.

Jump to: Full docs · Install · Docker / OCI · Docker install bundle · Gallery · API · Broker · Market · What it is · Features · Bundled apps · AISBF broker · Install · API surface · Operations

See the operator UI.

The 10-image CoderAI gallery shows the model configuration workflow: model configuration, live task monitoring, HuggingFace model discovery, active downloads, and repository detail drawers.

Model configuration screenshots

Use the 10-screenshot gallery for a quick visual tour of the model-management interface before installing CoderAI locally or connecting it to AISBF.

Open CoderAI gallery

Local control, cloud-style API.

CoderAI is the local compute side of the stack: run models on your own GPUs, expose them through familiar HTTP routes, and optionally let AISBF treat that machine as a provider.

OpenAI compatible

Drop-in API shape

It exposes OpenAI-style endpoints including GET /v1/models, POST /v1/chat/completions, completions, embeddings, image, video, and audio routes.

Web Studio

Browser UI for generation

The built-in Studio covers chat, image, video, audio, pipelines, archive browsing, model settings, users, tokens, and admin controls.

GPU aware

NVIDIA, AMD, Intel paths

CUDA, Vulkan, Metal, GGUF, Transformers, stable-diffusion.cpp, whisper.cpp, and related runtimes are handled by platform-specific build paths.

Config first

No giant CLI ceremony

Runtime state lives under ~/.coderai/: config.json, models.json, auth.json, pipelines.json, archive, and generated secrets.

Built for more than chat.

The repository describes CoderAI as a multimodal orchestrator, not only an LLM server. The useful bit: one local control plane for many heavy generation jobs.

Text and embeddings

Chat completions, text completions, embeddings, streaming responses, prompt caching, tool calling, per-model context and quantization settings.

Images

Text-to-image, image editing, inpainting, upscaling, deblur, unpixelate, outfit change, face swap, depth, segmentation, and 2D/3D conversion.

Video

Text-to-video, image-to-video, video-to-video, frame interpolation, upscaling, subtitles, dubbing, lip sync via Wav2Lip or SadTalker, and 3D video processing.

Audio

Kokoro TTS, Whisper STT, MusicGen/AudioGen/AudioLDM2, F5-TTS voice cloning, Seed-VC voice conversion, and saved voice profiles.

Profiles

Character profiles and environment profiles preserve visual identity and scene style across image/video generation. The README documents up to six of each per generation.

Pipelines

Built-in and custom pipelines chain steps like text → image → video → TTS. Template variables include {{input}}, {{stepN.output}}, and {{stepN.url}}.

Bundled example applications.

CoderAI is not only an API server. The source tree includes three practical demo/example web applications under tools/ — video_editor.py, videogen.py, and gen_township_fighters.py — and the Docker / OCI image starts them behind the same published port.

tools/video_editor.py

Video editor

A browser-based editor backed by CoderAI TTS and local ffmpeg. Pick or upload video, scrub a timeline, generate narration, place clips at exact positions, add music, speed up regions, and render the final result.

OCI route: /editor/

tools/videogen.py

VideoGen studio

A small web app for managing character and environment profiles, assembling multi-clip short movies, and chaining image/video generation, speech, lip-sync, music, and sound effects through the CoderAI API.

OCI route: /videogen/

tools/gen_township_fighters.py

Township fighters demo

An example generator for fighter-match videos in an MMA-style workflow: generate characters, environments, and fight clips through CoderAI, then save the produced media under the container cache/output paths.

OCI route: /township/

The AISBF connection.

CoderAI does not need AISBF to work. Use CoderAI standalone for local generation, the browser UI, and direct OpenAI-compatible API calls. Add AISBF only when you want brokered routing, metering, clustering, or shared access to that local capacity.

CoderAI can also be a first-class AISBF provider. Direct HTTP works on a LAN/server. Broker mode is for machines behind NAT: CoderAI dials out, AISBF routes in.

AppOpenAI-style request

AISBFprovider routing

Broker WSSoutbound socket

CoderAIlocal GPU worker

Modelstext/media jobs

Broker config in CoderAI

{
  "broker": {
    "enabled": true,
    "base_url": "https://aisbf.cloud",
    "scope": "user",
    "username": "alice",
    "provider_id": "coderai-local",
    "client_id": "workstation-01",
    "registration_token": "your-provider-token",
    "transport": "websocket"
  }
}

AISBF provider shape

{
  "type": "coderai",
  "endpoint": "http://127.0.0.1:8776",
  "api_key_required": false,
  "coderai_config": {
    "broker_enabled": true,
    "broker_preferred": true,
    "client_id": "workstation-01",
    "registration_token": "provider-scoped-secret"
  }
}

Important broker rule

The broker identity must match on both sides. Keep provider_id, client_id, username, and registration_token stable. If AISBF sees the session but requests do not route, the first thing to check is usually a mismatched client_id or owner scope.

Sell access to local models through AISBF.

CoderAI broker mode is the local-model seller path: your machine dials AISBF, registers what it can run, and AISBF exposes that capacity through controlled provider routes.

Keep the GPU local

Run CoderAI on your workstation, home lab, private server, or rented GPU. The broker uses outbound WSS, so the worker does not need a public inbound port.

Advertise real capability

CoderAI can report hardware, endpoints, and model/media capabilities so AISBF can route buyers to the right worker.

Let AISBF manage access

AISBF sits in front for identity, tokens, quotas, routing, failover, and the market/access layer instead of exposing your raw CoderAI API.

AISBF market overview Broker protocol docs

Install and run.

The README’s quick start builds all backends, starts the server on localhost, then opens the admin UI. Default credentials are admin / admin and should be changed immediately.

Linux quick start

git clone git@git.nexlab.net:nexlab/coderai.git
cd coderai
./build.sh all
source venv_all/bin/activate
python coderai

Default server: http://127.0.0.1:8776

Container install: run the provided Docker / OCI image with coderai-docker and persistent /config, /models, and /cache volumes. Download the ready-to-use Docker install bundle →

Access points

/admin — admin dashboard
/chat — Web Studio
/v1/* — OpenAI-compatible API
/docs — interactive FastAPI docs

Linux CUDA/VulkanmacOS MetalWindows CUDA

API surface.

CoderAI presents a broad set of generation endpoints while keeping OpenAI-compatible routes for the common integration path.

Core OpenAI-style

GET /v1/models
POST /v1/chat/completions
POST /v1/completions
POST /v1/embeddings

Image and 3D

/v1/images/generations
/v1/images/edits
/v1/images/faceswap
/v1/3d/generate

Video

/v1/video/generations
/v1/video/dub
/v1/video/interpolate
/v1/video/to3d

Audio

/v1/audio/speech
/v1/audio/transcriptions
/v1/audio/clone
/v1/audio/convert

Operational shape.

CoderAI is designed like local infrastructure: model registry, request queueing, on-demand loading, offload rules, archive retention, auth, tokens, and reverse-proxy support.

Resource management

On-demand loading, idle unload, VRAM → RAM → disk offload, per-model settings, request queues, prompt cache reuse, and optional prompt aggregation.

Frontend/engine split

The config defaults include a thin frontend proxy plus one or more engine subprocesses so the UI can stay responsive while GPU workers load or generate.

Reverse proxy ready

The repo documents subdomain/root mounting and /coderai/ sub-path mounting using X-Forwarded-Prefix, long timeouts, large uploads, and disabled buffering for SSE.

Open source

CoderAI is GPLv3, authored by Stefy Lanza, with contributions via merge requests on Nexlab GitLab.

Read the docs Read the repository AISBF broker tutorial AISBF for developers