VoxServe is a serving system for Speech Language Models (SpeechLMs). It provides low-latency, high-throughput inference for language models that operate on speech tokens, specifically text-to-speech (TTS) and speech-to-speech (STS) models.
- [2025-02] We released our paper: "VoxServe: A Streaming-Centric Serving System for Speech Language Models"
You can install VoxServe via pip:

```bash
pip install vox-serve
vox-serve --model <model-name> --port <port-number>
```

Or, you can clone the repository and start the inference server via `launch.py`:
```bash
git clone https://github.com/vox-serve/vox-serve.git
cd vox-serve
python -m vox_serve.launch --model <model-name> --port <port-number>
```

Then call the server like this:
```bash
# Generate audio from text
curl -X POST "http://localhost:<port-number>/generate" -F "text=Hello world" -F "streaming=true" -o output.wav

# For models that support audio input
curl -X POST "http://localhost:<port-number>/generate" -F "text=Hello world" -F "audio=@input.wav" -F "streaming=true" -o output.wav
```

We currently support the following TTS and STS models:
- `chatterbox`: Chatterbox TTS
- `cosyvoice2`: CosyVoice2-0.5B
- `csm`: CSM-1B
- `orpheus`: Orpheus-3B
- `qwen3-tts`: Qwen3-TTS-1.7B (custom voice mode only; other modes under development)
- `zonos`: Zonos-v0.1
- `glm`: GLM-4-Voice-9B
- `step`: Step-Audio-2-Mini
We are actively working on expanding model support.
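For programmatic use, the curl calls above can be sketched as a small Python client. This is an illustration only: the `/generate` endpoint and the `text`/`streaming` form fields come from the examples above, while the helper names (`encode_multipart`, `stream_to_file`, `generate_speech`) and the chunk size are hypothetical, not part of the VoxServe API.

```python
# Minimal streaming client sketch using only the Python standard library.
# Endpoint and form fields follow the curl examples in this README; all
# helper names here are illustrative, not the VoxServe API.
import io
import urllib.request
import uuid

def encode_multipart(fields):
    """Encode text fields as a multipart/form-data body, mirroring
    curl's -F "name=value" options. Returns (body, content_type)."""
    boundary = uuid.uuid4().hex
    buf = io.BytesIO()
    for name, value in fields.items():
        buf.write(f"--{boundary}\r\n".encode())
        buf.write(f'Content-Disposition: form-data; name="{name}"\r\n\r\n'.encode())
        buf.write(value.encode() + b"\r\n")
    buf.write(f"--{boundary}--\r\n".encode())
    return buf.getvalue(), f"multipart/form-data; boundary={boundary}"

def stream_to_file(chunks, path):
    """Write byte chunks to `path` as they arrive; return bytes written."""
    total = 0
    with open(path, "wb") as f:
        for chunk in chunks:
            f.write(chunk)
            total += len(chunk)
    return total

def generate_speech(text, port=8000, out_path="output.wav", chunk_size=4096):
    """POST text to a local VoxServe server and stream the WAV response
    to disk, reading incrementally so a player could start consuming
    audio before generation finishes."""
    body, content_type = encode_multipart({"text": text, "streaming": "true"})
    req = urllib.request.Request(
        f"http://localhost:{port}/generate",
        data=body,
        headers={"Content-Type": content_type},
    )
    with urllib.request.urlopen(req) as resp:
        chunks = iter(lambda: resp.read(chunk_size), b"")
        return stream_to_file(chunks, out_path)
```

The incremental read matters for the streaming mode: rather than buffering the whole response, each chunk is written (or could be played) as soon as it arrives, which is what `streaming=true` is for.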
See the `./examples` folder for more usage examples.
