aliyun-qwen-livetranslate

Name: aliyun-qwen-livetranslate
Author: cinience/alicloud-skills

$ npx mdskill add cinience/alicloud-skills/aliyun-qwen-livetranslate

Enables live speech translation using Alibaba Cloud Qwen LiveTranslate models

- Solves real-time translation needs for meetings, interpretation, and captioning.
- Leverages Alibaba Cloud Model Studio Qwen LiveTranslate APIs for translation.
- Uses specified source and target languages with optional audio format settings.
- Returns translated text and optional source text or audio output to the user.

SKILL.md

.github/skills/aliyun-qwen-livetranslate

---
name: aliyun-qwen-livetranslate
description: Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation, and speech-to-speech or speech-to-text translation flows.
version: 1.0.0
---

Category: provider

# Model Studio Qwen LiveTranslate

## Validation

```bash
mkdir -p output/aliyun-qwen-livetranslate
python -m py_compile skills/ai/audio/aliyun-qwen-livetranslate/scripts/prepare_livetranslate_request.py && echo "py_compile_ok" > output/aliyun-qwen-livetranslate/validate.txt
```

Pass criteria: command exits 0 and `output/aliyun-qwen-livetranslate/validate.txt` is generated.
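
The pass criteria can also be checked from Python. The following is a minimal sketch, not part of the skill: the helper name `validation_passed` is hypothetical, and checking for the `py_compile_ok` marker is slightly stricter than the stated criterion that the file merely exists.

```python
from pathlib import Path

# Marker string echoed by the validation command on success.
MARKER = "py_compile_ok"


def validation_passed(
    evidence_path: str = "output/aliyun-qwen-livetranslate/validate.txt",
) -> bool:
    """Return True if the evidence file exists and contains the marker."""
    path = Path(evidence_path)
    return path.is_file() and MARKER in path.read_text(encoding="utf-8")


if __name__ == "__main__":
    print("validation passed" if validation_passed() else "validation missing")
```

Because the shell command only writes the file after `py_compile` succeeds, a file left over from an earlier run would still satisfy this check; delete it before re-validating if that matters.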

## Output And Evidence

- Save translation session payloads and response summaries under `output/aliyun-qwen-livetranslate/`.
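
One way to follow this convention is a small helper that writes each payload or summary as timestamped JSON. This is a sketch under assumptions: the function name `save_evidence` and the `<timestamp>-<name>.json` file naming are illustrative and are not prescribed by the skill.

```python
import json
import time
from pathlib import Path

# Evidence directory named in this skill.
OUTPUT_DIR = Path("output/aliyun-qwen-livetranslate")


def save_evidence(name: str, payload: dict, output_dir: Path = OUTPUT_DIR) -> Path:
    """Write a payload or response summary as JSON and return the file path."""
    output_dir.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%dT%H%M%S")  # local time, second resolution
    path = output_dir / f"{stamp}-{name}.json"
    # ensure_ascii=False keeps non-ASCII source text (e.g. Chinese) readable.
    path.write_text(
        json.dumps(payload, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return path
```

For example, `save_evidence("request", {"source_language": "zh", "target_language": "en"})` writes one file under the evidence directory and returns its path.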

## Critical model names

Use one of these exact model strings:
- `qwen3-livetranslate-flash`
- `qwen3-livetranslate-flash-realtime`

## Typical use

- Chinese/English meeting interpretation
- Live subtitles in another language
- Call-center agent assist with translated captions

## Normalized interface (audio.livetranslate)

### Request
- `model` (string, optional): default `qwen3-livetranslate-flash`
- `source_language` (string, required)
- `target_language` (string, required)
- `audio_format` (string, optional): e.g. `pcm`
- `sample_rate` (int, optional): e.g. `16000`

### Response
- `translated_text` (string)
- `source_text` (string, optional)
- `audio_url` or `audio_chunk` (optional, model dependent)
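
The request fields above can be sketched as a Python dataclass. This illustrates the normalized interface only; it is not the Model Studio SDK and not necessarily how `prepare_livetranslate_request.py` is written. The names `LiveTranslateRequest`, `ALLOWED_MODELS`, and `to_payload` are hypothetical.

```python
from dataclasses import asdict, dataclass
from typing import Optional

# Exact model strings listed under "Critical model names".
ALLOWED_MODELS = (
    "qwen3-livetranslate-flash",
    "qwen3-livetranslate-flash-realtime",
)


@dataclass
class LiveTranslateRequest:
    source_language: str                      # required
    target_language: str                      # required
    model: str = "qwen3-livetranslate-flash"  # optional, documented default
    audio_format: Optional[str] = None        # optional, e.g. "pcm"
    sample_rate: Optional[int] = None         # optional, e.g. 16000

    def __post_init__(self) -> None:
        if self.model not in ALLOWED_MODELS:
            raise ValueError(f"unknown model: {self.model!r}")
        if not self.source_language or not self.target_language:
            raise ValueError("source_language and target_language are required")

    def to_payload(self) -> dict:
        """Return the request as a dict, omitting unset optional fields."""
        return {k: v for k, v in asdict(self).items() if v is not None}
```

Calling `LiveTranslateRequest(source_language="zh", target_language="en").to_payload()` yields only the two language fields plus the default model, since the unset audio options are dropped.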

## Quick start

```bash
python skills/ai/audio/aliyun-qwen-livetranslate/scripts/prepare_livetranslate_request.py \
  --source-language zh \
  --target-language en \
  --output output/aliyun-qwen-livetranslate/request.json
```

## Notes

- Prefer `qwen3-livetranslate-flash-realtime` for continuous streaming sessions.
- Prefer `qwen3-livetranslate-flash` (non-realtime) for simpler integration and lower client complexity.
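
The two preferences above reduce to a single decision. A minimal sketch, assuming the only deciding factor is whether the session streams continuously; the function name `choose_model` is hypothetical.

```python
def choose_model(continuous_streaming: bool) -> str:
    """Pick a LiveTranslate model string following the notes above."""
    if continuous_streaming:
        # Continuous streaming sessions: prefer the realtime model.
        return "qwen3-livetranslate-flash-realtime"
    # Otherwise prefer the simpler non-realtime flash model.
    return "qwen3-livetranslate-flash"
```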

## References

- `references/sources.md`