aliyun-qwen-tts-realtime
$
npx mdskill add cinience/alicloud-skills/aliyun-qwen-tts-realtimeProvides real-time speech synthesis using Alibaba Cloud Qwen TTS Realtime models
- Solves the need for low-latency, interactive speech synthesis in applications
- Depends on Alibaba Cloud Model Studio and DashScope SDK for model access
- Chooses from specific Qwen TTS Realtime models based on input requirements
- Returns audio as base64-encoded PCM chunks for immediate playback
SKILL.md
.github/skills/aliyun-qwen-tts-realtimeView on GitHub ↗
--- name: aliyun-qwen-tts-realtime description: Use when real-time speech synthesis is needed with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis. version: 1.0.0 --- Category: provider # Model Studio Qwen TTS Realtime Use realtime TTS models for low-latency streaming speech output. ## Critical model names Use one of these exact model strings: - `qwen3-tts-flash-realtime` - `qwen3-tts-instruct-flash-realtime` - `qwen3-tts-instruct-flash-realtime-2026-01-22` - `qwen3-tts-vd-realtime-2026-01-15` - `qwen3-tts-vc-realtime-2026-01-15` ## Prerequisites - Install SDK in a virtual environment: ```bash python3 -m venv .venv . .venv/bin/activate python -m pip install dashscope ``` - Set `DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`. ## Normalized interface (tts.realtime) ### Request - `text` (string, required) - `voice` (string, required) - `instruction` (string, optional) - `sample_rate` (int, optional) ### Response - `audio_base64_pcm_chunks` (array<string>) - `sample_rate` (int) - `finish_reason` (string) ## Operational guidance - Use websocket or streaming endpoint for realtime mode. - Keep each utterance short for lower latency. - For instruction models, keep instruction explicit and concise. - Some SDK/runtime combinations may reject realtime model calls over `MultiModalConversation`; use the probe script below to verify compatibility. ## Local demo script Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fallback to a non-realtime model for immediate output: ```bash .venv/bin/python skills/ai/audio/aliyun-qwen-tts-realtime/scripts/realtime_tts_demo.py \ --text "This is a realtime speech demo." \ --fallback \ --output output/ai-audio-tts-realtime/audio/fallback-demo.wav ``` Strict mode (for CI / gating): ```bash .venv/bin/python skills/ai/audio/aliyun-qwen-tts-realtime/scripts/realtime_tts_demo.py \ --text "realtime health check" \ --strict ``` ## Output location - Default output: `output/ai-audio-tts-realtime/audio/` - Override base dir with `OUTPUT_DIR`. ## Validation ```bash mkdir -p output/aliyun-qwen-tts-realtime for f in skills/ai/audio/aliyun-qwen-tts-realtime/scripts/*.py; do python3 -m py_compile "$f" done echo "py_compile_ok" > output/aliyun-qwen-tts-realtime/validate.txt ``` Pass criteria: command exits 0 and `output/aliyun-qwen-tts-realtime/validate.txt` is generated. ## Output And Evidence - Save artifacts, command outputs, and API response summaries under `output/aliyun-qwen-tts-realtime/`. - Include key parameters (region/resource id/time range) in evidence files for reproducibility. ## Workflow 1) Confirm user intent, region, identifiers, and whether the operation is read-only or mutating. 2) Run one minimal read-only query first to verify connectivity and permissions. 3) Execute the target operation with explicit parameters and bounded scope. 4) Verify results and save output/evidence files. ## References - `references/sources.md`
More from cinience/alicloud-skills
- aliyun-adb-mysqlUse when managing Alibaba Cloud AnalyticDB for MySQL (ADB) via OpenAPI/SDK, including the user needs AnalyticDB resource lifecycle and configuration operations, status checks, or troubleshooting ADB API and cluster workflow issues.
- aliyun-adb-mysql-testSmoke test for aliyun-adb-mysql. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-aicontent-generateUse when managing Alibaba Cloud AIContent (AiContent) via OpenAPI/SDK, including the user needs AI content generation or content workflow operations in Alibaba Cloud, including listing assets, creating/updating generation configurations, checking task status, or troubleshooting failed content jobs.
- aliyun-aicontent-generate-testSmoke test for aliyun-aicontent-generate. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-aimiaobi-generateUse when managing Alibaba Cloud Quan Miao (AiMiaoBi) via OpenAPI/SDK, including the user asks for Alibaba Cloud MiaoBi content operations, including listing resources, creating/updating configurations, querying runtime status, and diagnosing API or workflow failures.
- aliyun-aimiaobi-generate-testSmoke test for aliyun-aimiaobi-generate. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-airec-manageUse when managing Alibaba Cloud AIRec (Airec) via OpenAPI/SDK, including the user needs recommendation-engine resource operations in Alibaba Cloud, including list/create/update flows, status inspection, and troubleshooting AIRec configuration or runtime issues.
- aliyun-airec-manage-testSmoke test for aliyun-airec-manage. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-alb-manageUse when managing and troubleshoot Alibaba Cloud ALB (Application Load Balancer), including the user asks to inspect, create, change, or debug ALB instances, listeners, server groups, rules, certificates, ACLs, security policies, or health checks in Alibaba Cloud.
- aliyun-alb-manage-testSmoke test for Alibaba Cloud ALB skill. Validates SDK auth, script compilation, list instances, and health check flows.