aliyun-qwen-multimodal-embedding
$
npx mdskill add cinience/alicloud-skills/aliyun-qwen-multimodal-embeddingGenerates multimodal embeddings for text, image, and video using Alibaba Cloud Model Studio.
- Solves tasks requiring cross-modal similarity, retrieval, or clustering.
- Uses Alibaba Cloud Model Studio APIs like `qwen3-vl-embedding`.
- Chooses model based on task needs, like latest or compatible versions.
- Returns normalized embeddings and metadata for downstream workflows.
SKILL.md
.github/skills/aliyun-qwen-multimodal-embeddingView on GitHub ↗
--- name: aliyun-qwen-multimodal-embedding description: Use when multimodal embeddings are needed from Alibaba Cloud Model Studio models such as `qwen3-vl-embedding` for image, video, and text retrieval, cross-modal search, clustering, or offline vectorization pipelines. version: 1.0.0 --- Category: provider # Model Studio Multimodal Embedding ## Validation ```bash mkdir -p output/aliyun-qwen-multimodal-embedding python -m py_compile skills/ai/search/aliyun-qwen-multimodal-embedding/scripts/prepare_multimodal_embedding_request.py && echo "py_compile_ok" > output/aliyun-qwen-multimodal-embedding/validate.txt ``` Pass criteria: command exits 0 and `output/aliyun-qwen-multimodal-embedding/validate.txt` is generated. ## Output And Evidence - Save normalized request payloads, selected dimensions, and sample input references under `output/aliyun-qwen-multimodal-embedding/`. - Record the exact model, modality mix, and output vector dimension for reproducibility. Use this skill when the task needs text, image, or video embeddings from Model Studio for retrieval or similarity workflows. ## Critical model names Use one of these exact model strings as needed: - `qwen3-vl-embedding` - `qwen2.5-vl-embedding` - `tongyi-embedding-vision-plus-2026-03-06` Selection guidance: - Prefer `qwen3-vl-embedding` for the newest multimodal embedding path. - Use `qwen2.5-vl-embedding` when you need compatibility with an older deployed pipeline. ## Prerequisites - Set `DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`. - Pair this skill with a vector store such as DashVector, OpenSearch, or Milvus when building retrieval systems. ## Normalized interface (embedding.multimodal) ### Request - `model` (string, optional): default `qwen3-vl-embedding` - `texts` (array<string>, optional) - `images` (array<string>, optional): public URLs or local paths uploaded by your client layer - `videos` (array<string>, optional): public URLs where supported - `dimension` (int, optional): e.g. `2560`, `2048`, `1536`, `1024`, `768`, `512`, `256` for `qwen3-vl-embedding` ### Response - `embeddings` (array<object>) - `dimension` (int) - `usage` (object, optional) ## Quick start ```bash python skills/ai/search/aliyun-qwen-multimodal-embedding/scripts/prepare_multimodal_embedding_request.py \ --text "A cat sitting on a red chair" \ --image "https://example.com/cat.jpg" \ --dimension 1024 ``` ## Operational guidance - Keep `input.contents` as an array; malformed shapes are a common 400 cause. - Pin the output dimension to match your index schema before writing vectors. - Use the same model and dimension across one vector index to avoid mixed-vector incompatibility. - For large image or video batches, stage files in object storage and reference stable URLs. ## Output location - Default output: `output/aliyun-qwen-multimodal-embedding/request.json` - Override base dir with `OUTPUT_DIR`. ## References - `references/sources.md`
More from cinience/alicloud-skills
- aliyun-adb-mysqlUse when managing Alibaba Cloud AnalyticDB for MySQL (ADB) via OpenAPI/SDK, including the user needs AnalyticDB resource lifecycle and configuration operations, status checks, or troubleshooting ADB API and cluster workflow issues.
- aliyun-adb-mysql-testSmoke test for aliyun-adb-mysql. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-aicontent-generateUse when managing Alibaba Cloud AIContent (AiContent) via OpenAPI/SDK, including the user needs AI content generation or content workflow operations in Alibaba Cloud, including listing assets, creating/updating generation configurations, checking task status, or troubleshooting failed content jobs.
- aliyun-aicontent-generate-testSmoke test for aliyun-aicontent-generate. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-aimiaobi-generateUse when managing Alibaba Cloud Quan Miao (AiMiaoBi) via OpenAPI/SDK, including the user asks for Alibaba Cloud MiaoBi content operations, including listing resources, creating/updating configurations, querying runtime status, and diagnosing API or workflow failures.
- aliyun-aimiaobi-generate-testSmoke test for aliyun-aimiaobi-generate. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-airec-manageUse when managing Alibaba Cloud AIRec (Airec) via OpenAPI/SDK, including the user needs recommendation-engine resource operations in Alibaba Cloud, including list/create/update flows, status inspection, and troubleshooting AIRec configuration or runtime issues.
- aliyun-airec-manage-testSmoke test for aliyun-airec-manage. Validate minimal authentication, API reachability, and one read-only query path.
- aliyun-alb-manageUse when managing and troubleshoot Alibaba Cloud ALB (Application Load Balancer), including the user asks to inspect, create, change, or debug ALB instances, listeners, server groups, rules, certificates, ACLs, security policies, or health checks in Alibaba Cloud.
- aliyun-alb-manage-testSmoke test for Alibaba Cloud ALB skill. Validates SDK auth, script compilation, list instances, and health check flows.