aliyun-qvq

$npx mdskill add cinience/alicloud-skills/aliyun-qvq

Performs visual reasoning using Alibaba Cloud Model Studio QVQ models for image and chart analysis

  • Solves math problems from screenshots and analyzes diagrams or charts
  • Uses Alibaba Cloud Model Studio QVQ models (qvq-plus, qvq-max)
  • Applies step-by-step reasoning grounded in visual evidence
  • Generates structured outputs for integration into workflows
SKILL.md
.github/skills/aliyun-qvqView on GitHub ↗
---
name: aliyun-qvq
description: Use when visual reasoning is needed with Alibaba Cloud Model Studio QVQ models, including step-by-step image reasoning, chart analysis, and visually grounded problem solving.
version: 1.0.0
---

Category: provider

# Model Studio QVQ Visual Reasoning

## Validation

```bash
mkdir -p output/aliyun-qvq
python -m py_compile skills/ai/multimodal/aliyun-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/aliyun-qvq/validate.txt
```

Pass criteria: command exits 0 and `output/aliyun-qvq/validate.txt` is generated.

## Critical model names

Use one of these exact model strings:
- `qvq-plus`
- `qvq-max`

## Typical use

- Mathematical reasoning from screenshots
- Diagram and chart reasoning
- Visually grounded multi-step problem solving

## Quick start

```bash
python skills/ai/multimodal/aliyun-qvq/scripts/prepare_qvq_request.py \
  --output output/aliyun-qvq/request.json
```

## Notes

- Use `skills/ai/multimodal/aliyun-qwen-vl/` for standard image understanding.
- Use QVQ when the task explicitly needs stronger reasoning over visual evidence.

## References

- `references/sources.md`
More from cinience/alicloud-skills