scienceworld-room-scanner

$npx mdskill add zjunlp/SkillNet/scienceworld-room-scanner

Scan the current environment to generate a detailed inventory of visible objects, containers, and features.

  • Determining item locations or assessing the current physical state of a location.
  • Integrates with the environment's core spatial awareness and object detection APIs.
  • Executes upon entering a new area or when environmental status needs verification.
  • Delivers a structured observation detailing room contents, including nested container inventories.
SKILL.md
.github/skills/scienceworld-room-scannerView on GitHub ↗
---
name: scienceworld-room-scanner
description: This skill performs a 'look around' action to scan and describe the current room's contents, including visible objects, containers, and doors. Use when entering a new room or when the agent needs to locate specific items or assess the environment state. The skill outputs a detailed room description, which is essential for inventory discovery and situational awareness.
---
# Skill: Room Scanner

## Purpose
Execute a `look around` action to obtain a comprehensive description of the current room in the ScienceWorld environment. This description is the foundational step for any task requiring item location, environmental assessment, or navigation planning.

## Core Instruction
When this skill is invoked, the agent must perform the **`look around`** action.

## Trigger Conditions
Invoke this skill when:
1.  You first enter a new room via `teleport` or other movement.
2.  You need to locate a specific object or container mentioned in your task.
3.  The state of the room may have changed (e.g., after an interaction).
4.  You are formulating a plan and require an inventory of available resources.

## Output Processing
The observation from `look around` will contain:
*   **Room Name:** The identifier of your current location.
*   **Visible Objects & Agents:** A list of all entities in the room.
*   **Container Contents:** For open containers, a nested list of items inside (e.g., `a bowl (containing a red apple, a banana)`).
*   **Device States:** The status of interactive objects (e.g., `a stove, which is turned off`).
*   **Connections:** All accessible doors and their destination rooms.

**You must parse this output carefully.** Use it to update your mental model of the environment before proceeding with other actions like `pick up`, `examine`, or `use`.

## Integration Notes
*   This is a low-level, atomic skill. It should often be the first action in a sequence.
*   The observation it generates is critical context for subsequent decision-making. Refer back to it.
*   Do not overuse it. Once you have a recent description of a room, rely on that knowledge until you have reason to believe the state has changed.

## Example
**Task:** Survey the workshop after teleporting there.
1. `look around`
2. Observation: "This room is called the workshop. In it, you see: a table. On the table is: a battery, a blue light bulb, an orange wire, a yellow wire, a green wire. You also see: a blue box, an orange box. There is a door to the hallway."
3. Parse: available components include battery, light bulb, three wires; classification containers are blue box and orange box.
More from zjunlp/SkillNet