scienceworld-room-scanner
$
npx mdskill add zjunlp/SkillNet/scienceworld-room-scannerScan the current environment to generate a detailed inventory of visible objects, containers, and features.
- Determining item locations or assessing the current physical state of a location.
- Integrates with the environment's core spatial awareness and object detection APIs.
- Executes upon entering a new area or when environmental status needs verification.
- Delivers a structured observation detailing room contents, including nested container inventories.
SKILL.md
.github/skills/scienceworld-room-scannerView on GitHub ↗
--- name: scienceworld-room-scanner description: This skill performs a 'look around' action to scan and describe the current room's contents, including visible objects, containers, and doors. Use when entering a new room or when the agent needs to locate specific items or assess the environment state. The skill outputs a detailed room description, which is essential for inventory discovery and situational awareness. --- # Skill: Room Scanner ## Purpose Execute a `look around` action to obtain a comprehensive description of the current room in the ScienceWorld environment. This description is the foundational step for any task requiring item location, environmental assessment, or navigation planning. ## Core Instruction When this skill is invoked, the agent must perform the **`look around`** action. ## Trigger Conditions Invoke this skill when: 1. You first enter a new room via `teleport` or other movement. 2. You need to locate a specific object or container mentioned in your task. 3. The state of the room may have changed (e.g., after an interaction). 4. You are formulating a plan and require an inventory of available resources. ## Output Processing The observation from `look around` will contain: * **Room Name:** The identifier of your current location. * **Visible Objects & Agents:** A list of all entities in the room. * **Container Contents:** For open containers, a nested list of items inside (e.g., `a bowl (containing a red apple, a banana)`). * **Device States:** The status of interactive objects (e.g., `a stove, which is turned off`). * **Connections:** All accessible doors and their destination rooms. **You must parse this output carefully.** Use it to update your mental model of the environment before proceeding with other actions like `pick up`, `examine`, or `use`. ## Integration Notes * This is a low-level, atomic skill. It should often be the first action in a sequence. * The observation it generates is critical context for subsequent decision-making. Refer back to it. * Do not overuse it. Once you have a recent description of a room, rely on that knowledge until you have reason to believe the state has changed. ## Example **Task:** Survey the workshop after teleporting there. 1. `look around` 2. Observation: "This room is called the workshop. In it, you see: a table. On the table is: a battery, a blue light bulb, an orange wire, a yellow wire, a green wire. You also see: a blue box, an orange box. There is a door to the hallway." 3. Parse: available components include battery, light bulb, three wires; classification containers are blue box and orange box.
More from zjunlp/SkillNet
- alfworld-appliance-navigatorNavigates the agent to a target appliance (microwave, stove, fridge, or sinkbasin) needed for object processing. Use when you are holding an object that needs heating, cooling, or cleaning and must move to the correct appliance station. Identifies the required appliance from the task context and executes the movement action.
- alfworld-appliance-preparerPrepares a household appliance (microwave, oven, toaster, fridge) for use by ensuring it is in the correct open/closed state. Use when the agent needs to heat, cool, or cook an item and must first open or close the appliance before placing an object inside. Takes an appliance identifier as input and outputs a confirmation that the appliance is ready for the next action.
- alfworld-clean-objectCleans a specified object using an appropriate cleaning receptacle (e.g., sinkbasin). Use when a task requires an object to be in a clean state (e.g., "clean potato", "wash apple") before proceeding. Navigates to the cleaning location, performs the clean action, and confirms the object is now clean.
- alfworld-device-operatorOperates a device or appliance (like a desklamp, microwave, or fridge) to interact with another object. Use when the task requires using a tool on a target item (e.g., "look at laptop under the desklamp", "heat potato with microwave"). Locates both the device and target object, co-locates them, and executes the appropriate use action (toggle, heat, cool, or clean).
- alfworld-environment-scannerPerforms an initial scan of the ALFWorld environment to identify all visible objects and receptacles. Use when you first enter an environment and need to build a mental map for task planning. Processes raw observation text into a structured list of entities, categorizing them as objects or receptacles.
- alfworld-goal-interpreterParses the natural language task goal to extract actionable sub-objectives and required objects. Trigger this skill whenever a new task is assigned to break down complex instructions into clear, sequential targets. It interprets phrases like 'look at X under Y' to identify target objects (pillow), reference objects (desklamp), and spatial relationships (under).
- alfworld-heat-object-with-applianceUses a heating appliance (microwave, stoveburner, oven) to apply heat to a specified object. Use when the task requires warming or cooking an item (e.g., "heat some egg", "warm the mug") and a heating appliance is available. Takes the object name and appliance name as input and outputs the object in a heated state, ready for placement at the task's target location.
- alfworld-inventory-managementUse when the agent must collect and track multiple instances of the same object type in ALFWorld (e.g., "put two cellphone in bed"). This skill maintains a count of collected versus needed objects, guides systematic searching through receptacles, and ensures each found object is placed at the target before searching for the next.
- alfworld-locate-target-objectNavigates to a suspected location and identifies a target object. Use when your goal requires finding a specific object (e.g., "potato", "plate") and its location is not immediately known. Moves to a relevant receptacle (like a fridge or cabinet), checks its contents, and outputs the object's location or confirms its absence.
- alfworld-location-navigatorMoves the agent to a specified receptacle or object location within the Alfworld environment. Use this skill when the agent needs to physically approach a target to inspect or interact with it, such as when checking an object's state or preparing for pickup. The skill takes a target location name as input and executes the 'go to' action, resulting in the agent being positioned at the destination for subsequent operations.