Feature: Basic Agentic Pick and Place - reimplemented manipulation skills #1237

mustafab0 · 2026-02-11T23:53:35Z

Problem

ManipulationModule had no skill layer — only low-level RPCs requiring a custom IPython client.
No way for an LLM agent to invoke pick/place/move skills through the framework.
RRT planner rejected valid joint states due to floating-point drift past URDF limits.

Solution

Added 11 @skill() methods to ManipulationModule (pick, place, move_to_pose, scan_objects, etc.).
Changed base class to SkillModule so skills auto-register with agents via autoconnect().
Created xarm-perception-agent blueprint composing xarm_perception + llm_agent + human_input.
Added detection snapshot so pick uses stable labels instead of volatile live cache.
Added limit_eps=1e-3 tolerance to RRT planner joint limit validation.
Removed ManipulationClient and run_* RPC wrappers — agent CLI replaces them.
CoordinatorClient updated to route execution through task_invoke instead of removed RPCs.

Breaking Changes

ManipulationClient deleted — use dimos run xarm-perception-agent or direct RPC.
run_pick/run_place/run_place_back/run_go_init RPCs removed from ManipulationModule.

How to Test

Run dimos run coordinator-mock then dimos run xarm-perception-agent
Type scan the scene — verify objects listed with 3D positions
Type pick up the <object> — verify approach, grasp, retract sequence
Type place it back — verify placement at original pick position
Verify skills appear in dimos agentspy

closes DIM-351
closes DIM-419

…lues have tiny errors that can get out of joint limit range

greptile-apps · 2026-02-12T01:10:49Z

Greptile Overview

Greptile Summary

This PR adds an agent-invokable skill layer to manipulation by converting ManipulationModule to a SkillModule and introducing a set of @skill() actions (scan, pick/place, move_to_pose/joints, gripper control, go_home/init). It also adds a new xarm-perception-agent blueprint composing perception + an LLM agent + human CLI input, updates coordinator client calls to route through ControlCoordinator.task_invoke, removes the legacy ManipulationClient example, and loosens RRT joint-limit validation with a small epsilon to avoid floating-point drift rejecting valid states.

The changes fit into the existing architecture by leaning on SkillModule’s auto-registration of skills into agents via autoconnect(), while continuing to use the existing planning stack (WorldMonitor + PlannerSpec) and execution through the ControlCoordinator trajectory task API (now accessed via task_invoke).

Confidence Score: 2/5

This PR is not safe to merge until coordinator status polling and grasp RPC wiring issues are fixed.
Core new functionality (agentic pick/place) depends on coordinator task polling and grasp generation wiring. The PR currently calls a non-existent trajectory-task method (get_state) and drops TrajectoryStatus results expecting dicts, which will cause skills/CLI to report success early or never show progress. The grasp path also calls an RPC method that is neither declared nor implemented (GraspingModule.get_latest_grasps), causing runtime failures on the intended GraspGen-success path.
dimos/manipulation/manipulation_module.py, dimos/manipulation/control/coordinator_client.py

Important Files Changed

Filename	Overview
dimos/manipulation/control/coordinator_client.py	Switched trajectory status/execute/cancel to ControlCoordinator.task_invoke; current status handling expects dict but trajectory tasks return TrajectoryStatus, breaking wait_for_completion/status.
dimos/manipulation/manipulation_blueprints.py	Adds xarm-perception-agent blueprint composed from xarm_perception + llm_agent + human_input and extends RobotModelConfig usage (home_joints). No correctness issues spotted in this file.
dimos/manipulation/manipulation_module.py	Rebased ManipulationModule onto SkillModule and added many @Skill methods; introduced broken coordinator status polling (uses get_state) and broken GraspingModule RPC integration (calls non-existent get_latest_grasps).
dimos/manipulation/planning/examples/init.py	Removes ManipulationClient export and replaces docstring with a generic planning examples blurb; no functional code remains.
dimos/manipulation/planning/examples/manipulation_client.py	Deletes obsolete IPython ManipulationClient RPC helper (breaking change as described). No remaining code to review.
dimos/manipulation/planning/planners/rrt_planner.py	Adds small epsilon tolerance when validating start/goal joint limits to reduce false rejections due to floating-point drift.
dimos/manipulation/planning/spec/config.py	Extends RobotModelConfig with home_joints and pre_grasp_offset fields used by new skills; schema change is straightforward.
dimos/robot/all_blueprints.py	Registers new xarm-perception-agent blueprint entry.
pyproject.toml	Adds manipulation_module.py to largefiles ignore list; no runtime behavior changes.

Sequence Diagram

sequenceDiagram
  autonumber
  participant User as Human user
  participant HI as human_input (CLI)
  participant Agent as llm_agent
  participant Skills as SkillCoordinator
  participant Manip as ManipulationModule (SkillModule)
  participant WM as WorldMonitor
  participant Planner as RRTConnectPlanner
  participant CC as ControlCoordinator
  participant Task as Trajectory Task (JointTrajectoryController)

  User->>HI: type "pick up the cup"
  HI->>Agent: user message
  Agent->>Skills: select skill + args
  Skills->>Manip: pick(object_name)
  Manip->>WM: refresh_obstacles(min_duration)
  WM-->>Manip: obstacles added
  Manip->>Manip: _generate_grasps_for_pick()
  alt GraspingModule RPC wired
    Manip->>Manip: get_rpc_calls("GraspingModule.generate_grasps")
    Manip-->>Manip: grasp candidates (intended)
  else fallback heuristic
    Manip->>WM: list_cached_detections()
    WM-->>Manip: detections snapshot
    Manip-->>Manip: heuristic grasp pose
  end

  Manip->>Planner: plan_joint_path(start, goal)
  Planner-->>Manip: JointPath
  Manip->>Manip: generate JointTrajectory
  Manip->>CC: task_invoke(task, "execute", {trajectory})
  CC->>Task: execute(trajectory)
  Task-->>CC: accepted
  CC-->>Manip: result
  loop wait for completion
    Manip->>CC: task_invoke(task, "get_status" or "get_state")
    CC->>Task: get_status()/get_state()
    Task-->>CC: TrajectoryStatus / None
    CC-->>Manip: status
  end
  Manip-->>Skills: streamed progress strings
  Skills-->>Agent: tool results
  Agent-->>HI: assistant response

greptile-apps

_{9 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

dimos/manipulation/control/coordinator_client.py

greptile-apps · 2026-02-12T01:10:59Z

dimos/manipulation/manipulation_module.py

+        try:
+            generate = self.get_rpc_calls("GraspingModule.generate_grasps")
+            result_str = generate(object_name, object_id, True)
+            # If generate_grasps returned actual PoseArray via the grasps port,
+            # we need to get the poses. For now, check if it returned an error string.
+            if isinstance(result_str, str) and "No" in result_str:
+                logger.info(f"GraspGen returned: {result_str}, falling back to heuristic")
+            else:
+                # GraspGen succeeded — get poses from the grasps port or RPC
+                logger.info(f"GraspGen result: {result_str}")
+                # Try to get the grasp poses via RPC
+                try:
+                    get_grasps = self.get_rpc_calls("GraspingModule.get_latest_grasps")
+                    grasp_poses: PoseArray | None = get_grasps()
+                    if grasp_poses and len(grasp_poses.poses) > 0:
+                        return list(grasp_poses.poses)
+                except Exception:


Invalid Grasping RPC usage

_generate_grasps_for_pick() calls self.get_rpc_calls("GraspingModule.get_latest_grasps"), but rpc_calls only declares GraspingModule.generate_grasps (manipulation_module.py:120-123) and GraspingModule doesn’t implement any get_latest_grasps RPC (it publishes to an Out[PoseArray] port instead; see dimos/manipulation/grasping/grasping.py:37-106). This will raise ValueError at runtime on the “GraspGen succeeded” path.

greptile-apps · 2026-02-12T01:11:03Z

Additional Comments (1)

dimos/manipulation/manipulation_module.py
Polling wrong task method

This switched to client.task_invoke(..., "get_state", {}), but the trajectory task implementation exposes get_status() (returning a TrajectoryStatus) and does not implement get_state (see dimos/manipulation/control/trajectory_controller/joint_trajectory_controller.py:162-273). As written, task_invoke will return None and this method returns None, which also breaks _wait_for_trajectory_completion() (it treats None as “task not found” and returns success early at manipulation_module.py:1001-1003).

…tegrated

mustafab0 added 7 commits February 11, 2026 15:32

TF support on manipulation module and Object Input topic support

40da623

blueprint added for xarm7 and realsense robot

b72b11f

updated rrt planner to have error margins since real world encoder va…

3f008f5

…lues have tiny errors that can get out of joint limit range

coordinator client updated to use task invoke

21a6a29

implemented pick and place skill with agentic pipeline

d250b21

added xarm-perception-agent blueprint

cc29e83

deprecated manipulation client

550f4ac

mustafab0 requested review from JalajShuklaSS, alexlin2, leshy, paul-nechifor and spomichter February 11, 2026 23:53

greptile-apps bot reviewed Feb 12, 2026

View reviewed changes

mustafab0 changed the title ~~Feature: Basic Agentic Pick and Place - reimplimented manipulation skills~~ Feature: Basic Agentic Pick and Place - reimplemented manipulation skills Feb 12, 2026

mustafab0 added 2 commits February 11, 2026 17:34

typo — get_status should be get_state. Fixed

697bc6e

removed PoseArray which was unused as graspgen module is yet to be in…

2a69411

…tegrated

This was linked to issues Feb 12, 2026

Integrate agent with Manipulation module #1184

Open

Implement Manipulation Skills #1131

Open

Reimplemnt manipulation skills #1011

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: Basic Agentic Pick and Place - reimplemented manipulation skills #1237

Feature: Basic Agentic Pick and Place - reimplemented manipulation skills #1237

Uh oh!

mustafab0 commented Feb 11, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Feb 12, 2026

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

greptile-apps bot Feb 12, 2026

Uh oh!

greptile-apps bot commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Feature: Basic Agentic Pick and Place - reimplemented manipulation skills #1237

Are you sure you want to change the base?

Feature: Basic Agentic Pick and Place - reimplemented manipulation skills #1237

Uh oh!

Conversation

mustafab0 commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Breaking Changes

How to Test

Uh oh!

greptile-apps bot commented Feb 12, 2026

Greptile Overview

Greptile Summary

Confidence Score: 2/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

greptile-apps bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mustafab0 commented Feb 11, 2026 •

edited

Loading