Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints

Updated: Jun 7, 2026 by Qiwei Du

Introduction

Long-horizon task planning becomes expensive when robots must reason over many objects and complex logical constraints, such as object affordances, spatial relationships, and sequential dependencies. Previous neuro-symbolic planners reduce this search space by predicting which objects matter, but they train the scorer using full-space plans while deploying it inside its own pruned search spaces. This mismatch means a scoring error can remove critical objects or keep irrelevant ones, making the simplified planning problem unsolvable or unnecessarily large.

To address this, our method trains the scorer online from planner feedback, which improves both planning efficiency and robustness on benchmark tasks. For brevity, we refer to our method as iFlax, highlighting our contribution in stabilizing the imperative learning process for task planning and in addressing Flax’s exposure bias (the mismatch between training and deployment described above) under complex logical constraints. We also validate the deployability of iFlax on a quadruped-based mobile manipulator (Spot) in both simulation and the real world.

Method Overview

iFlax formulates object-importance learning as a bilevel optimization problem: Given a PDDL task and its relational graph, a neural network predicts object-importance scores. The lower-level planner operates in the score-pruned search space and returns a feasible plan, providing adaptive pseudo-supervision for updating the neural scorer.
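
The loop described above can be illustrated with a toy sketch. It is not the iFlax implementation: the scores are a plain dictionary rather than the output of a neural network, `toy_planner` is a hypothetical stand-in for a PDDL planner, and both the threshold and the labeling rule (objects used by the returned plan become positive labels) are assumptions made for illustration.

```python
from typing import Dict, Optional, Set


def prune_objects(scores: Dict[str, float], threshold: float) -> Set[str]:
    """Keep only objects whose predicted importance reaches the threshold."""
    return {obj for obj, score in scores.items() if score >= threshold}


def toy_planner(active: Set[str], required: Set[str]) -> Optional[Set[str]]:
    """Hypothetical planner: succeeds only if every required object is still active.

    On success it returns the set of objects the plan uses; otherwise None.
    """
    return set(required) if required <= active else None


def pseudo_labels(all_objects: Set[str], plan_objects: Set[str]) -> Dict[str, float]:
    """Assumed labeling rule: objects used by the plan get 1.0, all others 0.0."""
    return {obj: (1.0 if obj in plan_objects else 0.0) for obj in all_objects}


# Illustrative scores and requirements (made up for this sketch).
scores = {"robot": 0.9, "goal_cell": 0.8, "box_a": 0.6, "box_b": 0.1}
required = {"robot", "goal_cell", "box_a"}

active = prune_objects(scores, threshold=0.5)
used = toy_planner(active, required)
labels = pseudo_labels(set(scores), used) if used is not None else None
```

With a threshold of 0.5, `box_b` is pruned, the toy planner still succeeds, and `labels` marks `box_b` as 0.0. Raising the threshold to 0.7 would also prune `box_a`, and the toy planner would return `None`, which corresponds to the case where a scoring error removes a critical object.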

To stabilize this loop, iFlax uses a parallel 3R strategy: Repair recovers missing critical objects, Restart rebuilds a cleaner active set, and Rollback re-expands cautiously after an overly large expansion step.

Experiments

iFlax is evaluated on three challenging benchmarks: MazeNamo, SokoMindPlus, and LogisticsPlus. These test dense obstacle rearrangement, irreversible push dependencies, and transport dependencies with resource constraints, respectively. On MazeNamo, iFlax reduces the average failure rate by 80.04% and the weighted planning time by 57.14% compared with the prior state-of-the-art method, Flax.

Simulation

In Isaac Sim, iFlax is deployed on a quadruped-based mobile manipulator in MazeNamo-style tasks with additional robot-specific logical constraints. Movable obstacles include heavy boxes, tall containers, and short containers. Containers can only be placed on the ground, the robot must stand to pick tall containers, and it must sit to pick short containers.
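
The three robot-specific rules above can be written as simple precondition checks. This is a minimal sketch, not the PDDL domain used in the experiments: the names `Posture`, `Kind`, `can_pick`, and `can_place` are hypothetical, and the behavior for heavy boxes is an assumption because the text gives no pick or place rule for them.

```python
from enum import Enum


class Posture(Enum):
    STANDING = "standing"
    SITTING = "sitting"


class Kind(Enum):
    HEAVY_BOX = "heavy_box"
    TALL_CONTAINER = "tall_container"
    SHORT_CONTAINER = "short_container"


# Posture required to pick each container type, as stated in the text.
REQUIRED_PICK_POSTURE = {
    Kind.TALL_CONTAINER: Posture.STANDING,
    Kind.SHORT_CONTAINER: Posture.SITTING,
}


def can_pick(kind: Kind, posture: Posture) -> bool:
    """True if the robot's posture matches the one required for this container.

    Heavy boxes have no pick rule in the text, so this sketch rejects them
    (assumption).
    """
    required = REQUIRED_PICK_POSTURE.get(kind)
    return required is not None and required == posture


def can_place(kind: Kind, surface: str) -> bool:
    """Containers may only be placed on the ground."""
    if kind in (Kind.TALL_CONTAINER, Kind.SHORT_CONTAINER):
        return surface == "ground"
    return False  # placing other kinds is not described; assumed unsupported
```

For example, `can_pick(Kind.TALL_CONTAINER, Posture.SITTING)` is `False`, so a plan would need a posture change before the pick. This is the kind of sequential dependency that these constraints add to the planning problem.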

Real-World Experiments

In the real world, iFlax runs on a Spot quadruped equipped with a Jetson AGX Orin computing board, an AgileX Piper manipulator, and a wrist-mounted RealSense D435i. The system builds a symbolic planning problem from sensed geometry, solves it with iFlax, and executes the returned high-level plan with grounded skills for navigation, container pickup and placement, bottle pickup and placement, and box pushing.
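
Executing a high-level plan with grounded skills can be pictured as a dispatch table from action names to skill functions. The sketch below is only an illustration under stated assumptions: the action names (`navigate`, `push_box`, and so on), the plan format, and the placeholder skills that merely log their calls are all hypothetical and do not reflect the actual robot software.

```python
from typing import Callable, Dict, List, Tuple

Step = Tuple[str, ...]  # (action_name, *arguments)


def make_skill(name: str, log: List[str]) -> Callable[..., None]:
    """Create a placeholder skill that only records the call.

    A real skill would command the robot instead of appending to a log.
    """
    def skill(*args: str) -> None:
        log.append(f"{name}({', '.join(args)})")
    return skill


def execute_plan(plan: List[Step], skills: Dict[str, Callable[..., None]]) -> None:
    """Run each plan step through its skill, failing loudly on unknown actions."""
    for action, *args in plan:
        if action not in skills:
            raise KeyError(f"no grounded skill for action '{action}'")
        skills[action](*args)


log: List[str] = []
# Hypothetical skill names mirroring the skill families listed in the text.
skills = {
    name: make_skill(name, log)
    for name in ("navigate", "pick_container", "place_container",
                 "pick_bottle", "place_bottle", "push_box")
}

# Illustrative two-step plan: clear a box, then drive to the goal.
plan = [("push_box", "box_1", "cell_a", "cell_b"), ("navigate", "cell_a", "goal")]
execute_plan(plan, skills)
```

Keeping the mapping explicit means a plan step with no grounded skill raises an error immediately instead of being silently skipped.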

Warehouse Navigation

The first three videos are navigation tasks with the high-level goal "reach location". The robot must identify which containers, bottles, and heavy boxes determine access to the goal, then alternate obstacle rearrangement and navigation.

Warehouse Mobile Manipulation

The remaining six videos are mobile manipulation tasks. The goals include "move bottle to location", "move bottle to location and bottle on the ground", "move container to location", and "put bottle upon box". These tasks are harder than pure reach-location goals because the final plan may need to satisfy both where an object should be and how it should be placed.
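
One way to see why such goals are harder is to treat a goal as a conjunction of ground atoms that must all hold in the final state. The following is a minimal sketch under that assumption; the predicate names (`at`, `on`) and object names are hypothetical and are not taken from the actual planning domain.

```python
from typing import FrozenSet, Tuple

Atom = Tuple[str, ...]  # e.g. ("at", "bottle", "loc_3")


def goal_satisfied(state: FrozenSet[Atom], goal: FrozenSet[Atom]) -> bool:
    """A conjunctive goal holds when every goal atom is present in the state."""
    return goal <= state


# "move bottle to location and bottle on the ground" as two conjuncts.
placement_goal = frozenset({
    ("at", "bottle", "loc_3"),
    ("on", "bottle", "ground"),
})

# The bottle is at the right location but still resting on a box.
state = frozenset({
    ("at", "robot", "loc_3"),
    ("at", "bottle", "loc_3"),
    ("on", "bottle", "box_1"),
})

location_only = goal_satisfied(state, frozenset({("at", "bottle", "loc_3")}))
full_goal = goal_satisfied(state, placement_goal)
```

Here `location_only` is `True` while `full_goal` is `False`: reaching the right location is not enough until the placement conjunct also holds.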

Publication

Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints.
Qiwei Du, Zitong Zhan, Shaoshu Su, Bowen Li, Yi Du, Zhipeng Zhao, Taimeng Fu, Sebastian Scherer, Jiaoyang Li, Chen Wang.
arXiv preprint arXiv:2606.06877, 2026.

Reducing failures by 80% and eliminating exposure bias of neuro-symbolic task planning
```
@article{du2026iflax,
  title = {Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints},
  author = {Du, Qiwei and Zhan, Zitong and Su, Shaoshu and Li, Bowen and Du, Yi and Zhao, Zhipeng and Fu, Taimeng and Scherer, Sebastian and Li, Jiaoyang and Wang, Chen},
  journal = {arXiv preprint arXiv:2606.06877},
  year = {2026},
  url = {https://arxiv.org/abs/2606.06877},
  website = {https://sairlab.org/iflax/},
  video = {https://youtu.be/aRkq4TXYEhM},
  cover = {/img/posts/2026-06-08-iflax/iflax_cover_video.mp4},
  addendum = {Reducing failures by 80\% and eliminating exposure bias of neuro-symbolic task planning}
}
```
```
Du, Qiwei and Zhan, Zitong and Su, Shaoshu and Li, Bowen and Du, Yi and Zhao, Zhipeng and Fu, Taimeng and Scherer, Sebastian and Li, Jiaoyang and Wang, Chen, "Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints," arXiv preprint arXiv:2606.06877, 2026.
```
