Diffusion models have emerged as a promising choice for learning robot skills from demonstrations. However, diffusion models are neither robust to visual distribution shifts nor sample-efficient for policy learning. In this work, we present Factorized Diffusion Policies (FDP), a novel theoretical framework for learning action diffusion models without jointly conditioning on all observational modalities, such as proprioception and vision. Our factored approach yields a 10% absolute performance improvement across ten RLBench and four Adroit tasks compared to a standard diffusion policy that jointly conditions on all modalities. Moreover, FDP achieves 25% higher absolute performance across five RLBench tasks with distribution shifts such as visual changes or distractors, where existing diffusion policies fail catastrophically. Our real-world experiments show that FDP is safe to deploy and robust to visual distractors and appearance changes, maintaining strong performance even under significant visual disruptions and outperforming standard diffusion policies by over 40%.
In this work, we propose Factorized Diffusion Policies (FDP), a novel theoretical framework for learning action diffusion models that decouples observational modalities so they can be prioritized. At its core, FDP first trains a base model on a prioritized subset of the inputs, then learns a residual model over the modalities the base model omitted. The base and residual model outputs are composed to obtain samples from the full conditional action distribution. In addition, we present an architecture that enables efficient learning of the residual model in the FDP framework. We demonstrate that prioritizing modalities can yield significant gains in sample efficiency and naturally improves policy robustness to distribution shifts in the residual observations.
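To make the base/residual composition concrete, here is a minimal PyTorch-style sketch under our own assumptions: the module names (`base_net`, `residual_net`), their conditioning signatures, and the additive composition of noise predictions are illustrative stand-ins, not the exact architecture or composition rule from the paper.

```python
import torch
import torch.nn as nn


class FactorizedDenoiser(nn.Module):
    """Illustrative sketch of a factored action denoiser.

    `base_net` is conditioned only on the prioritized modality
    (e.g., proprioception); `residual_net` additionally receives the
    omitted modality (e.g., vision) and predicts a correction. Both
    networks are hypothetical placeholders.
    """

    def __init__(self, base_net: nn.Module, residual_net: nn.Module):
        super().__init__()
        self.base_net = base_net
        self.residual_net = residual_net

    def forward(self, noisy_action: torch.Tensor, t: torch.Tensor,
                proprio: torch.Tensor, vision: torch.Tensor) -> torch.Tensor:
        # Base noise prediction from the prioritized inputs alone.
        eps_base = self.base_net(noisy_action, t, proprio)
        # Residual correction from the modalities the base model omitted.
        eps_res = self.residual_net(noisy_action, t, proprio, vision)
        # One natural composition is additive: the sum approximates the
        # noise prediction of a fully conditioned denoiser, and can be
        # plugged into a standard diffusion sampling loop.
        return eps_base + eps_res
```

Because the residual network only has to model what the base model misses, it can be trained after (or alongside) the base model; at sampling time the composed prediction is used wherever a jointly conditioned denoiser would be.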
Tasks
We evaluate Factorized Diffusion Policy (FDP) and the Diffusion Policy (DP) baseline across four real-world domains and report their task success rates. The domains are: Close Drawer, a simple task in which the robot must push the drawer shut; Put Block in Bowl, which assesses the policy's ability to perform precise pick-and-place actions; Pour in Bowl, which evaluates the policy's dexterity when operating near joint limits; and Fold Towel, which assesses effectiveness in manipulating deformable objects.
Data Collection
We collect 50 demonstrations per domain on a Franka FR3 robot using a 6-DoF space mouse, recording proprioceptive observations together with visual observations from two cameras: one mounted on the gripper and one static camera covering the workspace.
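As a rough illustration of what each recorded timestep contains, the sketch below uses a hypothetical schema of our own; the field names and shapes are assumptions, not the dataset format used in the paper.

```python
from dataclasses import dataclass
import numpy as np


@dataclass
class DemoStep:
    """One timestep of a recorded demonstration (illustrative schema)."""
    proprio: np.ndarray     # robot state, e.g., joint positions and gripper state
    wrist_rgb: np.ndarray   # image from the gripper-mounted camera
    static_rgb: np.ndarray  # image from the static workspace camera
    action: np.ndarray      # commanded 6-DoF delta pose plus gripper command
```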
The trained policies are evaluated on four task variations in each domain:
- default: an in-distribution setup matching the conditions used during demonstration collection;
- color: the object's color is altered to test robustness to visual appearance changes;
- distractor: novel, unseen objects such as vegetation props and soft toys are added to the scene to introduce clutter;
- occlusion: visual input is intermittently blocked during policy rollout to simulate partial observability.
Rollout outcomes for each task, by variation (variations in the order introduced above):

Close Drawer
  default:    DP Success | FDP (Ours) Success
  color:      DP Success | FDP (Ours) Success
  distractor: DP Fail    | FDP (Ours) Success
  occlusion:  DP Fail    | FDP (Ours) Success

Put Block in Bowl
  default:    DP Success | FDP (Ours) Success
  color:      DP Fail    | FDP (Ours) Success
  distractor: DP Fail    | FDP (Ours) Success
  occlusion:  DP Fail    | FDP (Ours) Success

Pour in Bowl
  default:    DP Success | FDP (Ours) Success
  color:      DP Fail    | FDP (Ours) Success
  distractor: DP Fail    | FDP (Ours) Success
  occlusion:  DP Fail    | FDP (Ours) Success

Fold Towel
  default:    DP Success | FDP (Ours) Success
  color:      DP Fail    | FDP (Ours) Success
  distractor: DP Fail    | FDP (Ours) Success
  occlusion:  DP Fail    | FDP (Ours) Success
This section presents cases where Factorized Diffusion Policy (FDP) fails during execution. These examples highlight limitations under specific visual or task conditions.
The observed failures fall into two modes:
- Overfitted base model (three rollouts)
- Visual out-of-distribution (three rollouts)