Aligning Exploration and Purpose: The Motivational Engine of PILLAR-Robots

The Italian RTO ISTC-CNR is a partner of the EU project PILLAR-Robots

Balancing Purpose and Curiosity in Autonomous Robots

Robots that operate in the real world live at the crossroads of two powerful drives. On the one hand, they are expected to faithfully accomplish the goals assigned by their users. On the other hand, they must remain curious explorers (constantly learning, refining their skills, and adapting to ever-changing environments). The work developed in PILLAR-Robots tackles this delicate balance through the creation of an integrated Motivational Engine: an innovative system that allows robots to pursue human-defined purposes while simultaneously engaging in autonomous, lifelong learning.

At the core of this development is a simple question: how can a robot decide what to do next when it has multiple “desires”? For example, it may want to improve its competence, discover new goals in the environment, or directly fulfil the task assigned by the user. The Motivational Engine introduces a structured way to manage these competing drives, bringing together three core capabilities that make the robot not just reactive, but truly proactive and purpose-driven.

Three Core Capabilities of the Motivational Engine

First, goal discovery. Rather than waiting for instructions, the robot actively interacts with its environment, detecting meaningful changes in perception and venturing into novel situations. From these experiences, it autonomously identifies potentially relevant states and stores them as candidate goals (building its own repertoire of future learning opportunities).

Second, goal selection. When multiple goals are available, the robot must decide where to focus. At the heart of this process lies a motivation selector. If the robot cannot complete the assigned task, it may temporarily shift its focus to exploration or skill acquisition. When it recognises that the task is within reach, it redirects its efforts toward completing the task. This dynamic arbitration allows the system to adapt to both its internal state and the environment.

Third, purpose bias. The PILLAR-Robots are built with a dedicated alignment mechanism that enables the robot to align its autonomous learning with the user’s mission. At its core is a “prototype” of the purpose, an abstract representation of features associated with successful task completion. This prototype can be provided externally or learned progressively through interaction. Once established, it subtly guides exploration and learning, steering the robot toward goals that are most likely to contribute to fulfilling the user’s mission.

Integrating the Motivational Engine into the PILLAR-Robots Architecture

Together, these three capabilities transform the robot into an agent that explores with curiosity, learns with strategy, and acts with purpose. These components have been integrated into an extended version of the H-GRAIL cognitive architecture and tested in both simulated environments and real robotic setup. The experiments considered scenarios with independent and interdependent goals, as well as changing domains where the task requirements shift over time. In these conditions, the integrated engine enabled the robot to more effectively focus its learning on goals relevant to the user’s purpose, while still maintaining autonomous exploration capabilities. 

The result is a working module that supports open-ended learning without losing alignment with user requirements. It provides a concrete operational pipeline for integrating exploration, competence development, and task fulfilment within a unified system. The Motivational Engine is now prepared for deployment within the broader cognitive architecture of PILLAR-Robots, supporting the development of robots that can learn continuously while remaining oriented towards meaningful objectives.