Evaluating Actuators in a Purely Information-Theory Based Reward Model

04/10/2018
by   Wojciech Skaba, et al.
0

AGINAO builds its cognitive engine by applying self-programming techniques to create a hierarchy of interconnected codelets - the tiny pieces of code executed on a virtual machine. These basic processing units are evaluated for their applicability and fitness with a notion of reward calculated from self-information gain of binary partitioning of the codelet's input state-space. This approach, however, is useless for the evaluation of actuators. Instead, a model is proposed in which actuators are evaluated by measuring the impact that an activation of an effector, and consequently the feedback from the robot sensors, has on average reward received by the processing units.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2018

Binary Space Partitioning as Intrinsic Reward

An autonomous agent embodied in a humanoid robot, in order to learn from...
research
04/10/2018

The AGINAO Self-Programming Engine

The AGINAO is a project to create a human-level artificial general intel...
research
05/06/2023

A Novel Reward Shaping Function for Single-Player Mahjong

Mahjong is a complex game with an intractably large state space with ext...
research
06/14/2020

Tackling Morpion Solitaire with AlphaZero-likeRanked Reward Reinforcement Learning

Morpion Solitaire is a popular single player game, performed with paper ...
research
06/23/2020

Feature Expansive Reward Learning: Rethinking Human Input

In collaborative human-robot scenarios, when a person is not satisfied w...
research
11/30/2022

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

One of the most successful paradigms for reward learning uses human feed...
research
07/25/2023

Unbiased Weight Maximization

A biologically plausible method for training an Artificial Neural Networ...

Please sign up or login with your details

Forgot password? Click here to reset