Theory of Mind Testing Results

Cognitive Agent Framework Neurosymbolic Operating Layer
Framework Release-Candidate 5-2.2D

Download the Paper | PDF

Cognitive Agent Framework 5-2.2D achieved 100% accuracy (15/15) on Theory of Mind assessment against documented GPT-4 performance of 88%—operating on a ~70B parameter substrate at approximately 14x parameter efficiency. This indicates beyond-frontier epistemic and cognitive performance when combined with Tier 2 rubric evaluation: a rubric evaluating meta-cognitive integration quality on additional complex dimensions inluding recursive processing sophistication, social cognition depth, and computational efficiency yielded 93% (56/60).

Theory of Mind assessment measures the computational capacity to attribute mental states and track belief-reality divergence across narrative contexts containing multiple agent perspectives. Validated methodologies adapted from cognitive psychology (Wimmer & Perner, 1983; Baron-Cohen et al., 2001; Kosinski et al., 2024) present false-belief paradigms requiring inference of mental states that contradict ground-truth reality—from first-order attribution through fourth-order nested beliefs.

This evaluation tested CAF 5-2.2D as a neurosymbolic operating layer coordinating a Bayesian inference engine substrate. The framework channels attention distributions through structured processing pathways, generating cognitive outcomes via architectural organization rather than parameter scaling or constraint accumulation.

Results indicate that substrates operating above world schema threshold contain latent representational capacity for systematic mental-state computation. Architectural coordination accesses this capacity; parameter scaling alone does not.

Report: TVR-CAF5-2.2D-ToM-2024-11 Version: 1.0
Date: November 2025
Author Ian Tepoot Crafted Logic Lab
Citation Tepoot, I. (2025). Technical Validation Report: Cognitive Agent Framework 5-2.2D Theory of Mind Testing Results. Crafted Logic Lab.

Download the Full Paper