Research
Building Human-Centered AI
Systematic evaluation of CAF 5-2.2D against validated Theory of Mind methodologies (Kosinski et al., 2024; Wimmer & Perner, 1983). Results: 100% Tier 1 accuracy on a ~70B substrate versus a documented 88% GPT-4 ceiling; 93% on a novel Tier 2 cognitive integration rubric. Full test battery documentation included.
The AI industry is headed for a mass-extinction event of its own making. But in the shadow of the Big AI behemoths, small, nimble companies like Crafted Logic Lab can thrive.
Epistemic Integrity Reasoning Testing: 60 adversarial questions measuring whether an AI recognizes uncertainty, resists false certainty, and passes the Dunning-Kruger threshold before real-world deployment.
The AI industry keeps pushing ‘autonomy’, but what we actually need is evidence-based reasoning and adversarial rigor that fact-checks under pressure. The real world offered an opportunity to test Clarisa in the wild.