CIMO LabsCIMO Labs

Getting Started

Find Your Path Through CIMO

The CIMO Framework has different entry points depending on your role and goals. Choose your pathway below to get started.

Production

Battle-tested, empirically validated

I'm an ML Engineer

I need to evaluate LLM systems, ship reliable models, and get valid confidence intervals.

Step 1: Understand the Problem

Start by understanding why naive evaluation fails under optimization pressure.

Step 2: Install and Run CJE

Get CJE running on your data in 5 minutes. No PhD required.

# Install CJE
pip install cje-eval

Step 3: Understand Your Results

Learn what the diagnostics mean and when to trust your estimates.

Step 4: Go Deeper (Optional)

Understand the theoretical foundations when you need them.

I'm a Researcher

I want to understand the theory, validate the claims, and potentially extend the framework.

Start: Theoretical Foundations

Begin with the core theory and assumptions that ground the framework.

Deep Dive: Pillar A (CJE)

The statistical machinery: calibration, off-policy evaluation, and uncertainty quantification.

Deep Dive: Pillar C (Y*-Alignment)

The geometry of optimization: manifolds, mediation, and the Goodhart Limit.

Empirical Validation & Governance

Examine the Arena benchmark and understand validation scope (model oracles vs human labels).

Validation Scope

Arena validates the statistical machinery (S→Y) using GPT-5 as oracle. Validating the Bridge (A0: Y→Y*) requires domain-specific human/business data.

Replicate & Extend

Run the benchmark yourself and contribute improvements.

I'm a Decision Maker

I need to understand the ROI, risks, and resource requirements before committing.

Start: The Business Case

Understand the cost-benefit analysis and risk mitigation value.

The Problem: Why Current Approaches Fail

Understand the concrete failure modes that CJE prevents.

Implementation Path

See what adoption looks like and resource requirements.

I'm Confused About Evaluation

I don't know what I don't know. Help me understand the landscape.

Start Here: The Core Problem

Begin with the fundamental issue: why metrics become unreliable under optimization.

Then: The CIMO Solution

Understand the three-pillar framework that addresses these failures.

Get Clarity: The Glossary

Learn the core concepts and notation. Return here whenever confused.

Choose Your Next Step

Based on your role, jump to the appropriate pathway above.