← Back to articles

The 0-5 scale (canonical descriptors)

Use these as your starting descriptors. Edit the noun phrases to match your context (replace "the standard" with the specific SOP / regulation / framework your team works to).

LevelNameDescriptor
0No exposureHas never performed the skill. Honest "I don't know", useful and required.
1AwarenessUnderstands what the skill is and when it applies. Can ask the right questions; cannot yet do the work.
2SupervisedCan perform the skill with direct supervision and check-back. Needs a second pair of eyes.
3IndependentPerforms the skill solo to the agreed standard. The benchmark for "qualified for the role".
4ProficientHandles non-standard cases and exceptions without escalation. Teaches and coaches juniors.
5Expert / trains othersSets the standard. Develops procedures. Certifies other people as 3 or 4. There should be relatively few 5s.

Evidence rules (what unlocks each level)

Without an evidence rule, ratings drift upwards. Pin them with this minimum:

The point of evidence rules is not bureaucracy. It is to make sure a "4" awarded in one corner of the organisation is the same shape as a "4" in another.

Who rates whom?

The three-rater pattern is the most reliable:

  1. Self-rating first. Honest, recorded, dated.
  2. Line manager second, without seeing the self-rating.
  3. Subject-matter expert / peer third, on skills the line manager doesn't personally hold at the required level.

When the three ratings agree, the score sticks. When they differ by more than one level, a 15-minute conversation reconciles them. Most reconciliations land between the self-rating and the manager rating; the SME is the tiebreaker on technical skills.

Refresh cadence and stale ratings

Set a review-by date on every cell. The matrix should automatically flag any rating older than the cadence (in our Excel template and PulseAI, this is a configurable amber/red flag).

Recommended cadences by skill type:

Calibration sessions

A calibration session is a 30-minute meeting between managers who rate against the same scale. Pick three skills, look at the score distributions side by side, and ask: "do these distributions feel right relative to each other?" If they don't, the descriptor needs sharpening, not the scores.

Run calibration quarterly for the first year, then twice a year. Without it, scales drift and the matrix gradually loses trust.

Worked example, three skill descriptors in three domains

Skill: CNC machining

Skill: Customer escalation handling

Skill: IFRS lease accounting

Governance one-pager

Keep the policy short. One page is the maximum that anyone will actually read. Include only:

  1. The scale and descriptors.
  2. The evidence rules per level.
  3. The who-rates-whom flow.
  4. The refresh cadence.
  5. The calibration cadence and owner.
  6. Who owns the matrix (named person + deputy).

Everything else is a footnote. The point is for every rater to be able to read the page in 4 minutes and start scoring.

Use the template's built-in policy sheet

The Excel template ships with this policy as a pre-filled sheet. Edit it to your context and you're ready to rate.

Get the template, £199 →