Evaluating skills

How to evaluate and review skills to ensure they improve agent workflows

Skills provide procedural knowledge and specific workflows that agents load when relevant. This guide explains how to view skill reviews and evaluate quality before deploying to your team.

Why evaluate skills?

Skills encode team knowledge and workflows. Skill reviews help you:

  • Assess if skills conform to the skills standard

  • Validate skill content quality (how likely skill is to help) before deploying to your team

  • Validate skill description quality (how likely skill is to activate) before deploying to your team

Viewing skill reviews

In the Tessl Registryarrow-up-right, skill reviews show multiple scores.

Example: React development skill reviewarrow-up-right

Review Score: Overall quality assessment (0-100%)

  • Structure and clarity of instructions

  • Conformance to Agent Skills Specification

  • Integration with agent workflows

Validation Score: Specific criteria passed/failed

  • Required frontmatter fields present

  • Proper trigger hints for activation

  • Step-by-step workflow structure

  • License and metadata completeness

Implementation Score: Code quality in examples

  • Working, tested code samples

  • Security best practices

  • Error handling

Activation Score: How well agents discover the skill

  • Clear trigger conditions

  • Relevant keywords and descriptions

  • Appropriate scope definition

Each skill review includes detailed validation results showing what passed, what needs improvement, and specific recommendations.

What scores mean:

  • 90%+ Review Score: High-quality skill, ready for production use

  • 70-89% Review Score: Good skill, may have minor improvements needed

  • Below 70%: Needs work before deployment

Use these scores to choose quality skills and identify what to fix in your own skills before publishing.

Automatic evaluation on publish

When you publish a skill to the registry using tessl skill publish, evaluations run automatically:

What happens automatically:

  • Skill is linted for format and structure

  • Quality review is performed

  • Review scores are calculated and displayed in the registry

Reviewing skills locally

Before publishing skills, validate them locally:

What lint checks:

  • SKILL.md file exists and is properly formatted

  • Required frontmatter fields (name, description)

  • Valid Agent Skills Specification format

What review checks:

  • Skill quality and completeness

  • Best practices conformance

  • Trigger hints for activation

  • Workflow structure

  • Metadata completeness

Fix issues locally, then publish. The registry will show updated review scores.

Here's a sample output:

Next steps

Last updated