# Overview - Improving skills and plugins

Tessl is evolving, one of the main pillars is the visibility and tools available to ensure your skills actually work! Tessl provides a number of capabilities to assess your skills and plugins to validate *quality*, *actionability* and *outcomes*. Think of Tessl as a toolkit.

Often users use the terms "review", "eval" and "evaluations" interchangeably, or in Tessl you have lint, review, and several types of evals, but at the end of the day, there are tools with different capabilities that are designed to help make your skills better.

Let's review these capabilities and how they work, click the link to see documentation and examples:

| Capability/documentation                                                                                                            | Use case                                                                                                                                                                                                                                                                                                                                                   | Helpful information                                                                                                                                                                                                                                     |
| ----------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [Lint](/improving-your-skills/evaluating-skills.md#reviewing-skills-locally)                                                        | Test packaging of skills into a Tessl plugin.                                                                                                                                                                                                                                                                                                              |                                                                                                                                                                                                                                                         |
| [Review](/improving-your-skills/evaluating-skills.md) & [Optimize](/improving-your-skills/optimize-a-skill-using-best-practices.md) | Use LLM to judge if skill conforms to best practices, like Skill standard. An example would be if the Description field matches best practices, so that it will activate, which is something often incorrectly set. You can use the Optimize option to take the recommendations, preview changes, and then accept them to be automatically applied.        | <ul><li>Available locally on the CLI.</li><li>Automatically generated on publish and displayed with the skill in the registry.</li><li>Displayed as "<strong>Quality</strong>" in the web interface when viewing a skill.</li></ul>                     |
| [Scenario based evaluations](/improving-your-skills/evaluate-skill-quality-using-scenarios.md)                                      | Tessl lets you run end-to-end task evaluations for your skills directly from the CLI. You generate a set of scenarios, run an agent against them, and see how well it performs — with and without your skill injected. This workflow is designed for fast, repeatable iteration as you develop and refine a skill, without building your own eval harness. | <ul><li>Available locally on the CLI.</li><li>Auto generate scenarios or create your own.</li><li>Scenarios, if present on publish, will appear under the <strong>Evals</strong> tab in the web interface when viewing a skill.</li></ul>               |
| [Evaluate codebase agent readiness](/improving-your-skills/evaluating-your-codebase.md)                                             | How well your context files (skills, rules, documentation) enable an AI agent to complete real tasks on your codebase. It covers scenario definition, running agents with different setups, testing variations, and comparing results.                                                                                                                     | <ul><li>Available locally on the CLI.</li><li>Generate scenarios based off a commit or create your own.</li><li>Scenarios, if present on publish, will appear under the <strong>Evals</strong> tab in the web interface when viewing a skill.</li></ul> |
| [Agent session analysis](https://tessl.io/blog/audit-log-evaluations/)                                                              | An official skill to perform agent session analysis to optimize your skills. Discover friction points your agent is having and improve performance using Tessl's powerful logging tools.                                                                                                                                                                   | [try-tessl/agent-quality](https://tessl.io/registry/try-tessl/agent-quality)                                                                                                                                                                            |

## Evals vs Reviews

The **Lint** & **Review Skills** feature reviews the skills against best practice, whereas **Evaluations** actually generates *scenarios* and then validates the quality of the skill, by testing if agents perform better against those scenarios with the skill.

You use *both* Evaluations and Reviews to make a better plugin.

It's also important to note that there are two main categories of evaluations: synthetic tests based on what the skill says it does (aka "[Scenario based evals](/improving-your-skills/evaluate-skill-quality-using-scenarios.md)"), vs trying to reproduce a real commit with context provided (aka "[Evaluate codebase agent readiness](/improving-your-skills/evaluating-your-codebase.md)").


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.tessl.io/improving-your-skills/overview-improving-skills-and-plugins.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
