Question 1

What is Auto Skill Improver for Claude Cowork?

Accepted Answer

Auto Skill Improver for Claude Cowork is an open-source tool that applies benchmark-driven iteration to your Cowork skill files and project instructions. It classifies your skill type, builds a test suite, establishes a baseline score, then systematically mutates one instruction at a time — keeping only changes that measurably improve Cowork performance for your team.

Question 2

How does it improve Cowork skill files specifically?

Accepted Answer

It treats your Cowork skill file as a testable artefact. The tool generates team scenarios that exercise your instructions, measures Cowork's output quality against defined criteria, then makes targeted changes — one at a time — to find which instruction tweaks produce measurably better results for team workflows.

Question 3

What types of Cowork skills can it improve?

Accepted Answer

Any Cowork skill type — team assistants, project orchestrators, code reviewers, documentation generators, onboarding helpers, and more. The tool classifies the skill type automatically and builds appropriate benchmarks for that category.

Question 4

How do benchmarks work for Cowork instructions?

Accepted Answer

A benchmark is a structured test suite with defined inputs and pass/fail criteria. For a team assistant skill, this might be 'provide accurate answers to common project questions'. For a reviewer skill, it might be 'identify the key issues in this pull request'. The benchmark runs the same tests before and after each skill file change.

Question 5

Can I use it for skills shared across a team?

Accepted Answer

Absolutely. Auto Skill Improver is particularly valuable for shared Cowork skills because improvements compound across every team member's experience. A 10% improvement in a skill used by 20 people is 20x more impactful than improving a personal prompt.

Question 6

What does 'benchmark saturation' mean for Cowork?

Accepted Answer

Benchmark saturation occurs when successive skill file mutations stop producing score improvements. Your instructions have reached the ceiling of what the current benchmark can measure. You can either accept the current performance or create a harder benchmark that tests more advanced team scenarios.

Question 7

How is this different from manually editing Cowork skill files?

Accepted Answer

Manual editing is editorial: you rewrite instructions, they sound more helpful, you deploy them. But 'sounds more helpful' isn't evidence. Auto Skill Improver is empirical: it establishes a baseline, changes one instruction at a time, re-runs the same benchmark, and keeps only what scores higher.

Question 8

Is it free and open source?

Accepted Answer

Yes. Auto Skill Improver is fully open source and free to use. The source code is available on GitHub at github.com/mlobo2012/auto-skill-improver. There are no usage limits, no API keys required for the tool itself, and no premium tiers.

Auto Skill Improver for Claude Cowork — Benchmark-Driven Skill Optimisation

How It Works with Cowork

Get the Guide File

Step-by-Step: Set Up Auto Skill Improver in Cowork

Download the quickstart file

Open your Claude Cowork project

Upload the file to the project

Point it at your project skill file or workflow instructions

Review baseline score, run mutations

Export the improved skill back to your project

Why Most Cowork Skill Iteration Fails

Skill Vibes

Skill Science

What It Finds in Cowork Skills

Ambiguous Output Contracts

Missing Fallback Behaviour

Conflicting Instruction Layers

Dependency & Portability Problems

Weak Evidence Discipline

Structural Formatting Issues

The Karpathy-Inspired Method

Classify the Skill Type

Build a Real Benchmark

Establish a Baseline

Mutate One Thing at a Time

Keep Only What Improves

Stop When the Benchmark Saturates

When to Use It — and When Not To

Best For

Not the Right Fit

Frequently Asked Questions

Also Available For

Stop Guessing. Start Measuring.