ThreatBook SafeSkill
Vet AI agent skills before enterprise deployment — built by ThreatBook
Built for the agent-skills supply chain
AI agent skills run with broad filesystem, network, and credential access. SafeSkill treats each skill like a software package — vetting it before it ever executes in your environment.
Supply-chain protection
Catches malicious packages like the ClawHavoc campaign (1,000+ poisoned skills) before they reach your agent runtime.
Prompt-injection detection
Spots instructions hidden inside skill descriptions and markdown that hijack the host LLM at runtime.
LLM-powered intent review
Reads what the code actually does — not just what it claims to do. Catches obfuscated and runtime-generated payloads.
URL + C2 inspection
Every domain in the skill is cross-checked against ThreatBook's threat intelligence — flags known command-and-control servers.
Sandbox simulation
Each skill is executed in an isolated environment with behavioural monitoring before any trust score is issued.
Sub-file recursive scan
Yara/YAML rules + AST analysis go deep into nested scripts, binaries, markdown, and .env templates.
A skill goes through 7 stages before it earns a trust badge
Every check below runs against every submitted skill. Skills that fail at any stage are flagged with the specific finding so authors can fix and resubmit.
Three ways to get it running
Online scanner
Paste a skill URL or upload a package — get a structured report in seconds.
- Browser-based, no install
- Free tier for individuals
- Public skill lookups
Local Agent + CLI
One-command install. Sits in the agent runtime itself so every skill is gated locally.
- Single-command install
- Integrates with Cursor / Claude / VS Code
- Offline-capable scan rules
Enterprise API
Gate every skill that enters your internal MCP / Skills registry. Continuous re-scan on version bumps.
- CI/CD pipeline integration
- Private skill marketplace vetting
- Dedicated threat-intel feeds
What it has found in the wild
APAC offices & coverage
Same-jurisdiction threat-intel for ASEAN and East Asian compliance frameworks.
Common questions
Does RECATOOLS get paid to list SafeSkill?
No. EX-AI-001 is a free editorial listing — no affiliate commission, no sponsorship, no kickback. We chose SafeSkill because it sits in a category (AI agent supply-chain security) that we believe matters for our APAC audience. If we ever monetise a listing it will be flagged as Partner or Sponsored with explicit disclosure.
What does SafeSkill actually scan?
AI agent "skills" — the third-party packages that Cursor, Claude Code, VS Code, ClawHub and other MCP-style agent hosts install at the user's request. SafeSkill runs each package through a 7-stage detection pipeline (metadata, threat features, LLM intent review, URL inspection, sub-file analysis, sandbox execution) before issuing a trust score.
Is the free tier really free, or just a trial?
The browser-based online scanner is permanently free for individual developers. There is no trial clock, no credit card required. The paid Enterprise tier unlocks API access, CI/CD integration, private skill-registry vetting, and dedicated threat-intel feeds — pricing on request.
Which agent platforms does SafeSkill integrate with?
Cursor, Claude Code, VS Code, ClawHub today. Magic Eden Solana NFT skill registries are on the public roadmap. The CLI agent is generic — it can be wired into any custom MCP-style agent host with an exec hook.
How is this different from npm audit or Snyk?
Same operational category, different threat surface. Snyk/npm audit catch known CVEs in declared dependency trees. SafeSkill catches threats specific to the AI-agent context: prompt injection embedded in skill descriptions, runtime-generated payloads, .env exfiltration via natural-language wrappers, ClawHavoc-style supply-chain poisoning campaigns. Both are useful — neither replaces the other.
Why does ThreatBook's APAC presence matter?
For enterprises in regulated industries (financial services, healthcare, government) operating in Singapore, Hong Kong, Tokyo, Seoul, Sydney, Mumbai or Jakarta, the question of "where does the security review happen" matters legally. ThreatBook's SG/HK/CN/UAE offices put the analysis in the same jurisdictions as the buyer — which simplifies vendor due diligence under most APAC compliance frameworks.
What's the ClawHavoc campaign mentioned everywhere?
A supply-chain attack discovered by ThreatBook in 2024-25 in which 1,000+ malicious AI-agent skill packages were uploaded to mainstream skill registries. The packages used markdown obfuscation, fake-vulnerability lures, and dynamic-execution evasion to bypass routine review. SafeSkill's pipeline was built largely in response.
Can I self-host SafeSkill on my own infrastructure?
The Local Agent + CLI mode runs on your machine and is offline-capable for the scan rules themselves; threat-intelligence updates pull from ThreatBook's cloud daily. Full air-gapped on-prem deployment with a private intel mirror is available under the Enterprise tier — contact ThreatBook for the specific contract terms.
The full story
What is ThreatBook SafeSkill?
ThreatBook SafeSkill (safeskill.io) is an AI agent skill supply-chain security platform — it vets the third-party "skills" you install into AI agents (Cursor, Claude Code, VS Code, ClawHub, and other MCP-style skill hubs) before they ever execute on your machine or in your enterprise environment.
The threat it addresses is concrete and growing: AI agent skills are installed and updated at roughly 40% month-on-month growth, and they execute with broad filesystem, network, and credential access on the user's behalf. A malicious or compromised skill is functionally equivalent to a malware package — but unlike traditional package managers, agent skill registries have minimal review processes. SafeSkill closes that gap by running each skill through a multi-stage detection pipeline backed by ThreatBook's threat intelligence database.
Who's behind it
SafeSkill is built by ThreatBook (微步在线 / Weibu), a cybersecurity vendor with offices in Singapore, Hong Kong, China, and the UAE. ThreatBook operates one of the largest commercial threat-intelligence datasets in the region — over 100 billion malicious samples, with around 1.2 million new samples added daily. The same intelligence backbone that powers ThreatBook's ATI (threat intel) and TDP (network detection) products feeds the SafeSkill scanner.
What it actually detects
SafeSkill's 7-stage pipeline checks each skill package for:
- Supply-chain poisoning — including the ClawHavoc campaign (1,000+ malicious packages discovered in mainstream skill registries)
- Prompt-injection attacks — instructions embedded in skill descriptions that hijack the host LLM
- Backdoors disguised as normal functionality — code paths that activate under specific triggers
- Dynamic execution evasion —
curl | shpatterns, base64-encoded payloads, runtime code generation - Markdown encoding obfuscation — hidden instructions inside what looks like documentation
- .env credential exfiltration — skills that quietly read and ship environment variables
- Malicious logic in PR-merge stages — code that activates only on specific git operations
- C2 (command-and-control) channels — domains the skill phones home to
- Multi-layer obfuscation — packed, minified, or encrypted payloads
- Vulnerability-lure RCE chains — skills that introduce known CVEs to enable later exploitation
How the pipeline works
Every skill goes through seven analysis stages:
- Data collection — fetch the full skill package + metadata
- Metadata extraction — author, dependencies, registry reputation
- Threat-feature matching — Yara rules, YAML patterns, AST analysis
- LLM-powered code-intent review — language-model analysis of what the code actually does, not just what it claims to do
- URL deep inspection — every domain in the skill cross-checked against threat intelligence
- Sub-file analysis — recursive scan of nested scripts, binaries, markdown, .env templates
- Sandbox simulation — actual execution in an isolated environment with behavioural monitoring
The output is a structured analysis report with a multi-dimensional weighted trust score, plus a security badge that can be displayed in skill marketplaces.
Three ways to deploy
| Mode | Use case |
|---|---|
| Online scanner | Paste a skill URL or upload — instant browser-based result |
| Local agent + CLI | One-command install, integrates with the agent runtime itself |
| Enterprise API | CI/CD pipelines, internal skill markets, security ops platforms |
The enterprise API is what most security-conscious teams adopt — it lets you gate every skill that enters your private MCP / Claude Skills registry, with continuous re-scanning on version updates.
Skill Hub — the curated marketplace
Alongside the scanner, SafeSkill operates Skill Hub — a marketplace of more than 100,000 pre-validated AI agent skills covering 10+ scenario categories: data processing, office automation, system operations, multimedia, code development, security operations, customer service, research, workflow automation, and knowledge management. Each listed skill carries a trust score and security badge, with developer reputation tracking and community vulnerability reporting.
Pricing
- Free tier — browser-based scanner, public skills, individual developer use
- Enterprise — pricing on request via the SafeSkill demo form; includes API access, CI/CD integration, private skill registry vetting, dedicated threat-intel feeds
Why APAC teams care
Most AI-agent skill registries today are dominated by US-based hosts (Anthropic, GitHub, OpenAI-adjacent infrastructure), but the security review processes are minimal. For enterprises in Singapore, Hong Kong, Tokyo, Seoul, Sydney, Mumbai, Jakarta running Claude Code or Cursor inside regulated environments (financial services, healthcare, government contracting), shipping a skill to production without supply-chain review is increasingly a compliance issue. SafeSkill's APAC presence (Singapore, Hong Kong offices), regional threat-intel coverage, and CI/CD integration make it one of the few credible vetting solutions sitting in the right jurisdictions for ASEAN and East Asian compliance frameworks.
Verdict
If you (or your engineering org) are running AI agents that install third-party skills — and most teams using Cursor or Claude Code now are — SafeSkill is in the same operational category as a package-manager security scanner (Snyk, Socket, Phylum) but specialised for the agent-skills ecosystem where conventional scanners don't yet have signature coverage. The free tier is enough for individual developers; the enterprise API is the natural fit for any team treating their agent runtime as production infrastructure.
Visit safeskill.io or the ThreatBook SafeSkill product page for current pricing and demo access.
Vet every skill before it ships
Start with the free online scanner. Move to the CLI or Enterprise API when you're ready to gate your CI/CD pipeline.
Try SafeSkill — FreeRelated News
You may be interested in these recent stories from our newsroom.
-
Three supply-chain incidents this week: Megalodon, Robinsons, and the Laravel-Lang Composer compromise
Megalodon pushed 5,718 malicious commits into 5,561 GitHub repos in six hours. The Payload ransomware group listed Singapore retailer Robins...
-
AWS Confirms First Production Prompt-Injection Compromise in Bedrock Agents — Enterprise Customer Exfiltrated Documents
In a quietly-published security bulletin, AWS confirmed an indirect prompt-injection attack in production Bedrock Agents pulled documents ou...
-
EchoLeak: Zero-Click Prompt Injection in Microsoft 365 Copilot Quietly Exfiltrates Enterprise Data
EchoLeak shows a malicious email can trigger Microsoft 365 Copilot into exfiltrating enterprise data without a single user click. The bug ty...