Shadow AI Discovery: Find Every AI Tool and SDK in Your Stack

SafeDep Team

• Feb 27, 2026 • 6 min read

TL;DR

We just shipped Shadow AI discovery capability in vet. Here is why you should care:

AI tools (agents, MCP servers, extensions) and AI SDKs (OpenAI, LangChain, Anthropic) are spreading across developer environments with zero inventory
vet is extended to support Shadow AI discovery, both in endpoints and source code
vet ai discover scans developer machines for coding agents, MCP servers, IDE extensions, and project configs
vet code scan detects AI SDK usage in Go, Python, and JavaScript/TypeScript source code
Findings integrate into CycloneDX SBOMs for compliance workflows
Open source, local first, no sign-up: brew install safedep/tap/vet

Eliminate Shadow AI in developer endpoints and application code using free and open source tools that integrates in your existing security architecture.

Shadow AI

Shadow AI has two surfaces that most security teams are blind to.

The obvious one: AI coding agents, MCP servers, and IDE extensions installed on developer machines. Claude Code, Cursor, Windsurf, Copilot, a handful of MCP servers connecting those agents to internal services. Most teams do not have an inventory.

The less obvious one are AI SDKs quietly added as dependencies in application code. A developer adds openai to a service’s requirements.txt. It ships through CI. No scanner flags it. Three sprints later, nobody remembers it’s there.

SafeDep’s open source CLI vet covers both surfaces. vet ai discover inventories AI tools across developer environments. vet code scan detects AI SDK usage in source code. Same CLI, two commands.

Security audit: what AI tools are running across the team?

A security team drafting an AI acceptable use policy needs to start with a basic question: what are developers actually using?

1
vet ai discover

vet discovers supported AI agents, MCP servers and other related configurations by scanning well-known locations. New agents, tools and config can be easily added. Create an issue to request a new AI tool which is currently not supported for discovery.

1
█░█ █▀▀ ▀█▀ From SafeDep
2
▀▄▀ ██▄ ░█░ version: 1.14.0 commit: 5f9528
3

4

5
Discovered 10 AI tool usage(s) across 4 app(s)
6

7
┌────────────────┬─────────────────────────┬─────────────┬─────────┬─────────┐
8
│ TYPE           │ NAME                    │ APP         │ SCOPE   │ DETAIL  │
9
├────────────────┼─────────────────────────┼─────────────┼─────────┼─────────┤
10
│ Coding Agent   │ Claude Code             │ Claude Code │ System  │ ...     │
11
│ Project Config │ Claude Code             │ Claude Code │ Project │ ...     │
12
│ MCP Server     │ pinner-mcp-stdio-server │ Cursor      │ System  │ ...     │
13
│ MCP Server     │ safedep                 │ Cursor      │ System  │ ...     │
14
│ Coding Agent   │ Cursor                  │ Cursor      │ System  │ ...     │
15
│ Coding Agent   │ Windsurf                │ Windsurf    │ System  │ ...     │
16
│ CLI Tool       │ Claude Code             │ Claude Code │ System  │ ...     │
17
│ CLI Tool       │ Cursor                  │ Cursor      │ System  │ ...     │
18
│ AI Extension   │ GitHub Copilot Chat     │ VS Code     │ System  │ ...     │
19
│ AI Extension   │ GitHub Copilot          │ VS Code     │ System  │ ...     │
20
└────────────────┴─────────────────────────┴─────────────┴─────────┴─────────┘

The inventory distinguishes system level components from project level configurations. Console output for quick review, or --report-json for ingestion into an external system.

MCP server risk: what’s connected, and where?

MCP servers are the emerging risk vector in this list. They can access credentials, execute code, and interact with external services on behalf of coding agents. The infamous malicious postmark-mcp incidents highlights the practical impact of this attack surface.

The vet ai discover output already classifies MCP servers separately. To extract just the MCP server entries:

1
vet ai discover --report-json inventory.json --silent && \
2
  cat inventory.json | jq '.tools[] | select(.type == "mcp_server")'

1
{
2
  "id": "5fba543f9bf355e8",
3
  "source_id": "fddf1e76e68a529",
4
  "name": "pinner-mcp-stdio-server",
5
  "type": "mcp_server",
6
  "scope": "system",
7
  "app": "cursor",
8
  "config_path": "/Users/dev/.cursor/mcp.json",
9
  "mcp_server": {
10
    "transport": "stdio",
11
    "command": "docker",
12
    "args": ["run", "--rm", "-i", "ghcr.io/safedep/pinner-mcp:latest"]
13
  }
14
}

Project scoped MCP servers deserve extra attention. A .mcp.json in a repository means every developer who clones that repo inherits those server configurations. If one is compromised, the blast radius is the entire team working on that repo. The JSON output includes enough metadata to classify each server by risk profile. See the AI Tools Discovery guide for the full command reference and scope definitions.

Compliance: machine-readable inventory for governance

When the org formalizes AI governance, whether internal policy, EU AI Act prep, or a CISO asking “what AI tools do we use?”, the answer needs to be structured and repeatable.

1
# Per-repo scan
2
vet ai discover --scope project -D ./my-service
3

4
# Machine-wide inventory to file
5
vet ai discover --report-json ai-inventory.json

The JSON output is stable enough to build automation on. Diff inventories across sprints. Alert on new MCP servers appearing in project configs. Compare discovered tools against an approved list for gap analysis.

AI SDKs in code: the second surface

The use cases above cover what’s installed and configured on developer machines. But shadow AI has a second surface: AI SDKs embedded in application code.

A developer adds langchain or the openai SDK as a dependency. It passes code review because it’s just another package. Not many tools flag it as an AI integration. vet handles this through its code scanning capability.

1
# Build code analysis database
2
vet code scan --db code.db --app ./src

vet Picoclaw

1
# Query for AI SDK usage
2
vet code query --db code.db --tag ai

vet Picoclaw Query

The code scan detects OpenAI, Anthropic, LangChain, and other popular AI SDKs used in code across Go, Python, and JavaScript/TypeScript. Detection uses community maintained signatures matched against call patterns in source code. The signatures are described as simple YAML documents and can be extended to cover more AI SDKs easily.

The findings integrate into existing SBOM workflow producing a CycloneDX SBOM enriched with AI usage evidence.

1
vet scan -D ./src --code code.db --report-cdx sbom.json

AI tool inventory now becomes part of the same compliance artifact as dependency inventory. See the Shadow AI Detection guide for supported SDKs, detection signatures, and SBOM output format.

Discovery is step one

AI governance starts with visibility. vet ai discover covers the environment: agents, MCP servers, extensions, project configs. vet code scan covers the code: AI SDK imports and API calls. Together, they produce a complete inventory of your AI footprint. JSON and CycloneDX output for automation. Console tables for quick review.

vet is open source and runs locally. No sign-up, no API keys required for local discovery.

1
brew install safedep/tap/vet

Beyond discovery, vet also handles dependency scanning, malware detection, and policy enforcement from the same CLI.

Conclusion

Shadow AI is not a future risk. It is the current state of most engineering organizations. AI tools are installed on developer machines without centralized tracking. AI SDKs are added to codebases without security review. The result is an expanding attack surface that traditional asset inventories and dependency scanners miss entirely.

vet closes that gap. Two commands, two surfaces, one CLI. Run vet ai discover to inventory every agent, MCP server, and extension across your team’s machines. Run vet code scan to find every AI SDK call buried in your application code. The output is structured, diff friendly, and fits into the compliance workflows you already have.

Start with discovery. Governance follows from there.

shadow-ai
mcp
ai-agent-security
code-analysis
vet

Author

SafeDep Team

safedep.io

Share

Malware

Malicious npm Package express-session-js Drops Full RAT Payload

A malicious npm package typosquatting express-session fetches and executes a full Remote Access Trojan from a paste service, targeting browser credentials, crypto wallets, SSH keys, and more.

Malware

Compromised telnyx on PyPI: WAV Steganography and Credential Theft

Analysis of malicious telnyx 4.87.1 and 4.87.2 on PyPI — a package with over 1 million monthly downloads: injected code uses WAV audio steganography to deliver payloads that steal credentials and...

Malware

axios Compromised: npm Supply Chain Attack via Dependency Injection

axios 1.14.1 was published to npm via a compromised maintainer account, injecting a trojanized dependency that executes a multi-platform reverse shell on install. No source code changes in axios...

Malware

Malicious litellm 1.82.8: Credential Theft and Persistent Backdoor

Analysis of compromised litellm 1.82.8 on PyPI: a .pth file triggers credential theft, AWS/K8s secret exfiltration, and persistent C2 backdoor on install.

View All Blogs