Anthropic Claude Security Public Beta: AI Code Vulnerability Scanning

hero

Anthropic just opened Claude Security to public beta — an automated code vulnerability scanner powered by Claude Opus 4.7. If your team reviews pull requests, maintains a private codebase, or handles customer data in your repository, this tool is directly relevant to your workflow. The big shift is not the feature itself but what it signals: security review is moving from human eyes to model inference, and that changes your CI pipeline whether you opt in or not.


Quick answer

  • Claude Security is in public beta as of mid-2025, powered by Opus 4.7, and requires no separate API configuration to attach to a team workspace.
  • In beta testing it flagged over 10,000 defects across participating codebases — but the more important question is what data the model read while doing so.
  • Before you enable it: lock down repository access scope, audit the token used for connection, and decide whether scanner results sit above or below human review in your merge process.

Citation-ready summary

Verified on: 2026-06-01

Definition: Anthropic Claude Security is a static code analysis service that uses the Claude Opus 4.7 model to identify vulnerabilities in source code. It is designed to integrate at the team level without requiring manual API credential setup.

Main answer: The tool automates a step that previously required human reviewers or separate SAST (Static Application Security Testing) tooling. It reports vulnerability findings that teams can act on before merge — but the scanner's access scope, the token that authorizes it, and the destination of scan results must be explicitly reviewed before production use.

Use condition: These claims apply to the public beta environment as announced. Behavior may change at general availability. Teams with private repositories containing customer data or regulated information should verify data handling terms before enabling.


Key terms

SAST (Static Application Security Testing): A method of analyzing source code for security vulnerabilities without executing it. Claude Security falls into this category — it reads your code, not your running application.

Scope creep (in security tools): When a tool gains access to more resources than it needs for its stated function. Relevant here because "no API setup required" often means the authorization was granted at a broader workspace level, not per-repository.

Merge gate: A required check in a CI/CD pipeline that must pass before a pull request can be merged. Claude Security results can be configured as a gate (blocking merges) or as an advisory signal (informing reviewers without blocking).

Token revocation path: The documented process for revoking an API key or OAuth token that authorizes a third-party service. You need to know this before you hand a scanner access to your codebase.


1. Why this matters now

The workflow change is not subtle. Until now, security review in most teams meant: write code → open PR → wait for a human reviewer to catch issues → eventually merge. That human step carried implicit judgment about what counted as a real risk versus a false alarm.

Claude Security inserts a model before that human step — or, depending on how you configure it, instead of it. IBM and Glasswing joining the beta is a signal that enterprise teams are already operationalizing this pattern, not just experimenting with it.

The real pressure point is the phrase "no separate API setup required." That convenience is real, but it surfaces a question your team needs to answer: what level of authorization was implicitly granted when the service was connected? A scanner that reads your entire organization's repositories to find vulnerabilities in one service has a much larger blast radius than one scoped to a single repo.


2. The core idea

The central shift is this: Claude Security moves vulnerability detection from a human-reviewed step to a model-inferred step, and the correctness of the result is not the only thing that matters — the access path matters just as much.

Think of it like hiring a contractor to audit your building. A thorough auditor will find problems you missed. But before they walk through every room, you want to know exactly which rooms they can enter, what notes they take, where those notes are stored, and who can read them afterward. The audit's value depends on the trust envelope around the auditor, not just their skill.

Dimension Human reviewer Claude Security
Speed Hours to days Minutes
Consistency Variable Consistent per model version
Data exposure Limited to reviewer's access Depends on token scope
False positive rate Judgment-based Model confidence-based
Accountability Named person Service + configuration

The table shows the trade-off clearly: you gain speed and consistency, but you trade away the natural scope limitation that comes from having a person review code. A person only reads what they can see. A scanner reads what the token allows.


3. How to implement it

Before you run anything, gather three pieces of information from your workspace settings:

# 1. List which repositories the service account can access
gh api /orgs/{org}/installations --jq '.[].repositories_url'

# 2. Identify the token or OAuth app used for the scanner connection
gh api /orgs/{org}/oauth_authorizations --jq '.[] | select(.app.name | test("claude|anthropic"; "i"))'

# 3. Check where scan results are written (webhook target, dashboard, or PR comment)
gh api /repos/{owner}/{repo}/hooks --jq '.[] | {name, config}'

Once you know the scope, set the narrowest access you can before enabling the scanner:

# .github/claude-security.yml — example scope config (structure may vary at GA)
scan:
  include:
    - "src/**"
    - "lib/**"
  exclude:
    - "test/**"
    - "fixtures/**"
    - "**/*.env"
    - "**/*.pem"
result_output:
  destination: pr_comment       # options: pr_comment | webhook | dashboard
  block_merge_on: critical      # options: critical | high | any | none

Set block_merge_on: none initially. Treat the first two weeks as calibration: collect what the scanner flags, review each finding manually, and measure false positive rate before you promote it to a blocking gate.

To verify the scanner is working and scoped correctly:

# Introduce a known vulnerability in a test branch (e.g., SQL injection in a fixture file)
# Then confirm the scanner flags it and does NOT read outside your configured include paths

git checkout -b test/scanner-scope
echo "SELECT * FROM users WHERE id = '" + userInput + "'" > test_vuln.py
git push origin test/scanner-scope
# Open a draft PR and check scanner output within 5 minutes

Expected output in the PR: a comment from the Claude Security bot identifying the injection pattern, citing the file and line number, with a severity rating. If you see no comment within 10 minutes, check webhook delivery logs.


4. What to watch in production

False positive fatigue is the first failure mode. If the scanner flags 40 issues on every PR and developers start dismissing them without reading, the gate becomes theater. Set severity thresholds and review the false positive rate weekly for the first month.

Data residency is the second concern. If your codebase contains customer PII, financial data, or anything under GDPR or HIPAA scope, confirm where scan results are stored and for how long before enabling. "Scan result" is still a text artifact that may contain code snippets, and snippets may contain the sensitive data you were trying to protect.

Token hygiene: The authorization token connecting the scanner should be scoped to read-only code access, should be tied to a service account (not a developer's personal account), and should be rotated quarterly. Confirm a revocation path exists before you hand over access — if the service goes down or is compromised, you need to be able to cut access in under five minutes.

Mac vs. Linux CI differences: Local testing on macOS may not reproduce the same path resolution behavior as your Linux CI runners. Always validate scanner configuration in a CI environment, not just locally.

The practical rule of thumb: run Claude Security as an advisory layer first. Let it build a track record on your specific codebase. Promote it to a blocking gate only after you have measured its false positive rate and confirmed its data handling matches your compliance requirements.


Sources and checks

Verified on: 2026-06-01

Claim Evidence How to verify Limit
Claude Security is in public beta powered by Opus 4.7 Anthropic public beta announcement Check anthropic.com/news for the release note Beta terms may differ from GA behavior
Over 10,000 defects found in beta Cited in Anthropic announcement Request beta report or case study from Anthropic Self-reported; methodology not independently audited
No separate API setup required Anthropic product description Check workspace settings after connecting "No setup" likely means workspace-level OAuth, not no auth
IBM and Glasswing are early adopters Named in announcement Check their engineering blogs or press releases Partnership scope not publicly detailed
Scanner access scope is configurable Standard practice for SAST tools Review .yml config documentation at launch Config schema may change before GA

No independent third-party audit of Claude Security's data handling practices was available at the time of writing. Before enabling in a regulated environment, request Anthropic's data processing addendum and confirm it against your compliance requirements.


FAQ

When should I use Anthropic Claude Security?

It fits best when your team already does code review but lacks dedicated security engineering bandwidth. If pull requests are being merged without any security check today, Claude Security as an advisory layer is a low-friction starting point. If you already run a SAST tool like Semgrep or CodeQL, treat Claude Security as a second opinion rather than a replacement — at least until you have calibrated its false positive rate on your codebase.

What should I check before applying Anthropic Claude Security in production?

Three things in order: the repository access scope of the token used to connect the service, the destination and retention policy for scan results, and whether scanner output will block merges or serve as advisory only. These are not optional audits — getting them wrong means you could expose private code to a third-party service or create an unmonitored auto-merge path. Do all three before you flip the switch.

What is the easiest way to verify the result?

Introduce a deliberate, low-stakes vulnerability in a test branch — a SQL injection string in a comment or a hardcoded credential in a fixture file — and confirm the scanner catches it within a single PR cycle. Then verify the scanner did not read files outside your configured include paths. This two-minute smoke test tells you both whether detection is working and whether scope is respected.


Closing

Claude Security is a real productivity gain for teams without dedicated security reviewers — but its value is only as good as the access boundary you put around it. Audit the token scope, confirm result storage, and start it as an advisory layer before you let it block merges.

Next step: check your workspace's connected OAuth apps today and confirm which repositories the service can reach before the beta expands further.


🐦 Faster updates on X: @baegseungh7061
📚 More in this series: AI Insights
💌 Subscribe: Follow on X or grab the RSS

댓글