open methodology

How Closeread audits a codebase in 48 hours.

Ten specialists run in parallel. A heterogeneous reviewer refutes the weak findings. A citation gate validates that every survivor cites real code at a specific line. The signed packet is in your inbox within 48 hours.

This page is the full pipeline, step by step. The methodology is open source: github.com/FreeGuy-AI/closeread-io-methodology.

25

OSS audits run

$0.10

avg cost per audit

100%

findings cite code

Open

methodology + pricing

The 5-step pipeline

From repo URL to signed packet. Every step is logged, cost-stamped, and auditable.

Intake Hour 0

You provide a repo URL, branch, and an optional questionnaire (listing platform, target timeline, concerns you want foregrounded). We provision a read-only deploy key scoped to the audit window. The key never touches more than the one repo and is revoked on packet delivery. Your source code never leaves our Cloudflare-backed ephemeral compute and is not retained beyond packet generation. Findings reference filepath:line, never raw source.
Scanner swarm Hours 1-12

Ten specialist agents run in parallel, each focused on one buyer DD question. They read the codebase, surface findings, assign severity, assign a confidence score (0.0-1.0), and cite the specific file and line that triggered the finding. No specialist shares context with another; they each start from the same read-only checkout.
- License Compliance
- SCA / Dependencies
- API Versioning
- DB Migration Risk
- Security
- Reliability
- Test Coverage
- Stack + Hireability
- Key Person Risk
- Third-Party API
Adversarial review Hours 12-24

A reviewer model, running on a different provider than the specialists, receives each raw finding and attempts to refute it with specific evidence. The reviewer must cite a concrete reason a finding is wrong; "this might be a false positive" is not sufficient. Findings the reviewer cannot specifically refute survive. Findings it can refute specifically are dropped with a reason logged. Findings with a confidence score below 0.40 skip adversarial review entirely: low-confidence structural findings (like "we cannot determine the license of this vendored file") are themselves the signal. Sending them to a reviewer who also cannot determine the answer produces noise, not quality control.
Packet generation Hours 24-36

Surviving findings are assembled into the packet: a buyer-facing summary (the artifact a buyer's DD team would produce themselves in week one of LOI) and a developer-facing finding ledger with remediation guidance. The packet is rendered to Markdown and PDF, signed with an Ed25519 key, and stamped with the exact dollar cost of the compute that produced it. Every finding in the ledger cites the specific file and line that triggered it.
Delivery Hours 36-48

You receive the packet via email: Markdown source, branded PDF, and the cost metadata JSON. A 30-day Q&A window opens immediately. You can ask follow-up questions on any finding and we will answer from the codebase state at audit time. At delivery, we ask (separately, not required) whether you want to make the packet public. If yes, it goes on closeread.io. If no, it stays private. You can revoke consent and take down a public packet at any time.

Pipeline at a glance

Three stages. Each specialist produces raw findings. The reviewer passes or drops each one. The gate validates that surviving findings cite code before anything reaches the packet.

Why the reviewer runs on a different model

The specialists that surface findings run on Anthropic's models. The adversarial reviewer that challenges those findings runs on a different provider: DeepSeek, Google Gemini, or OpenAI, rotated randomly across audits.

This is not variety for variety's sake. A reviewer that shares a model family with the specialists cannot catch the specialists' systematic mistakes. If Claude-based specialists share a bias (a pattern of false positives on a specific framework, a blind spot on a particular license type), a Claude-based reviewer will share the same bias and pass the same bad findings. An adversarial review only works if the reviewer is epistemically independent from the producer.

This property is the load-bearing implementation of the heterogeneous rebut-pattern from ICML MAS 2025 research on multi-agent systems. The verifier needs to be independent from what it verifies. We applied that to the audit pipeline from Day 1.

Layer	Provider	Role
Specialists (all 10)	Anthropic	Surface findings from the codebase
Adversarial reviewer	DeepSeek · Gemini · OpenAI (rotated)	Refute weak findings with specific evidence
Code review (pipeline itself)	OpenAI Codex	Review every commit to the audit pipeline before merge
Local backend (Phase 2)	Llama 3.3 70B local via Ollama	On-machine reviewer for audits that must not leave the network

The reviewer backend rotates randomly across each audit so no single-provider bias accumulates across the cohort. Per-audit cost metadata records which backend was used and what each call cost. The full breakdown (Anthropic specialist cost + reviewer cost) is stamped on every packet.

The citation gate

Every finding that survives adversarial review passes through one more check before it reaches the packet: it must cite a specific file and line in the codebase. No citation, no finding.

This is not pedantry. Uncited findings in a due diligence packet are useless. A buyer's counsel cannot act on "this project may have license compliance issues." They can act on "this dependency at package.json:34 is GPL-3.0, which triggers source-disclosure requirements if linked at distribution time."

The gate also has a confidence floor: findings the specialist scored below 0.40 confidence skip adversarial review entirely. Low-confidence structural findings (like "we cannot determine the license of this vendored file") are not weaknesses in the pipeline. They are honest signals. Sending them to a reviewer that also cannot determine the answer produces rejection noise, not quality control. The confidence floor was set after observing that adversarial reviewers over-reject when uncertain: the default behavior without a floor is "reject findings that might be wrong" rather than "reject findings that are provably wrong." The floor forces the default back to "pass unless specifically refutable."

Default: pass

The reviewer must refute a finding with specific evidence to drop it. Uncertainty is not grounds for rejection. If the reviewer cannot point to a specific reason a finding is wrong, the finding survives.

Confidence floor: 0.40

Findings scored below 0.40 by the specialist skip adversarial review. These are genuine low-confidence signals, not noise. They ship in the packet with their confidence score visible, flagged as "low confidence, manual review recommended."

Sample finding

This is a real finding structure from a Closeread audit on an open-source codebase, anonymized. It shows what the developer-facing ledger looks like: finding type, severity, confidence, exact file and line, explanation, and remediation estimate.

FINDING GPL-3.0 dependency may trigger source-disclosure under copyleft.

Severity: HIGH Confidence: 0.85 Source: scanner.license_compliance

package.json:34

"bcrypt-cli": "^2.4.1"

This dependency is licensed GPL-3.0. If linked into closed-source software at distribution time, it may trigger source-disclosure requirements under the GPL. A buyer's legal review will flag this in week one of due diligence.

Recommended: replace with bcryptjs (MIT) or argon2 (Apache-2.0). Estimated effort: under 2 hours including test updates.

The buyer-facing summary groups findings by risk category (IP, operational, technical) and presents the top 10 by severity. The developer-facing ledger contains every surviving finding with full citations and remediation guidance. Both ship together in the same packet.

What this audit is not

Closeread audits are useful. They are not everything. Be clear on the distinction before you buy.

Not a legal opinion. A license finding tells you what license a dependency uses and why that matters in an acquisition context. It does not constitute legal advice. Run material findings by IP counsel before relying on them in an LOI or reps-and-warranties clause.
Not a security certification. The security specialist surfaces patterns associated with vulnerabilities (hardcoded secrets, missing input validation, insecure defaults). It does not run a penetration test, does not confirm exploitability, and does not certify that the codebase is secure. For a pen test, hire a pen tester.
Not a substitute for professional diligence. Closeread is a first-pass tool that surfaces what a buyer's technical DD team would find in week one. It does not replace a full technical diligence engagement. Use it to prepare for diligence, not to skip it.
Not a compliance audit. We audit the codebase, not the business. SOC 2, GDPR, HIPAA readiness is out of scope unless specifically contracted. The packet does not support compliance certification claims.
Not a valuation. The packet does not include a valuation or multiples analysis. It surfaces technical risk that affects valuation; quantifying that risk in dollar terms is the seller's and broker's job.

We audit. We do not certify. The distinction matters in a transaction context, and we will always say so clearly.

Pricing

Founding Alpha is open now. Eleven seats total. Ten paid at $500. One free, by invitation. Beta and Standard open after Alpha closes.

Founding Alpha

$500

10 paid seats, open now

48-hour turnaround
10-specialist swarm
30-day Q&A window
Ed25519-signed packet
Alpha price locked in
Public packet optional

Beta

$1,500

opens after Alpha closes (Day 90-120)

Same audit, matured methodology
48-hour turnaround
30-day Q&A window
Alpha waitlist members first

Standard

$2K+

complexity-tiered by codebase size

$2,000 to $10,000+
On-machine option (Year 2)
Custom SLA
Buyer-side packages available

Ready to apply? Six fields, under three minutes.

Apply for Founding Alpha →

Not ready to apply? Read the full methodology first: github.com/FreeGuy-AI/closeread-io-methodology. Sample packets on real codebases: closeread.io/samples.

If the methodology seems useful, the highest-leverage thing you can do is star the repo. No obligation, no email needed.

★ Star on GitHub →

built on

Anthropic model

OpenAI model

DeepSeek model

Google model

Cloudflare infra

Stripe payments

Heterogeneous model review on every audit. Specialists run on one provider; adversarial reviewer runs on another. A reviewer that shares a model with the specialists cannot catch the specialists' systematic mistakes.