Architecture

This document describes vigil’s internal structure, the analysis engine flow, and the analyzer protocol.

Project structure

vigil-cli/
  src/vigil/
    __init__.py                   # __version__
    cli.py                        # Click commands (scan, deps, tests, init, rules)
    config/
      __init__.py
      schema.py                   # Pydantic v2 models for configuration
      loader.py                   # Config loading and merging (YAML + CLI)
      rules.py                    # Catalog of 26 rules (RULES_V0)
    core/
      __init__.py
      finding.py                  # Severity, Category, Location, Finding
      engine.py                   # ScanEngine, ScanResult
      file_collector.py           # File discovery
      rule_registry.py            # RuleRegistry for rule access
    analyzers/
      __init__.py
      base.py                     # BaseAnalyzer Protocol
    reports/
      __init__.py
      formatter.py                # BaseFormatter Protocol + factory
      human.py                    # Terminal format with colors
      json_fmt.py                 # Structured JSON format
      junit.py                    # JUnit XML format
      sarif.py                    # SARIF 2.1.0 format
      summary.py                  # Summary generator (counts)
    logging/
      __init__.py
      setup.py                    # structlog configuration
  tests/
    conftest.py                   # Global fixtures
    test_cli.py                   # CLI tests
    test_core/
      test_finding.py
      test_engine.py
      test_file_collector.py
    test_config/
      test_schema.py
      test_loader.py
      test_rules.py
    test_reports/
      test_formatters.py
    fixtures/                     # Test files

Data models

Severity

String enum with 5 levels, ordered from highest to lowest criticality:

class Severity(str, Enum):
    CRITICAL = "critical"
    HIGH = "high"
    MEDIUM = "medium"
    LOW = "low"
    INFO = "info"

Using str, Enum allows direct comparison with strings and JSON serialization without conversion.

Location

Dataclass that indicates where the problem was found:

@dataclass
class Location:
    file: str                    # File path
    line: int | None = None      # Line (1-based)
    column: int | None = None    # Column (1-based)
    end_line: int | None = None  # End line (for ranges)
    snippet: str | None = None   # Code snippet

Finding

Dataclass that represents an individual finding:

@dataclass
class Finding:
    rule_id: str                        # "DEP-001", "AUTH-005"
    category: Category                  # Category.DEPENDENCY
    severity: Severity                  # Severity.CRITICAL
    message: str                        # Problem description
    location: Location                  # Where it was found
    suggestion: str | None = None       # How to fix it
    metadata: dict[str, Any] = field(default_factory=dict)

    @property
    def is_blocking(self) -> bool:
        return self.severity in (Severity.CRITICAL, Severity.HIGH)

The is_blocking property determines whether the finding should block a merge (by default, CRITICAL and HIGH are blocking).

Engine flow

The ScanEngine is the central orchestrator. Its run() method executes the complete pipeline:

                    run(paths)
                       |
            +----------v-----------+
            |  1. Collect files    |
            |  (file_collector)    |
            +----------+-----------+
                       |
            +----------v-----------+
            |  2. Run analyzers    |
            |  (for each analyzer) |
            +----------+-----------+
                       |
            +----------v-----------+
            |  3. Apply overrides  |
            |  (rule_overrides)    |
            +----------+-----------+
                       |
            +----------v-----------+
            |  4. Sort findings    |
            |  (by severity)       |
            +----------+-----------+
                       |
                       v
                  ScanResult

Step 1: Collect files

file_collector.collect_files() receives the user’s paths and returns a list of files to scan:

Resolves directories recursively with Path.rglob("*").
Filters by language extensions (LANGUAGE_EXTENSIONS).
Excludes configured patterns (node_modules/, .venv/, etc.).
Always includes dependency files (requirements.txt, package.json, etc.) regardless of the language filter.
Deduplicates while preserving order.

Step 2: Run analyzers

For each registered analyzer:

Checks whether it should run (_should_run()): respects --category and --rule filters.
Calls analyzer.analyze(files, config).
Collects the returned findings.
Catches exceptions per analyzer (a failed analyzer does not stop the others).

Step 3: Apply overrides

_apply_rule_overrides() processes the rules: section of the configuration:

If a rule has enabled: false, its findings are removed.
If a rule has severity: "low", the finding’s severity is modified.
If a rule is in exclude_rules (from --exclude-rule), it is removed.

Step 4: Sort

Findings are sorted by severity in descending order (CRITICAL first, INFO last) using SEVERITY_SORT_ORDER.

Analyzer protocol

Each analyzer implements the BaseAnalyzer protocol:

class BaseAnalyzer(Protocol):
    @property
    def name(self) -> str: ...

    @property
    def category(self) -> Category: ...

    def analyze(self, files: list[str], config: ScanConfig) -> list[Finding]: ...

Contract

name: Unique name of the analyzer (e.g., "dependency", "auth").
category: Category of findings it generates.
analyze(): Receives the list of files and the configuration, returns findings.

Rules for implementing an analyzer

Deterministic: The same input always produces the same output.
No side effects: Does not modify files, does not write to stdout.
Internal error handling: If a file cannot be read, the analyzer ignores it and continues.
Logging to stderr: Use structlog for debug/info logs.
Respect configuration: Read thresholds and options from ScanConfig.

Implementation example

from vigil.analyzers.base import BaseAnalyzer
from vigil.config.schema import ScanConfig
from vigil.core.finding import Category, Finding, Location, Severity

class DependencyAnalyzer:
    @property
    def name(self) -> str:
        return "dependency"

    @property
    def category(self) -> Category:
        return Category.DEPENDENCY

    def analyze(self, files: list[str], config: ScanConfig) -> list[Finding]:
        findings: list[Finding] = []
        # ... analysis logic ...
        return findings

No inheritance is required — only satisfying the Protocol (structural typing).

Configuration system

Three layers with progressive merging

Defaults (schema.py) < YAML file (.vigil.yaml) < CLI flags

Defaults: Defined as default values in Pydantic models (ScanConfig, DepsConfig, etc.).
YAML: Loaded with pyyaml and validated with Pydantic.
CLI: Click flags that override specific fields.

Loader

load_config() in config/loader.py:

Finds the config file (manual with --config, or auto-detection by walking up the directory tree).
Parses the YAML.
Creates a ScanConfig instance with the YAML values.
Applies CLI overrides on the instance.
Returns the final configuration.

Validation

Pydantic v2 automatically validates:

Data types (min_age_days is int, not string).
Valid values (fail_on is one of critical/high/medium/low).
Nested models (deps, auth, secrets, tests, output).

Rule catalog

The 26 rules are defined in config/rules.py as RuleDefinition instances:

@dataclass
class RuleDefinition:
    id: str                              # "DEP-001"
    name: str                            # "Hallucinated dependency"
    description: str                     # Long description
    category: Category                   # Category.DEPENDENCY
    default_severity: Severity           # Severity.CRITICAL
    enabled_by_default: bool = True
    languages: list[str] | None = None   # None = all
    owasp_ref: str | None = None         # "LLM03"
    cwe_ref: str | None = None           # "CWE-829"

RuleRegistry

Provides indexed access to the catalog:

registry.get("DEP-001") — get a rule by ID.
registry.all() — all rules.
registry.by_category(Category.AUTH) — rules in a category.
registry.by_severity(Severity.CRITICAL) — rules of a severity.
registry.enabled_rules(overrides) — enabled rules after applying overrides.

Formatters

Protocol

class BaseFormatter(Protocol):
    def format(self, result: ScanResult) -> str: ...

Factory

get_formatter(format_name) returns the correct class with lazy import:

"human"  -> HumanFormatter
"json"   -> JsonFormatter
"junit"  -> JunitFormatter
"sarif"  -> SarifFormatter

Output flow

ScanResult -> Formatter.format() -> string -> stdout or file

The CLI decides where to send the output:

Without --output: stdout.
With --output: writes to file (and also to stdout for human format).

Logging

structlog

vigil uses structlog for structured logging:

Verbose mode (-v): Level DEBUG, with timestamps and key-value pairs.
Normal mode: Level WARNING, minimalist output.
Output always to stderr: Logs never go to stdout. This allows vigil scan -f json | jq without contaminating the JSON with logs.

Example logs in verbose mode

2024-01-15 10:30:00 [info] files_collected count=42
2024-01-15 10:30:00 [info] analyzer_start name=dependency
2024-01-15 10:30:01 [info] analyzer_done name=dependency findings=2

External dependencies

Dependency	Version	Purpose
`click>=8.1`	CLI framework	Subcommands, options, automatic help
`pydantic>=2.0`	Validation	Configuration models with validation
`httpx>=0.27`	HTTP client	Requests to PyPI/npm (async-capable)
`structlog>=24.1`	Logging	Structured logging to stderr
`pyyaml>=6.0`	YAML parser	Loading configuration files

Development dependencies

Dependency	Version	Purpose
`pytest>=8.0`	Testing	Test framework
`pytest-cov>=5.0`	Coverage	Test coverage reporting
`ruff>=0.4`	Linting	Python linter and formatter

Design decisions

Why Protocol and not ABC

typing.Protocol (structural typing) is used instead of abc.ABC (nominal typing) for:

Flexibility: Analyzers don’t need to inherit from a base class.
Testing: It’s trivial to create fakes/mocks that satisfy the protocol.
Decoupling: Modules don’t depend on the base class.

Why dataclasses and not Pydantic for Finding

Finding, Location, and RuleDefinition are internal data models that don’t need validation.
Pydantic is reserved for user configuration where validation is critical.
Dataclasses are lighter and faster for data that is created internally.

Why structlog

Structured logging (key-value) facilitates parsing and filtering.
Clear separation of output (stdout) vs logs (stderr).
Centralized configuration with processors.

Why not async

vigil V0 is synchronous. The reasons:

Most operations are filesystem I/O, which is fast.
HTTP requests to the registry can be made with synchronous httpx.
The simplicity of synchronous code facilitates debugging and testing.
It can be migrated to async in future versions if performance requires it.