feat: add Quality Gates to enforce lint/type-check before marking features by cabana8471-arch · Pull Request #110 · AutoForgeAI/autoforge

cabana8471-arch · 2026-01-26T22:05:14Z

Summary

Implements the Quality Gates proposal from issue #96. This system enforces code quality checks before features can be marked as passing.

Auto-detect linters: ESLint, Biome (JS/TS), ruff, flake8 (Python)
Auto-detect type checkers: TypeScript (tsc), Python (mypy)
Custom scripts: Support for .autocoder/quality-checks.sh
Strict mode: Block feature_mark_passing if quality checks fail (default: enabled)

New MCP Tools

feature_verify_quality - Run quality checks on demand and see results

Modified Behavior

feature_mark_passing now runs quality checks before marking
In strict mode, returns error if lint/type-check fails
Quality results are returned with the response

Configuration

Create .autocoder/config.json in the project directory:

{
  "quality_gates": {
    "enabled": true,
    "strict_mode": true,
    "checks": {
      "lint": true,
      "type_check": true,
      "custom_script": null
    }
  }
}

To disable (not recommended):

{"quality_gates": {"enabled": false}}

Files Changed

File	Changes
`quality_gates.py`	New module with quality checking logic
`mcp_server/feature_mcp.py`	Added `feature_verify_quality`, modified `feature_mark_passing`
`progress.py`	Added `clear_stuck_features()` for auto-recovery

Test plan

Ruff lint passes
Test with TypeScript project (tsc detection)
Test with Python project (ruff/mypy detection)
Test strict mode blocking

Addresses #96

🤖 Generated with Claude Code

Summary by CodeRabbit

New Features
- Added a standalone quality verification tool and a new quality gates module to run linting, type checks, and optional custom checks; returns aggregated results and can be invoked independently.
- Integrated quality gates into feature workflows so checks run before updates and can optionally include quality results in responses.
Improvements
- Replaced in-process locks with atomic, cross-process-safe DB operations and explicit transactions to reduce race conditions.
- Added startup cleanup for stuck in-progress items and improved parallel-safe DB connection handling and dependency/existence validations.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

…tures Implements proposal from issue AutoForgeAI#96 - Quality Gates system that: - Auto-detects linters: ESLint, Biome (JS/TS), ruff, flake8 (Python) - Auto-detects type checkers: TypeScript (tsc), Python (mypy) - Supports custom quality scripts via .autocoder/quality-checks.sh - Runs quality checks before allowing feature_mark_passing - In strict mode (default), blocks marking if checks fail - Stores quality results for evidence New files: - quality_gates.py: Core quality checking module Modified files: - mcp_server/feature_mcp.py: Added feature_verify_quality tool, modified feature_mark_passing to enforce quality gates - progress.py: Added clear_stuck_features() for auto-recovery Configuration (.autocoder/config.json): ```json { "quality_gates": { "enabled": true, "strict_mode": true, "checks": { "lint": true, "type_check": true, "custom_script": null } } } ``` Addresses AutoForgeAI#96 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai · 2026-01-26T22:05:25Z

📝 Walkthrough

Walkthrough

Adds a quality-gates subsystem with a public feature_verify_quality tool, integrates quality checks into feature workflows, replaces in-process locks with transactional SQLite atomic operations for cross-process safety, and adds startup logic to clear stuck in_progress features.

Changes

Cohort / File(s)	Summary
Quality Gates Module `quality_gates.py`	New module implementing detection and execution of lint/type-check/custom checks; `QualityCheckResult`, `QualityGateResult`, `_run_command`, `run_lint_check`, `run_type_check`, `run_custom_script`, `verify_quality`, and `load_quality_config`.
MCP Server — Feature Tools & DB Atomicity `mcp_server/feature_mcp.py`	Adds public `feature_verify_quality() -> str`; integrates quality verification into feature flows (e.g., `feature_mark_passing`) with optional strict-mode blocking and optional `quality_result` in responses; replaces threading locks with explicit SQLite transactions (BEGIN IMMEDIATE/EXCLUSIVE) and atomic SQL updates; strengthens existence, self-dependency, circular/forward-reference checks.
Progress / DB Helpers `progress.py`	Adds `_get_connection(db_file: Path) -> sqlite3.Connection` for timeout/PRAGMA settings; replaces direct connects; adds `clear_stuck_features(project_dir: Path) -> int` to reset `in_progress` flags at startup (duplicate definition noted in diff).
Manifest & Dependencies `manifest_file`, `requirements.txt`, `pyproject.toml`, `setup.py`	Updated packaging/dependency entries to include the new module and any required packages for quality gates.
Responses & Integration Points `mcp_server/...`	Multiple MCP tool responses and command paths now optionally include `quality_result`; many operations converted to atomic SQL sequences to avoid races (create/claim/mark/skip/deps/priority/in_progress updates).

Sequence Diagram(s)

sequenceDiagram
    autonumber
    actor Client
    participant MCP as MCP Server\n(mcp_server/feature_mcp.py)
    participant QG as Quality Gates\n(quality_gates.py)
    participant DB as SQLite DB

    Client->>MCP: feature_verify_quality() / feature_mark_passing()
    MCP->>QG: load_quality_config(project_dir)
    MCP->>QG: verify_quality(do_lint?, do_type?, do_custom?)
    QG->>QG: detect tools & run checks (lint, type, custom)
    QG-->>MCP: QualityGateResult
    alt strict_mode && failed_checks
        MCP-->>Client: error (blocking)
    else checks passed or non-blocking
        MCP->>DB: BEGIN IMMEDIATE/EXCLUSIVE
        MCP->>DB: atomic UPDATE/INSERT (mark passing/flags/deps/etc.)
        DB-->>MCP: commit
        MCP-->>Client: success (optional quality_result)
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related issues

[Proposal] Quality Gates - enforce lint/type-check before feature_mark_passing #96 — Implements Quality Gates, feature_verify_quality, strict-mode behavior, and stuck-feature clearing; aligns with this PR's objectives.

Poem

🐰 I hop through code to check each gate,

Linters hum and types relate.
I nudge the stuck, I guard the pass,
Transactions hold — no race shall last.
A tidy rabbit, quick and spry.

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and specifically describes the main feature addition: quality gates that enforce lint/type-check validation before features can be marked as passing.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

cabana8471-arch · 2026-01-27T05:34:04Z

@coderabbitai review

coderabbitai · 2026-01-27T05:34:13Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 3

🤖 Fix all issues with AI agents

In `@mcp_server/feature_mcp.py`:
- Around line 293-323: The code runs load_quality_config and verify_quality
while a DB session/transaction is open, which can hold locks; move the quality
checks (load_quality_config and verify_quality) to execute before opening or
while the session is closed (or explicitly close the current session before
calling verify_quality and reopen afterwards). Specifically, call
load_quality_config(PROJECT_DIR) and run verify_quality(...) before you
fetch/modify the Feature object and before using session, or if a session must
exist, session.close() before verify_quality and then create a new session to
reload the Feature, update feature.passes/feature.in_progress and call
session.commit(); ensure you preserve and propagate quality_result into the
final result.

In `@progress.py`:
- Around line 235-256: The try/except block that opens the SQLite connection
(conn = sqlite3.connect(db_file)) can leak the DB handle on errors; update the
logic to ensure conn is always closed and transactions rolled back by using a
context manager (with sqlite3.connect(db_file) as conn:) or adding a finally
that calls conn.rollback() if needed and conn.close() if conn is defined, and
move cursor and commit/close logic inside that guarded scope; reference the
variables conn, cursor and the existing exception handlers to locate and replace
the current try/except so all error paths close the connection.

In `@quality_gates.py`:
- Around line 233-273: If a user explicitly configured a custom script
(script_path parameter not None) the function run_custom_script should treat a
missing file as a failing QualityCheckResult instead of returning None; change
the missing-file branch to return a QualityCheckResult dict with "name":
"custom_script", "passed": False, "output": a clear message that the configured
script (include script_path) is missing, and "duration_ms": 0 when script_path
was provided, while keeping the current behavior of returning None only when
script_path was None (i.e., the default script was not present); keep the rest
of run_custom_script (chmod, _run_command, truncation) unchanged.

mcp_server/feature_mcp.py

progress.py

quality_gates.py

…eAI#110) Combined quality gates with atomic SQL updates for parallel safety. - Quality checks run before marking features as passing - Atomic UPDATE prevents race conditions in parallel mode Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

…nzyl#100, leonvanzyl#108, leonvanzyl#109, leonvanzyl#110 PR leonvanzyl#110 (Quality Gates): - Move quality checks before DB session to avoid holding locks - Return error instead of None for missing configured custom script - Use contextlib.closing for SQLite connections in progress.py PR leonvanzyl#109 (Rate Limit): - Extract rate limit logic to shared rate_limit_utils.py module - Remove duplicated code from agent.py and test_agent.py PR leonvanzyl#108 (SQLite Parallel): - Sort imports alphabetically in feature_mcp.py PR leonvanzyl#100 (Config Diagnostics): - Add logger.warning for pkill_processes validation failures PR leonvanzyl#95 (Infrastructure Mock): - Add language tags to fenced code blocks in initializer template Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Move quality checks before DB session to avoid holding locks - Return error instead of None for missing configured custom script - Use contextlib.closing for SQLite connections in progress.py Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 2

🤖 Fix all issues with AI agents

In `@mcp_server/feature_mcp.py`:
- Around line 30-51: The import block for the project-local module is not
alphabetically ordered causing a ruff I001; reorder the names imported from
api.database (atomic_transaction, create_database, Feature) into alphabetical
order (atomic_transaction, create_database, Feature) or simply run `ruff check
--fix mcp_server/feature_mcp.py` to auto-fix; ensure the reordered import
remains directly after the sys.path insertion and before other local imports so
import order is consistent with ruff expectations.

In `@quality_gates.py`:
- Around line 130-143: The npx invocation in _detect_type_checker should include
the --no-install flag to avoid implicit downloads in CI; update the branch that
returns ("tsc", ["npx", "tsc", "--noEmit"]) to instead return ("tsc", ["npx",
"--no-install", "tsc", "--noEmit"]) so npx fails if the binary isn't present
locally.

mcp_server/feature_mcp.py

quality_gates.py

…/type-check

SUMMARY: Fixed TypeScript build error in ProjectSetupRequired.tsx where startAgent was being called with a boolean instead of an options object. DETAILS: - The startAgent API function signature was updated (in previous PR merges) to accept an options object: { yoloMode?, parallelMode?, maxConcurrency?, testingAgentRatio? } - ProjectSetupRequired.tsx was still calling it with the old signature: startAgent(projectName, yoloMode) - passing boolean directly - Changed to: startAgent(projectName, { yoloMode }) - wrapping in options object This was the only remaining build error after merging 13+ PRs from upstream: - PR AutoForgeAI#112: Security vulnerabilities and race conditions - PR AutoForgeAI#89: Windows subprocess blocking fix - PR AutoForgeAI#109: Rate limit handling with exponential backoff - PR AutoForgeAI#88: MCP server config for ExpandChatSession - PR AutoForgeAI#100: Diagnostic warnings for config loading - PR AutoForgeAI#110: Quality gates (quality_gates.py) - PR AutoForgeAI#113: Structured logging (structured_logging.py) - PR AutoForgeAI#48: Knowledge files support (API, schemas, prompts) - PR AutoForgeAI#29: Feature editing/deletion (MCP tools) - PR AutoForgeAI#45: Chat persistence - PR AutoForgeAI#52: Refactoring feature guidance - PR AutoForgeAI#4: Project reset functionality - PR AutoForgeAI#95: UI polish, health checks, cross-platform fixes Build now passes: npm run build succeeds with all 2245 modules transformed.

When tsc is not locally installed, npx without --no-install may prompt or auto-download the package. Use --no-install to fail fast instead, which is more predictable for quality gate checks. Addresses CodeRabbit review feedback. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Includes npx --no-install fix to prevent auto-download of tsc. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Parameters run_lint, run_type_check, run_custom shadowed the function names run_lint_check, run_type_check, run_custom_script, causing "'bool' object is not callable" errors at runtime. Renamed to do_lint, do_type_check, do_custom. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

…/type-check

SUMMARY: Fixed TypeScript build error in ProjectSetupRequired.tsx where startAgent was being called with a boolean instead of an options object. DETAILS: - The startAgent API function signature was updated (in previous PR merges) to accept an options object: { yoloMode?, parallelMode?, maxConcurrency?, testingAgentRatio? } - ProjectSetupRequired.tsx was still calling it with the old signature: startAgent(projectName, yoloMode) - passing boolean directly - Changed to: startAgent(projectName, { yoloMode }) - wrapping in options object This was the only remaining build error after merging 13+ PRs from upstream: - PR AutoForgeAI#112: Security vulnerabilities and race conditions - PR AutoForgeAI#89: Windows subprocess blocking fix - PR AutoForgeAI#109: Rate limit handling with exponential backoff - PR AutoForgeAI#88: MCP server config for ExpandChatSession - PR AutoForgeAI#100: Diagnostic warnings for config loading - PR AutoForgeAI#110: Quality gates (quality_gates.py) - PR AutoForgeAI#113: Structured logging (structured_logging.py) - PR AutoForgeAI#48: Knowledge files support (API, schemas, prompts) - PR AutoForgeAI#29: Feature editing/deletion (MCP tools) - PR AutoForgeAI#45: Chat persistence - PR AutoForgeAI#52: Refactoring feature guidance - PR AutoForgeAI#4: Project reset functionality - PR AutoForgeAI#95: UI polish, health checks, cross-platform fixes Build now passes: npm run build succeeds with all 2245 modules transformed.

…orgeAI#100, AutoForgeAI#108, AutoForgeAI#109, AutoForgeAI#110 PR AutoForgeAI#110 (Quality Gates): - Move quality checks before DB session to avoid holding locks - Return error instead of None for missing configured custom script - Use contextlib.closing for SQLite connections in progress.py PR AutoForgeAI#109 (Rate Limit): - Extract rate limit logic to shared rate_limit_utils.py module - Remove duplicated code from agent.py and test_agent.py PR AutoForgeAI#108 (SQLite Parallel): - Sort imports alphabetically in feature_mcp.py PR AutoForgeAI#100 (Config Diagnostics): - Add logger.warning for pkill_processes validation failures PR AutoForgeAI#95 (Infrastructure Mock): - Add language tags to fenced code blocks in initializer template Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add explicit encoding="utf-8" and errors="replace" to subprocess.run calls in quality_gates.py to fix Windows CP1252 encoding issues. Closes AutoForgeAI#138 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@quality_gates.py`:
- Around line 83-154: The Windows compatibility issue: replace POSIX-only
existence checks for local binaries (e.g., "node_modules/.bin/eslint",
"node_modules/.bin/biome", "node_modules/.bin/tsc" and venv paths like
"venv/bin/ruff", "venv/bin/flake8", "venv/bin/mypy") with OS-aware lookups—use
shutil.which to resolve executables (so it finds .cmd/.exe on Windows) and
additionally check the virtualenv Scripts directory ("venv/Scripts/...") when
the global tool isn't found; update the detection logic in the JS
lint/type-check block (the ESLint/Biome and tsconfig/tsc checks),
_detect_python_linter, and _detect_type_checker to prefer shutil.which results
and fall back to project_dir/venv/Scripts/<tool> (and
project_dir/node_modules/.bin/<tool> as needed) so local Windows installs are
detected.

quality_gates.py

Use shutil.which() with custom path parameter to detect executables in node_modules/.bin and venv directories across platforms. - ESLint/Biome: detect .cmd files on Windows via shutil.which() - Python venv: check venv/Scripts on Windows, venv/bin on Unix - TypeScript: same pattern for tsc detection - mypy: same pattern for venv detection Addresses CodeRabbit review feedback on PR AutoForgeAI#110. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Fix unpacked dict entry type mismatch at line 410 by adding explicit casts to dict[str, Any] for both unpacked dictionaries Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 2

🤖 Fix all issues with AI agents

In `@quality_gates.py`:
- Line 368: Replace the deprecated datetime.utcnow() call with a UTC-aware
timestamp: change uses of datetime.utcnow().isoformat() to
datetime.now(datetime.timezone.utc).isoformat() (or
datetime.now(timezone.utc).isoformat()), and ensure the datetime.timezone (or
timezone) symbol is imported or referenced where the timestamp is constructed so
the value is timezone-aware and doesn't trigger DeprecationWarning.
- Around line 292-296: The call to _run_command currently hardcodes ["bash",
str(script_full_path)] which breaks on Windows; update the logic around the
invocation that prepares the command (referencing _run_command, script_full_path
and project_dir) to choose the shell based on platform and script type: on POSIX
keep bash/sh, on Windows detect if the script has a .ps1 extension and use
PowerShell (or use cmd.exe for .bat/.cmd), and fall back to locating "bash" with
shutil.which if Windows should support Git Bash/WSL; ensure the chosen shell and
arguments are constructed correctly before passing to _run_command and add a
small docstring or comment noting the platform behavior.

🧹 Nitpick comments (1)

quality_gates.py (1)

52-52: Consider moving import time to the top of the file.

Placing imports inside functions is unconventional in Python. While it works, having all imports at the module level improves readability and makes dependencies immediately visible.

quality_gates.py

- Replace deprecated datetime.utcnow() with datetime.now(timezone.utc) - Add cross-platform Windows support for custom scripts (.bat, .cmd, .ps1) - Move `import time` from function body to module level Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai bot reviewed Jan 27, 2026

View reviewed changes

mcp_server/feature_mcp.py Outdated Show resolved Hide resolved

progress.py Show resolved Hide resolved

quality_gates.py Outdated Show resolved Hide resolved

coderabbitai bot reviewed Jan 27, 2026

View reviewed changes

mcp_server/feature_mcp.py Show resolved Hide resolved

quality_gates.py Show resolved Hide resolved

fix: correct import order per ruff (uppercase before lowercase)

f7d9d15

getworken pushed a commit to getworken/autocoder that referenced this pull request Jan 27, 2026

Add quality_gates.py from PR AutoForgeAI#110 - Quality gates for lint…

2916bc0

…/type-check

getworken mentioned this pull request Jan 27, 2026

Merge 15+ PRs: Security, Windows fixes, Knowledge files, Chat persistence, and more #117

Open

cabana8471-arch added a commit to cabana8471-arch/autocoder that referenced this pull request Jan 27, 2026

Merge quality gates fixes from PR AutoForgeAI#110

22d010c

Includes npx --no-install fix to prevent auto-download of tsc. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

cabana8471-arch and others added 3 commits January 28, 2026 06:36

fix: sort imports alphabetically to fix ruff I001

9a0202b

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

fix: organize imports in feature_mcp.py

b4f4aae

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

getworken pushed a commit to getworken/autocoder that referenced this pull request Jan 29, 2026

Add quality_gates.py from PR AutoForgeAI#110 - Quality gates for lint…

b930d08

…/type-check

fix: add UTF-8 encoding for subprocess calls on Windows

374a565

Add explicit encoding="utf-8" and errors="replace" to subprocess.run calls in quality_gates.py to fix Windows CP1252 encoding issues. Closes AutoForgeAI#138 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai bot reviewed Jan 30, 2026

View reviewed changes

quality_gates.py Outdated Show resolved Hide resolved

fix: resolve mypy type error in quality_gates.py

8f98b51

- Fix unpacked dict entry type mismatch at line 410 by adding explicit casts to dict[str, Any] for both unpacked dictionaries Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

coderabbitai bot reviewed Jan 31, 2026

View reviewed changes

quality_gates.py Show resolved Hide resolved

quality_gates.py Outdated Show resolved Hide resolved

Conversation

cabana8471-arch commented Jan 26, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

New MCP Tools

Modified Behavior

Configuration

Files Changed

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Poem

Uh oh!

cabana8471-arch commented Jan 27, 2026

Uh oh!

coderabbitai bot commented Jan 27, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

cabana8471-arch commented Jan 26, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 26, 2026 •

edited

Loading