Add continuous autonomous security testing framework (v3.0) by devatsecure · Pull Request #35 · devatsecure/Argus-Security

devatsecure · 2026-03-04T06:26:03Z

Description

This PR introduces a comprehensive continuous autonomous security testing framework that closes the gap between periodic security validation and every-deploy security coverage. It adds five new core modules and supporting infrastructure to enable diff-aware scanning, automated fix generation with PR creation, persistent findings tracking, and LLM-powered vulnerability chain discovery.

Type of Change

New feature (non-breaking change which adds functionality)
Documentation update

Changes Made

Core Modules Added

scripts/diff_impact_analyzer.py — Diff-intelligent scanner scoping
- DiffClassifier: Classifies changed files as security-relevant or skippable using pattern matching
- DiffImpactAnalyzer: Expands changed files to their security blast radius via reverse dependency lookup
- DiffScopeBuilder: Combines classification and impact analysis into scanner-ready scopes with Semgrep CLI helpers
- Enables focused scanning on only changed files and their dependents, reducing scan time and noise
scripts/autofix_pr_generator.py — Automated PR generation from remediation suggestions
- AutoFixPRGenerator: Creates git branches, applies code fixes (diff or full-file replacement), commits with descriptive messages
- ClosedLoopOrchestrator: Orchestrates find → fix → verify → PR cycle with confidence-based filtering
- FixBranch, FixPR, LoopResult: Dataclasses for structured results and JSON serialization
- Generates merge-ready PRs with formatted bodies containing vulnerability context
scripts/findings_store.py — Persistent SQLite-backed findings database
- Cross-scan deduplication via content-based fingerprinting
- Automatic regression detection (previously fixed findings that reappear)
- Severity trend analytics and mean-time-to-fix metrics
- Historical context injection for LLM enrichment
- Thread-safe write operations with reentrant locking
scripts/app_context_builder.py — Unified application context model
- Inspects project structure, dependencies, imports, and infrastructure configuration
- Detects framework, language, auth mechanisms, API endpoints, middleware chains
- Identifies cloud provider, IaC files, Docker/Kubernetes presence
- Provides context to all pipeline phases for stack-aware scanning and enrichment
- Fast detection with file-count caps and early exits
scripts/agent_chain_discovery.py — LLM-powered vulnerability chain discovery
- Complements rule-based VulnerabilityChainer with AI-driven multi-step attack reasoning
- AgentChainDiscovery: Uses LLM to discover novel attack paths beyond static rules
- CrossComponentAnalyzer: Identifies inter-module vulnerability combinations
- AttackChain, CrossComponentRisk: Structured dataclasses for chain results
- Batch processing with configurable limits to control LLM costs

Supporting Infrastructure

scripts/sast_dast_validator.py — SAST-to-DAST validation bridge
- Generates targeted HTTP tests from SAST findings against live targets
- Validates exploitability in running applications (staging/preview/dev)
- Safety guards: blocks production by default, rejects private IP ranges, truncates responses
- Payload libraries for SQL injection, XSS, SSRF, path traversal, command injection
GitHub Actions Workflows:
- .github/workflows/argus-retest.yml: Automatically retests after fix PRs are merged
- .github/workflows/post-deploy-scan.yml: Triggers security validation on deployment success
Configuration: Extended scripts/config_loader.py with toggles for all v3.0 features
- enable_diff_scoping, enable_findings_store, enable_app_context, enable_agent_chain_discovery, etc.
Documentation:
- docs/CONTINUOUS_SECURITY_TESTING_GUIDE.md: Comprehensive guide mapping current capabilities, gaps, and implementation paths
- Updated README.md, `CHANGELOG.

https://claude.ai/code/session_017NQsm2eBxfioLrad1C7keZ

Note

^{Cursor Bugbot is generating a summary for commit c235c1e. Configure here.}

Maps Argus's current capabilities against the emerging continuous autonomous pentesting model (diff-aware scanning, AutoFix→PR→Retest loops, persistent knowledge base, agent-driven attack chaining, deployment-triggered scanning, code-to-runtime context). Includes concrete implementation paths and priority ordering for each gap. https://claude.ai/code/session_017NQsm2eBxfioLrad1C7keZ

Implements continuous autonomous security testing capabilities: - diff_impact_analyzer: Diff-intelligent scanner scoping with blast radius expansion via reverse dependency lookup - agent_chain_discovery: LLM-powered multi-step attack chain discovery with cross-component vulnerability analysis - autofix_pr_generator: AutoFix PR generation with closed-loop find→fix→verify orchestration - findings_store: SQLite-backed persistent findings with regression detection, MTTF, and historical context for LLM prompts - app_context_builder: Auto-detects framework, language, auth, cloud provider, IaC, middleware, and entry points for context-aware scanning - sast_dast_validator: SAST-to-DAST live validation with safety guards against production targets - GitHub Actions workflows for post-deploy scanning and automated retest Adds 13 config keys, integrates all modules into hybrid_analyzer.py pipeline, and includes 36 passing tests. https://claude.ai/code/session_017NQsm2eBxfioLrad1C7keZ

- README: Add v3.0 continuous security testing feature table, env vars, deployment-triggered scanning section, and guide doc link - CLAUDE.md: Add v3.0 summary, 6 new key files, and guide reference - CHANGELOG: Add v6.0.0 release notes with all 7 new modules, workflows, config keys, and 36 tests https://claude.ai/code/session_017NQsm2eBxfioLrad1C7keZ

github-actions · 2026-03-04T06:27:07Z

✅ Hybrid Security Scan Results

Status: No critical or high severity issues

📊 Findings Summary

Severity	Count
🔴 Critical
🟠 High
🟡 Medium
🟢 Low
📈 Total	****

🛠️ Tools Used

✅ Semgrep (SAST)
✅ Trivy (CVE Scanning)

📈 Metrics

⏱️ Scan Duration: s
💰 Cost: $

🔗 Links

Powered by Argus Hybrid Analyzer

.github/workflows/argus-retest.yml

+        run: |
+          BRANCH="${{ github.event.pull_request.head.ref }}"
+          # Extract vuln type and finding ID from branch name: argus/fix-{type}-{id}
+          VULN_TYPE=$(echo "$BRANCH" | sed 's|argus/fix-||' | sed 's|-[a-f0-9]*$||')
+          FINDING_ID=$(echo "$BRANCH" | grep -oP '[a-f0-9]{8}$' || echo "unknown")
+          echo "vuln_type=$VULN_TYPE" >> $GITHUB_OUTPUT
+          echo "finding_id=$FINDING_ID" >> $GITHUB_OUTPUT
+          # Get changed files from the PR
+          CHANGED_FILES=$(gh pr view ${{ github.event.pull_request.number }} --json files -q '.files[].path' || echo "")
+          echo "changed_files=$CHANGED_FILES" >> $GITHUB_OUTPUT


.github/workflows/argus-retest.yml

+          script: |
+            const regression = '${{ steps.regression.outcome }}';
+            const rescan = '${{ steps.rescan.outcome }}';
+            const allPassed = regression === 'success' && rescan === 'success';
+
+            const body = `## Argus Retest Results
+
+            | Check | Status |
+            |-------|--------|
+            | Regression Tests | ${regression === 'success' ? 'Passed' : 'Failed'} |
+            | SAST Rescan | ${rescan === 'success' ? 'Clean' : 'Issues found'} |
+            | **Overall** | **${allPassed ? 'Fix Verified' : 'Needs Review'}** |
+
+            ${allPassed ? 'The fix has been verified. The vulnerability is confirmed resolved.' : 'The retest found issues. Please review the scan results.'}
+
+            ---
+            *Argus Security Retest — triggered by merge of \`${{ github.event.pull_request.head.ref }}\`*`;
+
+            // Comment on the merged PR
+            await github.rest.issues.createComment({
+              owner: context.repo.owner,
+              repo: context.repo.repo,
+              issue_number: ${{ github.event.pull_request.number }},
+              body: body
+            });


.github/workflows/post-deploy-scan.yml

+        run: |
+          if [ "${{ github.event_name }}" = "workflow_dispatch" ]; then
+            echo "target_url=${{ inputs.target_url }}" >> $GITHUB_OUTPUT
+            echo "environment=${{ inputs.environment }}" >> $GITHUB_OUTPUT
+          else
+            echo "target_url=${{ github.event.deployment.payload.web_url || '' }}" >> $GITHUB_OUTPUT
+            echo "environment=${{ github.event.deployment.environment }}" >> $GITHUB_OUTPUT
+          fi
+          # Get diff since last successful scan
+          PREV_SHA=$(git log --format='%H' -2 | tail -1)
+          echo "prev_sha=$PREV_SHA" >> $GITHUB_OUTPUT
+          CHANGED=$(git diff --name-only $PREV_SHA HEAD | head -100)
+          echo "has_changes=$( [ -n "$CHANGED" ] && echo true || echo false )" >> $GITHUB_OUTPUT
+


scripts/sast_dast_validator.py

+            response = urllib.request.urlopen(
+                request,
+                timeout=self.timeout,
+                context=ctx,
+            )


cursor

Cursor Bugbot has reviewed your changes and found 4 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

cursor · 2026-03-04T06:28:55Z

.github/workflows/post-deploy-scan.yml

+                  max_duration=600,
+              )
+              orch = DASTOrchestrator(config=config)
+              results = orch.run('${{ steps.context.outputs.target_url }}')


Script injection via target URL in workflow

High Severity

The ${{ steps.context.outputs.target_url }} is interpolated directly into an inline Python string on line 86. If the URL contains a single quote (e.g., from workflow_dispatch user input or a crafted deployment payload), it breaks out of the Python string literal, enabling arbitrary code execution in the workflow runner. This value originates from ${{ inputs.target_url }} which is directly user-controlled.

Additional Locations (1)

.github/workflows/post-deploy-scan.yml#L43-L44

cursor · 2026-03-04T06:28:55Z

.github/workflows/argus-retest.yml

+      - name: Extract fix metadata
+        id: meta
+        run: |
+          BRANCH="${{ github.event.pull_request.head.ref }}"


Shell injection via branch ref in workflow

High Severity

${{ github.event.pull_request.head.ref }} is interpolated directly into a shell script. While the if condition ensures the branch starts with argus/fix-, the remainder is attacker-controlled. A branch name containing shell metacharacters like $(cmd) or backticks would execute arbitrary commands during the run step, potentially exfiltrating secrets like ANTHROPIC_API_KEY or GITHUB_TOKEN.

cursor · 2026-03-04T06:28:55Z

scripts/findings_store.py

+
+    def __init__(self, db_path: str = ".argus/findings.db") -> None:
+        self.db_path = db_path
+        self._lock = threading.Lock()


Lock advertised as reentrant but uses non-reentrant Lock

Low Severity

The module docstring (line 16) states "Thread-safe write operations with a reentrant lock" but the implementation uses threading.Lock(), which is non-reentrant. If any future code path (or subclass) attempts to re-acquire the lock from within a locked section, it will deadlock. The type needs to match the documented contract — either use threading.RLock() or correct the docstring.

cursor · 2026-03-04T06:28:55Z

scripts/findings_store.py

+        row = cur.fetchone()
+        if row is None:
+            return False
+        return row["status"] == "fixed"


Regression detection returns stale result after record_scan

Medium Severity

is_regression checks whether the finding's current status equals "fixed". But record_scan resets regressions' status to "open" immediately upon detection. So get_historical_context calling is_regression after record_scan will always return False for regressions, since their status was already flipped to "open". The method needs to check fix_history records rather than relying on transient status.

Additional Locations (1)

scripts/findings_store.py#L775-L776

…eview - Sanitize ${{ }} expression interpolation in post-deploy-scan.yml and argus-retest.yml by passing attacker-controlled values through env vars instead of direct shell/script interpolation (HIGH severity) - Fix is_regression() returning stale results by checking fix_history records instead of transient status field (MEDIUM) - Change threading.Lock() to threading.RLock() to match docstring (LOW) https://claude.ai/code/session_017NQsm2eBxfioLrad1C7keZ

claude added 3 commits March 4, 2026 05:39

devatsecure merged commit ea79556 into main Mar 4, 2026
16 of 39 checks passed

github-advanced-security bot found potential problems Mar 4, 2026

View reviewed changes

cursor bot reviewed Mar 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add continuous autonomous security testing framework (v3.0)#35

Add continuous autonomous security testing framework (v3.0)#35
devatsecure merged 3 commits intomainfrom
claude/add-security-testing-guide-BClsS

devatsecure commented Mar 4, 2026 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

Check failure

Check failure

Check failure

Check warning

cursor bot left a comment

Uh oh!

cursor bot Mar 4, 2026

Uh oh!

cursor bot Mar 4, 2026

Uh oh!

cursor bot Mar 4, 2026

Uh oh!

cursor bot Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

devatsecure commented Mar 4, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Changes Made

Core Modules Added

Supporting Infrastructure

Uh oh!

Uh oh!

github-actions bot commented Mar 4, 2026

✅ Hybrid Security Scan Results

📊 Findings Summary

🛠️ Tools Used

📈 Metrics

🔗 Links

Uh oh!

Check failure

Check failure

Check failure

Check warning

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Mar 4, 2026

Choose a reason for hiding this comment

Script injection via target URL in workflow

Uh oh!

cursor bot Mar 4, 2026

Choose a reason for hiding this comment

Shell injection via branch ref in workflow

Uh oh!

cursor bot Mar 4, 2026

Choose a reason for hiding this comment

Lock advertised as reentrant but uses non-reentrant Lock

Uh oh!

cursor bot Mar 4, 2026

Choose a reason for hiding this comment

Regression detection returns stale result after record_scan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

devatsecure commented Mar 4, 2026 •

edited by cursor bot

Loading