Blog Articles by Rebecca Spiegel https://checkmarx.com/author/rebecca/ The world runs on code. We secure it. Thu, 23 Apr 2026 04:42:05 +0000 en-US hourly 1 https://checkmarx.com/wp-content/uploads/2024/06/cropped-cx_favicon-32x32.webp Blog Articles by Rebecca Spiegel https://checkmarx.com/author/rebecca/ 32 32 Guardrails for Agentic Development https://checkmarx.com/blog/guardrails-for-agentic-development/ Thu, 23 Apr 2026 04:42:03 +0000 https://staging.checkmarx.com/?p=108459 Software used to be written in sprints. Now it is generated continuously, reviewed in minutes, and deployed before most security teams have time to react.

That speed has a cost. According to the World Economic Forum, 94% of surveyed leaders expect artificial intelligence to be the most significant driver of change in cybersecurity, even as 87% identify AI-related vulnerabilities as the fastest-growing cyber risk. Vulnerabilities now propagate faster than traditional controls can detect them. At this speed, pausing development to evaluate risk isn’t just inconvenient – it’s operationally impossible.

Security has to move with the code, not after it.

The Scale Problem: Vulnerability Growth Is Now an Operational Constraint

Alongside this explosion of AI-driven development, the global vulnerability landscape is expanding just as fast. The Forum of Incident Response and Security Teams (FIRST) projects that 2026 will be the first year to exceed 50,000 published vulnerabilities, with a median forecast of approximately 59,427. Manual prioritization is no longer viable.

The numbers alone don’t capture the operational shift. For decades, application security programs depended on predictable development cycles that created natural pauses for risk evaluation. AI-driven development removes those pauses. Code is generated automatically, refined instantly, and deployed continuously. Traditional security gates depend on time – and continuous delivery eliminates it.

A recent breach illustrates how quickly this can play out in practice. Using GenAI, a small group of attackers generated more than 400 custom scripts and launched attacks across 300+ servers, automating what once required coordinated teams and significantly compressing the time from vulnerability discovery to exploitation.

Security is no longer restrained by detection capability, but by decision capacity. The challenge is no longer whether organizations can find vulnerabilities. It is whether security teams can make decisions fast enough to keep pace with the volume and velocity of AI-generated code. Teams must determine which risks matter most and how to reduce exposure fast. As the National Institute of Standards and Technology (NIST) has observed, secure practices must be embedded directly into lifecycle stages rather than appended as separate steps.

From Engineering Problem to Organizational Risk 

AI security has moved from an engineering concern to an organizational one. Leaders increasingly recognize that the risks associated with AI-driven development extend beyond technical vulnerabilities to affect organizational resilience, regulatory compliance, and customer trust.

The risks are operational: misconfigured dependencies can introduce vulnerabilities, unauthorized access can compromise systems, and automated workflows can propagate errors at scale. Governance and engineering are describing the same problem from different angles, and the challenge isn’t choosing between them but integrating them.

That integration doesn’t happen through policy updates alone. It requires rethinking how AI development is structured, how dependencies are managed, and how security controls are embedded into the workflow itself.

The supply chain is where that challenge becomes most visible.

The Expanding Supply Chain: From Software Components to AI Systems

Supply chain risk has always been a central concern in software security, but AI introduces new layers of dependency that are harder to observe and harder to control. Traditional software supply chains included libraries, frameworks, and external services, while modern supply chains include AI models, prompts, orchestration layers, and runtime agents.

The European Union Agency for Cybersecurity reports that supply chain risks currently account for 10.6 percent of identified threat categories. As software ecosystems become more interconnected, the number of potential entry points for attackers increases – and dependencies that once served as conveniences become liabilities.

The recent attack on Cursor illustrates how quickly trust assumptions can be exploited. Attackers chained a prompt injection with a sandbox bypass and remote tunneling capability to obtain shell access simply by convincing a developer to open a repository. Because the attack leveraged legitimate tooling features rather than traditional malware, perimeter defenses offered little protection. Modern supply chain risk increasingly originates inside the developer workflow itself, where trusted automation interacts directly with untrusted code and content. Developer tool itself has become part of the attack surface.

This is part of a broader pattern. When applications automatically retrieve and execute external artifacts without verifying their integrity or origin, malicious actors can substitute compromised components into the execution environment – turning trusted dependencies into attack vectors.

These attacks do not rely on advanced exploitation techniques, but on assumptions: when AI systems automatically trust external components, attackers can manipulate that trust.

The Two-Layer Guardrail Architecture 

Security leaders are responding to these challenges by adopting a new architectural principle that treats AI-generated code as untrusted input until it has been verified through independent controls. This guardrail model gives this principle operational form, organizing security into two complementary control loops: prevention (inner loop) and enforcement (outer loop).

The preventive loop operates at the moment code is created. This is the stage when developers have the most context, and the remediation costs are lowest. Errors caught here can be immediately corrected, before vulnerabilities have a chance to propagate downstream. Traditional review processes were designed for a world where code arrived in batches. AI-generated code arrives continuously, which means security controls must operate continuously as well.

Checkmarx’s Developer Assist embodies this approach. Working within the developer workflow, it analyzes generated and handwritten code in real time, identifies patterns associated with security risk, and provides contextual guidance before changes leave the local environment. Security shifts from reactive to proactive, preventing vulnerabilities during creation instead of after deployment.

The enforcement loop operates at the moment that code is integrated. This is where organizations apply consistent policies across environments, evaluate risk in context, and establish accountability for security decisions.

Checkmarx’s Triage Assist and Remediation Assist operationalize this by analyzing vulnerabilities in relation to runtime behavior, dependency relationships, and exposure paths. Rather than surfacing large volumes of findings ranked by severity alone, the agent evaluates which issues are realistically attackable and generates prioritized remediation actions.

This distinction between detection and decision-making is central to the scalability of modern security programs. At the scale of today’s vulnerability landscape, the ability to prioritize effectively determines whether security controls reduce risk or simply generate noise.

A Continuous Security Workflow for AI-Driven Development 

The guardrail model operates as a continuous workflow across three stages.

As code is written, Developer Assist evaluates changes in real time, identifying potential vulnerabilities and recommending secure alternatives before anything leaves the local environment.

At integration, automated pipelines execute security scans and policy checks. Triage Assist and Remediation Assist analyze the results, determining which vulnerabilities pose immediate risk and generating remediation guidance. Human approval is required before deployment, keeping accountability for critical decisions with the team.

After deployment, monitoring systems track system behavior, detect anomalies, and feed insights back into the development workflow. This feedback loop allows security controls to dynamically adapt as new threats emerge, rather than waiting for the next scheduled review.

The result is a system where security is no longer a checkpoint, but a continuous property of the development process itself.

What Effective Governance Looks Like in Practice 

Organizations that succeed in AI-driven development environments build operational guardrails that define how software is generated, validated, and deployed. Governance works when risk management is translated into enforceable controls embedded directly into the development workflow.

A practical governance baseline for 2026 includes the following minimum controls:

1. Govern AI Systems as First-Class Assets 

AI coding tools, agents, and model components are now part of the software supply chain. They must be managed with the same rigor as production infrastructure.

Operational controls:

  • Maintain a centralized inventory of AI coding tools, agents, and model dependencies  
  • Assign accountable owners for each AI component  
  • Classify AI systems by risk and operational impact  
  • Enforce approved usage policies for AI development tools

Why this matters: AI systems become governable infrastructure rather than unmanaged productivity tools.

2. Enforce Risk-Based Prioritization

At the current vulnerability scale, remediation effectiveness depends on prioritization accuracy. Organizations cannot fix everything; they must fix what is exploitable in their environment.

Operational controls:

  • Implement contextual risk scoring based on exploitability and exposure  
  • Automate triage workflows within pull request and CI/CD pipelines  
  • Define remediation service-level objectives tied to business risk  
  • Suppress non-exploitable findings to reduce operational noise  

Why this matters: Engineering effort is directed toward vulnerabilities that materially increase risk.

3. Centralize Software and AI Supply Chain Controls

Modern supply chains include models, prompts, orchestration components, and external dependencies. These assets must be continuously verified to prevent trust-based compromise.

Operational controls:

  • Enforce dependency provenance and integrity validation  
  • Maintain software and AI bills of materials (SBOM and AI-BOM)  
  • Monitor repositories and registries for unverified or malicious components  
  • Apply consistent security policies across software and AI dependencies  

Why this matters: Supply chain risk becomes observable and controllable across the full development ecosystem.

4. Establish Guardrails for Agentic Systems

Agentic workflows introduce operational risk because automated systems can execute actions without direct human oversight. Governance must define the boundaries within which agents operate.

Operational controls:

  • Assign unique identities to all AI agents and automation services  
  • Define permission boundaries for agent actions  
  • Enforce tool invocation policies for automated workflows  
  • Maintain auditable records of agent decisions and system interactions  

Why this matters: Agent behavior becomes predictable, accountable, and traceable across environments.

5. Standardize Prompts as Governed Inputs

Prompts influence system behavior and security outcomes. They must be treated as controlled inputs rather than informal instructions.

Operational controls:

  • Develop approved prompt templates for common development tasks  
  • Enforce prompt validation and policy checks before execution  
  • Log prompt activity for audit and incident response  
  • Restrict generation patterns that introduce known security risks 

Why this matters: Prompt usage becomes consistent, auditable, and aligned with secure development practices.

6. Measure Security Performance Using Operational Metrics

Governance is only sustainable when performance is measurable. Metrics must reflect real risk reduction, not activity volume.

Operational controls:

  • Track time-to-fix for exploitable vulnerabilities  
  • Measure reduction in non-actionable findings  
  • Monitor policy compliance across development workflows  
  • Report risk reduction outcomes to executive leadership 

Why this matters: Security performance becomes quantifiable and aligned with business resilience.

The World Economic Forum emphasizes that guardrails and secure-by-design practices must accompany AI adoption. Without them, misconfiguration and adversarial manipulation become predictable outcomes rather than isolated incidents. The lesson is straightforward: resilience depends on architecture, not intervention.

The Strategic Shift Ahead

The most common mistake organizations make when confronting AI security challenges is assuming that the solution is more tools. Tools improve visibility, but they cannot compensate for structural misalignment between development speed and security processes.

Traditional security models assumed that software was produced by humans operating predictable workflows, but that assumption no longer holds. Software is now continuously produced by AI and security controls must operate at the same speed. This isn’t an incremental change; it’s a different relationship between development and risk. Organizations that recognize this shift early will be able to adapt their workflows and maintain confidence in their software. Those that don’t will find themselves managing the complexity of modern systems instead of reducing risk.

The organizations that succeed in the age of agentic development won’t rely on stronger gates or stricter controls. They’ll build smarter guardrails.

Want to see this in practice? Explore Triage Assist and Remediation Assist. Still evaluating your options? Check out the Agentic AppSec Buyer’s Guide.

]]>
Stop Manual Triaging, Start Agentic Fixing https://checkmarx.com/blog/stop-manual-triaging-start-agentic-fixing/ Tue, 07 Apr 2026 06:49:03 +0000 https://staging.checkmarx.com/?p=108185 Most security leaders are not struggling because they lack visibility. They are struggling because execution does not scale.  

Triage capacity, decision consistency, and remediation throughput are being outpaced by modern development velocity, especially as AI-assisted coding becomes standard practice. The operating environment has changed. There is more code, more change, more dependencies, and more AI tooling.  

These conditions are turning manual triage into a governance and audit liability. The only sustainable path forward is to move security decisions and remediation into the pull request. There, risk decisions are documented, fixes are verified, and accountability already exists through governed, reviewable AI-assistance that accelerates execution without surrendering control. 

The Reality Security Executives Are Living In 

Security and AppSec leaders are seeing the same pattern repeat across teams: findings accumulate, remediation cycles aren’t keeping pace with development, and audit conversations are becoming more demanding. The reason isn’t just rising expectations, but the reality that modern software delivery keeps expanding the attack surface while simultaneously making risk decisions harder to track and standardize.

This exposure isn’t an insufficient tooling or coverage problem. Application security testing has expanded significantly over the past decade, introducing more scan types, deeper integrations, and great vulnerability visibility. Despite this progress, a consistent pattern remains: organizations often release new software even when they know risk is present, simply because their operating model cannot keep pace with delivery. 

Checkmarx research found that 81% of organizations knowingly shipped vulnerable code, and 98% experiences a breach tied to vulnerable code within the last year. Risk exposure is not cause by a lack of awareness; it’s a breakdown in execution. As AI-assisted development becomes the norm, that pressure only grows.

Leaders at major software organizations have publicly stated that AI now generates a significant share of their code, with some estimates ranging from 20 to 30 percent of code in repositories and more than a quarter of new code.  

As software production accelerates, risk enters at the same pace. And when risk enters the system faster than teams can triage it, remediation becomes unpredictable. That unpredictability is what boards and regulators ultimately penalize. Detection has scaled. Execution has not. 

Detection Scaled, Execution Didn’t. Now What? 

Application security programs have made substantial progress over the last decade. Most enterprises now operate with broad coverage that includes SAST, SCA, CI/CD integrations, and developer-focused tooling. However, many programs are now over-detecting vulnerabilities relative to their ability to act on findings. This imbalance creates operational friction and weakens overall security outcomes. 

The symptoms are consistent across organizations. Triage bottlenecks form as teams struggle to review large volumes of findings. Identical issues are handled differently across teams, creating inconsistency in decision-making. Findings remain unresolved for extended periods because priority is unclear, or remediation effort is high. Backlogs grow with items that are technically valid but not treated as immediate risk, while other issues are escalated without sufficient context. 

This dynamic explains why more findings rarely reduce exposure. When everything is flagged, nothing gets fixed. Security leaders feel this as a credibility gap: dashboards show activity, but stakeholders want to know if the organization is getting more secure. That’s a hard question to answer when decision-making is inconsistent, and remediation throughput cannot be predicted. 

The NIST Secure Software Development Framework reinforced this shift, by requiring organizations to document how risk decisions are made and demonstrate evidence of secure development practices. Detection alone is not sufficient. Organizations must show that vulnerabilities were evaluated in context and resolved in a consistent, auditable way.  

Detection is not the end state. Evidence-backed execution is. 

Why Manual Triage Is Now a Business Risk 

At enterprise scale, manual triage is no longer simply inefficient. When security teams manually interpret findings across hundreds of repositories, inconsistency is inevitable and becomes an operational and governance risk. Depending on the team reviewing, identical findings receive completely different treatment. One team immediately remediates, another dismisses it, and another marks it as accepted risk without a standardized rationale. That inconsistency becomes a liability as regulators and auditors increasingly expect organizations to formally document how vulnerabilities are categorized and managed. When the answers to basic governance questions vary across the organization, regulators interpret that variability as a lack of control. Who made the decision? What evidence supported it? Which policy is applied? Without consistent answers, risk exposure becomes unpredictable and difficult to defend.  

These growing expectations arrive precisely when teams are least equipped to meet them. Security teams face ongoing budget pressures and staffing shortages while vulnerability volumes keep rising. Headcount cannot scale in proportion to code volume.  Instead, organizations need to scale execution by relying on more consistent, efficient, and automated approaches to vulnerability management. 

The Pull Request Is the New Control Plane 

If risk is introduced continuously throughout development, governance must be applied at the point where decisions are actually made. In modern engineering environments, that point is the pull request. 

The pull request is where code changes become official: approvals granted, discussions recorded, checks enforced, and ownership is established. It is the only place where execution can be observed and governed at the same speed as development.   

Security decisions belong where code is reviewed, approved, and merged, not buried in dashboards and ticketing systems. 

 Checkmarx’s Triage Assist and Remediation Assist operate directly within pull requests, ensuring that risk decisions are made in the same place where change control already exists. The principle is straightforward: if security execution is not visible within the pull request, it cannot be governed. 

From Alerts to Outcomes: What Changes 

This shift does not eliminate human involvement; It just changes where human judgment is applied. Instead of spending time manually investigating large volumes of findings, teams focus on policy definition, exception handling, and approval. 

Triage Assist introduces a contextual, risk-based prioritization model that converts scan output into decision-grade outcomes. It evaluates vulnerabilities using attackability-driven analysis, combining reachability, exploitability, and policy context to determine which issues require action. This approach moves triage away from severity-based sorting toward context-based decision-making. Findings are classified into clear outcomes such as false positive, acceptable risk, or action required, enabling consistent and defensible decisions across teams. The shift toward context-driven decisioning aligns with broader industry efforts such as the Vulnerability Exploitability eXchange (VEX), which communicates the exploitability of a vulnerability in context , not simply if it’s present. 

Remediation Assist addresses the next stage of execution. Once a decision is made, it generates reviewable, merge-ready fixes directly within the pull request workflow. These fixes are delivered as diffs or remediation pull requests that align with existing development processes. Nothing merges automatically; developers review and approve changes as part of their standard workflow, preserving governance while accelerating remediation throughput. 

Together, Triage Assist and Remediation Assist transform application security from a process centered on alerts to one focused on outcomes. 

Governed AI, Not Autonomous Chaos 

Security leaders don’t need autonomous systems making unchecked changes to code. They need governed execution that improves speed while maintaining control. This distinction matters more as AI expands both development capabilities and the attack surface that comes with it. 

New risks, including prompt injection, supply chain manipulation, and excessive agent permissions, require careful control over how AI is used within development workflows. Even systems that include human oversight can introduce risk if decisions are not transparent or if context is incomplete. Industry frameworks such as the OWASP Top 10 for Large Language Model Applications highlight emerging risks including prompt injection, supply chain manipulation, and excessive agent permissions, reinforcing the need for controlled and explainable execution. 

Read more: When the AI Lies: A New Threat Emerges for “Human-in-the-Loop” Security 

A governed approach to AI-driven security execution is grounded on clear principles. Human review remains mandatory through established approval workflows. Decision rationale is preserved to support auditability. Usage is scoped and controlled across repositories and environments. Automated changes are never merged without review. 

This model ensures that AI accelerates your execution without also introducing uncontrolled behavior. It aligns with the needs of regulated, audit-driven environments where traceability and accountability are essential.

What Success Looks Like for Security Executives 

The goal isn’t better visibility alone, but predictable execution with measurable outcomes: reduced time to decision, faster remediation cycles, and smaller vulnerability backlogs. Standardized, evidence-based triage reduces the need to repeatedly evaluate the same issues across teams, which improves both efficiency and consistency. 

Higher fix acceptance rates and fewer regressions indicate that remediation is delivered in ways that fit developer workflows, without destabilizing applications. Consistent outcomes across teams means that governance is being applied systematically rather than left to individual judgement.

Audit readiness matters just as much. Security artifacts must be tied directly to execution, including pull request discussions, approvals, and documented decisions. This reduces reliance on retrospective explanations when auditors and boards come asking. These outcomes are becoming more critical as exploitation windows continue to shrink. Vulnerabilities are often exploited shortly after disclosure, which mean delayed triage is no longer a workflow preference. It’s a business risk.

The Strategic Shift 

Security leaders do not need more alerts, they need more finished work. Moving away from manual triage is not about reducing security effort, but about operationalizing security in a way that scales with modern development. 

Effective application security requires decisions that are grounded in context, remediation delivered within the development workflow, and governance preserved through auditable processes. This is the shift toward agentic application security, where the gap between how quickly software is created and how quickly risk is understood and mitigated can be closed without slowing innovation. 

Ready to move from manual triage to scalable, governed execution? Explore Checkmarx’s Agentic AI Buyer’s Guide to see how leading teams are operationalizing this shift.

]]>
Why Vulnerability Detection Doesn’t Scale   https://checkmarx.com/blog/why-vulnerability-detection-doesnt-scale/ Wed, 25 Mar 2026 17:03:01 +0000 https://staging.checkmarx.com/?p=107874 Most AppSec teams are not failing to detect risk. They’re just failing to remediate it fast enough. 

Security programs now find more vulnerabilities than they can fix, and remediation hasn’t kept pace with how fast teams ship code. AI-generated code has made that gap worse, adding volume and complexity faster than security processes have adapted. 

Coverage has expanded, scanning is continuous, and visibility is no longer the bottleneck – but the ability to act on that visibility at scale hasn’t kept up. Backlogs grow, MTTR stays stubbornly high, and the same classes of vulnerabilities reappear across releases, even as detection improves. 

The gap isn’t that security teams lack maturity. It’s that AppSec was never built to operate at this scale. 

Detection Has Scaled. Execution Has Not. 

A growing share of organizations now acknowledge shipping software with known vulnerabilities to keep delivery moving But the pace of exploitation is accelerating at the same time. Research shows that the average time to exploit newly disclosed vulnerabilities has dropped dramatically in recent years, with attackers increasingly weaponizing vulnerabilities within days of disclosure and sometimes within hours. In 2025, nearly one-third of known exploited vulnerabilities were exploited on or before the day they were publicly disclosed, leaving organizations little time to evaluate and remediate risk.  

The distance between discovery and remediation is no longer a theoretical chasm. It is operational, measurable, and increasingly visible to boards, regulators, and customers. 

Detection Solved Visibility, Not Outcomes 

For more than a decade, AppSec investments focused on improving detection – and that initially worked. Coverage expanded across proprietary code and open-source dependencies, scanning became faster, findings became richer,  and dashboards and reporting improved. But risk outcomes did not. 

Security teams now operate in an environment where visibility is abundant, but action is constrained. Thousands of findings accumulate without clear prioritization, causing analysts to spend hours validating reachability and exploitability. At the same time, developers receive findings without enough context to determine what actually matters. Different teams end up making different decisions on identical issues and the result is a system that knows more but fixes less. 

This mismatch is becoming more pronounced as AI continues to accelerate development velocity. Leaders at major software organizations have publicly stated that a significant portion of new code is now generated with AI assistance and only reviewed by engineers before release. 

More code, shipped faster, means more potential risk, and more risk requires more capacity to remediate, not just more capacity to detect. Detection surfaced the problem. It didn’t solve it.  

The Execution Gap Is the New AppSec Bottleneck 

The execution gap is not a single failure point, but the accumulation of small inefficiencies that compound at scale.  

Triage still depends on human judgment that gets repeated inconsistently across teams, with prioritization varying based on who happens to review a given finding. Fix guidance is often advisory, leaving developers to interpret and implement solutions themselves. And governance tends to exist in policy documents instead of the workflows where decisions are actually made. Individually, these issues are manageable. At AI-scale, they become systemic, thereby compounding AppSec challenges, not because teams lack tools, but because the system connecting detection to action is inconsistent. When execution varies, risk becomes unpredictable and auditability degrades. When workflows depend on manual interpretation, service-level commitments become unenforceable. 

 What looks like a technical problem is, at its core, actually an operational one. 

The Pull Request Is the Only Place Execution Can Scale

For years, AppSec findings have flowed into tickets, dashboards, and reports – but that’s not where fix decisions get made.  

Execution happens in the pull request, where code is reviewed, discussed, approved, and merged. It is where context exists, accountability is enforced, and decisions are recorded by default. Pull requests can be configured to block merges until required checks pass, including security scanning results.  

In practice, this means remediation decisions and risk acceptance already occur in the pull request workflow, whether security teams formally recognize it or not. So why are security decisions still being made outside of this workflow? 

From Detection to Decision Infrastructure 

Modern AppSec needs a system that can turn findings into decisions. Not every vulnerability is exploitable; some represent real, reachable risk, others are false positives, and others fall within acceptable risk thresholds depending on context. Today, this distinction is made manually and inconsistently.

Decision infrastructure changes that. It classifies findings with reasoning, distinguishes between what must be fixed and what can be deprioritized, and surfaces those decisions directly in the pull request. It enables guided, reviewable remediation that is aligned with how the application actually works.

The industry has largely moved toward context-driven prioritization, with modern vulnerability management frameworks emphasizing exploitability and real-world impact over severity scores alone. But translating detection signals into actionable risk decisions requires decision infrastructure, and without it, the value of detection is incomplete.

Where Triage and Remediation Actually Happen 

Modern AppSec programs are beginning to embed agentic workflows directly in the pull request, delivering security decisions and fixes where code is actually reviewed and merged. Checkmarx One is built on this model. Triage Assist addresses the first execution bottleneck: deciding what requires immediate action. It evaluates findings using contextual analysis to determine whether a vulnerability is reachable, exploitable, and relevant within the application environment. Instead of presenting developers with raw scan output, it produces decision-ready outcomes that identify which issues must be fixed, deferred, and or represent acceptable risk under policy. 

This shift replaces manual triage queues with consistent, evidence-based decision logic that can be applied across repositories, teams, and applications. Decisions become standardized, rationale becomes visible and governance becomes enforceable. 

Remediation Assist addresses the second execution bottleneck: turning decisions into completed work. Once an issue is identified as requiring action, remediation guidance is delivered directly in the pull request as a reviewable code change that aligns with the application’s frameworks and dependencies. Developers evaluate the proposed fix using their existing review process, preserving accountability and control while accelerating resolution. Human approval remains mandatory, but developers don’t need to start from scratch when addressing security issues. The path from detection to remediation becomes shorter, more predictable, and easier to govern. 

Together, these capabilities transform the pull request into a true execution layer for application security. Risk decisions are made where code changes occur, fixes are delivered where developers work, and evidence is recorded where auditors expect it.  

And this approach scales. 

Why Governance Must Be Embedded, Not Enforced Later 

When governance exists outside the workflow, it is optional by default. It relies on teams remembering to follow it, interpreting it correctly, and applying it consistently. At scale, that approach breaks down. 

But when governance is embedded in execution, it becomes part of how work gets done. This principle is increasingly reflected in secure software development standards. Frameworks such as the NIST Secure Software Development Framework emphasize the importance of maintaining evidence and artifacts that demonstrate how security decisions were made and implemented throughout the development lifecycle. 

That requirement changes what AppSec governance actually means.  Policy alone isn’t sufficient; what matters is documented execution that preserves human oversight, where prioritization criteria are applied consistently, remediation is scoped and controlled, and decisions are captured automatically without additional administrative overhead. This is the difference between policy and control. Triage and remediation capabilities built directly into development workflows don’t replace decision-making, they structure it, bringing prioritization, reasoning, and fix guidance into the pull request where decisions are already happening. The result is governed execution, not autonomous remediation.

What Changes When Execution Is Operationalized 

When execution is built into the workflow, the system begins to behave differently. 

Manual triage effort drops because classification is no longer repeated across teams. Time to decision shrinks because context and reasoning are already available. Remediation becomes more consistent because developers are not guessing what matters or how to fix it. Risk acceptance becomes explicit, not implied. Auditability improves because decisions are captured as part of normal development activity.  

Perhaps most importantly, the relationship between security and engineering changes. When developers receive clear, contextualized guidance in the pull request, security stops being perceived as noise and starts being seen as actionable input, reducing friction. The conversation shifts from asking why a finding exists to deciding what action should be taken. 

This shift is increasingly necessary as development complexity grows. Software supply chain risk, third-party dependencies, and AI-assisted development are expanding the attack surface faster than traditional workflows can keep pace.

Recent reporting shows that most organizations have experienced supply chain attacks within the past year, reinforcing the need for consistent, scalable remediation processes.  By adopting this approach, AppSec can scale effectively without requiring a proportional increase in headcount. 

The Necessary Shift to Agentic AppSec 

Detection is table stakes in 2026. The organizations that optimize detection alone will continue to generate more findings than they can act on. Backlogs will grow and the gap between visibility and execution will widen. Security teams will remain overwhelmed, and risk decisions will remain inconsistent.  

The organizations that operationalize execution will close that gap. They will make decisions where work happens. They will standardize how those decisions are made. They will embed governance into the workflow instead of enforcing it after the fact. And they will measure success not by how much they find, but by how much they fix.  

This shift is defining modern, agentic application security. 

Read the next article: Attackability: Why Context, Not Reachability, Should Drive Remediation

Learn how modern AppSec teams prioritize vulnerabilities based on reachability, exploitability, and real-world impact to reduce noise and focus remediation where it matters most. 

]]>
Attackability: Why Context, Not Reachability, Should Drive Remediation  https://checkmarx.com/ai-llm-tools-in-application-security/reachability-was-a-breakthrough-but-now-its-not-enough/ Tue, 24 Mar 2026 15:30:14 +0000 https://staging.checkmarx.com/?p=107714 For years, reachability has been the security industry’s go-to approach for vulnerability prioritization. Instead of flagging every vulnerable dependency, the idea was to determine whether an application could actually reach the vulnerable function. This marked a meaningful step forward in application security, shifting focus to code that executes in production. 

But reachability is not exploitability. A function can be reachable and still pose no practical risk if it sits behind authentication, processes only trusted inputs, or is mitigated by runtime controls. Reachability confirms that code can run, not that an attacker can abuse it. 

Modern software development requires more than execution analysis. 

Checkmarx Triage Agent addresses this head on with Attackability: AI-driven triage that traces attacker-controlled inputs from real ingress points to potential impact and verifies which controls prevent exploitation – and which do not.  

The result is triage based on demonstrated exploitability, not theoretical reachability. 

What Reachability Actually Tells You 

Most SCA tools define reachability at the function level: is there a path from your code to the vulnerable function? If yes, the finding is flagged as reachable. If not, it’s deprioritized. 

That’s useful, but it’s also incomplete. Here’s what reachability doesn’t tell you: 

  • Whether the input reaching that function is attacker-controlled, or only comes from trusted internal sources 
  • Whether there’s a real ingress point (a public API, a webhook, a file upload) that a real attacker could use 
  • Whether required preconditions exist, like a specific protocol behavior or a privileged network position 
  • Whether controls on the path, such as a safe parser, an authentication check, or an allowlist, already break the exploit chain 
  • What the actual impact would be: RCE, data exposure, privilege escalation, or something else

A finding can be technically reachable yet completely unexploitable in production. When that happens, engineering time is wasted for no reason, and real risk competes for attention. 

Security teams don’t need to know “can this function run?” they need to know “can an attacker exploit this in our application, given our ingress points, our controls, and our runtime environment?” 

That’s the difference between reachability and attackability.  

How Attackability Works 

Checkmarx Triage Assist introduces Attackability: AI-driven triage that traces attacker-controlled input from real ingress points to potential impact, and validates which controls prevent exploitation and which do not. 

Attackability follows a consistent five-step flow regardless of the scanner type: 

  1. Identify the vulnerable capability and candidate sink. Confirm what the vulnerable library, pattern, or API surface is, and form an initial hypothesis about how exploitation would occur. 
  2. Prove or disprove a real execution path. Trace whether the vulnerable code path is reachable in the repository, including direct calls, indirect framework behavior, and configuration-driven invocation. 
  3. Validate attacker control and real ingress. Identify how data enters the system (API endpoints, file uploads, queues, webhooks, scheduled jobs) and whether an external attacker can actually influence the data that reaches the sink. 
  4. Validate controls and preconditions. Check whether security controls apply on the relevant path: safe parsing, allowlists, auth boundaries, sanitization, runtime hardening. Document any required preconditions, such as a MITM position or specific deployment settings. 
  5. Conclude exploitability and explain impact. Give a clear verdict (exploitable, not exploitable, or risk accepted with rationale), state the concrete impact, and provide a minimal-disruption remediation

This moves the conversation from “is this reachable?” to “is this exploitable?” 

Not Just for SCA 

Attackability isn’t limited to dependency finding; it applies the same reasoning across most scanner types. 

For SAST findings, it connects a detected code pattern to a real exploit chain by asking whether there’s a genuinely attacker-controlled source, whether the data flow reaches a dangerous sink, and whether controls on the path already prevent exploitation. A tainted data flow that never crosses an authentication boundary, or that’s constrained by an allowlist, can be reachable in code without being attackable in production. 

For IaC and cloud misconfigurations, it evaluates whether a configuration issue is externally accessible and whether it creates a realistic path to impact, factoring in exposure surfaces, identity controls, and network controls. 

For container findings, it assesses whether a vulnerable package is used at runtime, whether the container runs with elevated privileges, and whether the affected component is exposed through a reachable service. 

For secrets detection, it evaluates whether the credential is valid, scoped, and exposed in a way an attacker can actually leverage, factoring in repository visibility, rotation state, and downstream blast radius. 

What Makes the Output Credible 

The Attackability data is useful precisely because it’s verifiable. It includes concrete code references showing how the library or sink is used, a path narrative describing the chain from ingress to sink to impact (including where the chain breaks if the finding isn’t exploitable), explicit control validation, and a precise impact statement. 

This matters more than triage speed. It means developers can see exactly how the issue is triggered and what minimal change breaks the chain. It means risk acceptance decisions are documented with evidence, so security and engineering teams are aligning on facts (not assumptions).  

Reachability Is Just a Starting Point 

Reachability made vulnerability data more relevant. But reachability is not enough. 

Checkmarx Triage Assist’s Attackability adds attacker context, environmental context, and control validation, turning a reachability result into something a team can actually make a decision on. 

Ready to go deeper? Read the Agentic AI Buyer’s Guide to understand what separates decision-grade triage from theoretical analysis or watch the Checkmarx Triage Assist demo video to see Attackability in action.

]]>
The ROI of Agentic AI AppSec https://checkmarx.com/blog/the-roi-of-agentic-ai-appsec/ Mon, 29 Dec 2025 18:20:11 +0000 https://staging.checkmarx.com/?p=106266 ROI Looks Different in the AI Era 

LLMs now accelerate how code is written, refactored, and merged. Traditional “scan-and-fix later” workflows can’t keep up with that pace; they push findings downstream, inflate rework, and slow releases. The financial impact shows up as extra PR rewrites, pipeline reruns, context switching, and escalations.  

The fix is to move AppSec to the point of creation, inside the IDE, so issues are prevented or remediated while the developer’s mental stack is fresh. 

Agentic AppSec is autonomous, context-aware assistance that validates and remediates during coding, not after the commit. Gartner frames this category as AI Code Security Assistance (ACSA); Checkmarx One Assist operationalizes it through Developer Assist. 

A Practical ROI Model You Can Take to Your CFO 

For all the talk about developer productivity and AI acceleration, most AppSec leaders still struggle to express value in the language of finance. Your CFO doesn’t want “shift left” jargon and vulnerability counts, they want a structured model that translates engineering efficiency into measurable return. 

When we analyze ROI for Checkmarx One Developer Assist, we focus on five value buckets that both finance and engineering already recognize. Each ties directly to operational metrics your leadership team tracks, making it easy to build a defensible business case for Agentic AppSec: 

1. Mean Time to Remediate (MTTR) 

Inline findings and explainable fixes inside the IDE compress triage and remediation from hours to minutes. Since developers resolve vulnerabilities in context, fewer issues escape to late-stage testing or production, where every fix costs exponentially more. The result is measurable improvement in DORA MTTR and a more predictable release cadence. 

2. Throughput (Features per Period) and Lead Time for Changes 

    Every context switch, jumping from IDE to portal, waiting on a review, or rerunning a build, creates friction that slows throughput. When developers fix in-place, PR churn decreases and pipelines stabilize. That efficiency shows up directly as more completed work per sprint and a measurable reduction in Lead Time for Changes, one of the most visible metrics to executives tracking delivery velocity. 

    3. False-Positive Drag

    Noise has a cost. Each false positive wastes time erodes trust in tools, and slows adoption. By combining high-fidelity detection with explainable remediation, Developer Assist reduces alert fatigue across the SDLC. A Checkmarx case study found that Best Buy reduced false positive by 80%, illustrating the real economic drag of noisy security and the ROI of precision. 

    4. Rework and Failure Cost  

    Rework is one of the most underestimated drains on engineering productivity. Every post-merge defect triggers retesting, re-review, and sometimes a full CI/CD rerun. By catching vulnerabilities inside the IDE, Developer Assist prevents this expensive cycle before it begins. The result is fewer failed builds, lower operational overhead, and more stable release plans, which are benefits that directly translate into reduced operational expenses (OpEx) and improved predictability. 

    5. Developer Experience (Retention and Flow) 

    Security tools succeed or fail on adoption. If they slow engineers down, they’re disabled or ignored. Developer Assist meets developers where they work, offering AI-powered help that feels like collaboration, not interruption. Tools that improve flow and reduce cognitive friction boost both sentiment and retention, gains that compound over time into sustainable throughput and morale. 

    A CFO’s Takeaway 

    When you put it all together, these five metrics – TTR, throughput, false-positive drag, rework cost, and developer experience – form a complete Agentic AppSec ROI model. It ties productivity, quality, and cost together in one narrative that resonates from the engineering floor to the boardroom. Agentic AppSec is a measurable accelerator of business outcomes. The data is already in your DevOps pipeline, and the only question is whether you’re ready to quantify it. 

    Mechanics, Not Magic, Make Value 

    Every second counts when prevention happens in the IDE. By embedding detection, validation, and remediation directly where developers work, the result is measurable productivity and stronger security posture at the same time. 

    Detect earlier, fix faster (MTTR and failure avoidance) 

    Developer Assist analyzes source, manifests, IaC, and container descriptors as you type, surfacing explainable findings and one-click “Fix with Assist” flows right in the editor. Early detection reduces “late discovery” work and lowers the chance of broken builds. 

    Explainable AI remediation (trust drives adoption) 

    Structured prompts plus verified remediation data mean developers see why a change is needed, not just a diff. That “explain then apply” pattern speeds reviews and keeps security aligned to developer intent: critical for sustained adoption. 

    Integrated coverage (fewer tools, fewer gaps) 

    Because Developer Assist is powered by the Checkmarx platform, teams benefit from proven detection across SAST, SCA, IaC, secrets and container risks delivered in a consistent, IDE-first workflow. Reducing tool switches and consolidating signals also simplifies reporting upstream. 

    When AppSec becomes an active participant in development, not a passive gate at the end of it, security scales with the speed of code creation. Developer Assist bridges that gap, merging developer efficiency with enterprise-grade validation. The impact is cumulative: fewer missed vulnerabilities, faster clean builds, and quantifiable time savings that turn secure coding into a measurable business advantage. 

    Estimate Your ROI in Two Steps 

    Step 1: Time saved per issue 

    • Without IDE-level remediation: assume ~1–3 hours per issue (triage, rework, rebuilds).
    • With Developer Assist: much of that time collapses into minutes because context is fresh and changes are applied inline. 

    Step 2: Multiply by avoided rework 

    • Count how many security-related build failures/reruns you had last quarter.
    • Apply your blended engineering hourly rate to the time you didn’t spend reworking those PRs.

    Want a walkthrough? Our team can map DORA metrics to pre- vs post-Assist performance using your pipeline data. Let’s talk. 

    What Makes a Tool Actually Agentic? And Does It Matter for ROI? 

    Gartner’s AI Code Security Assistance (ACSA) lens emphasizes pre-commit, intent-aware control vs reactive scanning. In practice, this means fewer defects make it to late stages (where each fix is 3–10x more expensive than in development) and the ones that do arrive are already annotated with context. That’s why agentic beats “scan later” in cost curves. 

    Developer Assist pays for itself by eliminating rework at the source. When security happens in the IDE, you fix faster, ship faster, and report outcomes that resonate from dev teams to the board. 

    Read More: The Executive’s Guide to Quantifying Agentic AppSec ROI, From IDE Metrics to Board-Ready Numbers. 

    Download: The Agentic AI Buyer’s Guide 

    ]]>
    The Executive Guide to Quantifying Agentic AppSec ROI, From IDE Metrics to Board-Ready Numbers https://checkmarx.com/blog/the-executive-guide-to-quantifying-agentic-appsec-roi/ Mon, 29 Dec 2025 17:54:26 +0000 https://staging.checkmarx.com/?p=106264 For executives, proving the ROI of security investments has always been complex. Traditional AppSec tools report on vulnerabilities, found and fixed, but those metrics rarely translate into tangible business value. Agentic AI AppSec, led by Checkmarx One Developer Assist, changes that equation. 

    By embedding explainable, real-time remediation directly into the IDE, Developer Assist helps enterprises measure impact in terms that matter to both engineering and finance: time saved, quality improved, and cost avoided.  

    Here’s how to make that case with metrics your business already trusts. 

    Start With Metrics Your Business Already Trusts 

    The first rule of security ROI: your CFO doesn’t buy “scan accuracy”. They buy measurable outcomes that improve throughput, reduce cost, or accelerate delivery. 

    That’s why the most credible ROI models for Agentic AppSec align with the DORA metrics engineering leaders already track, and extend them with quality and cost indicators. 

    Lead Time for Changes (Cycle Time) 

    Inline, IDE-level guidance shortens the time between code commit and deployment. 

    By surfacing vulnerabilities and fixes as developers code, teams spend less time revisiting PRs or waiting on security reviews. Fewer bottlenecks mean faster feature delivery and shorter feedback loops. 

    Change Failure Rate 

    Agentic AppSec catches misconfigurations, insecure dependencies, and code smells before a commit, not after a build breaks. Fewer failed builds and hot-fixes translate directly to higher release stability and lower unplanned work, which impacts both velocity and engineering morale. 

    Mean Time to Remediate (MTTR) 

    Traditional tools force developers to context-switch between security reports and code. Developer Assist embeds explainable remediation right inside the IDE. 

    Developers understand why a fix matters and can resolve it immediately, reducing MTTR across sprints and improving compliance reporting accuracy. 

    False-Positive Rate 

    Precision isn’t just a technical metric, it’s an economic one. Every false positive consumes developer time. Best Buy, a Checkmarx customer, reduced false positives by 80% with Checkmarx One, reclaiming hundreds of developer hours per quarter. That reclaimed time is a quantifiable efficiency gain. 

    Translate Engineering Signals into Dollars 

    Once you’ve anchored your metrics, it’s time to connect them to financial impact. The key is reframing engineering efficiency as cost avoidance and productivity gain. Here’s how to quantify each dimension:

    Rework Avoided 

    Rework is the silent tax on software delivery. Every time a vulnerability is caught post-merge, the fix requires retesting, redeploying, and re-reviewing. 

    To calculate the value of avoiding that rework: 

    1. Gather last quarter’s data on security-related build failures or reruns. 
    2. Estimate the average time spent on each (triage + fix + retest). 
    3. Multiply that time by your blended engineering hourly rate. 
    4. Attribute the reduction in failures after Developer Assist adoption as the savings delta.

    What you’ll find is that even a modest 10% reduction in rework yields measurable ROI: because rework compounds across builds, QA cycles, and deployment delays. 

    Time-To-Value Acceleration 

    Time is revenue. Faster, cleaner releases mean features reach customers sooner, accelerating the revenue recognition timeline. Developer Assist’s inline guidance prevents bottlenecks that block PRs or delay merges. Tie your improvement in Lead Time for Changes directly to your product roadmap milestones. Finance already understands the concept of time-to-market; now they’ll see how in-IDE AppSec directly impacts it. 

    Alert Fatigue Reduction 

    Noise doesn’t just frustrate developers, it drains resources. Every false positive triggers a triage cycle that adds no business value. By reducing false positives through explainable AI and high-fidelity scanning, Developer Assist saves real hours. Use the Best Buy 80% reduction benchmark as a directional proxy in your initial model, and replace it with your own metrics after a 30-day pilot. 

    What “Agentic” Changes in the Cost Model 

    Executives are hearing the term Agentic AI more often, but what it really means for ROI is straightforward: it shifts AppSec from a reactive process to an autonomous, context-aware assistant. As Gartner’s framing of AI Code Security Assistance (ACSA) describes, these systems assist developers with policy-aware validation in real time, closing the gap between development and security. 

    That shift has two major financial effects: 

    1. Defect prevention instead of post-factum correction. 
      Fewer defects reach production, and those that do carry richer metadata for faster triage. 
    1. Cost compression. 
      The cost of fixing a defect late in the lifecycle is 3–10x higher than fixing it during development. By detecting and resolving issues at the creation point, Developer Assist drives a direct cost avoidance multiple. 

    In essence, agentic AppSec redefines security from a cost center into a throughput engine, one that pays dividends in efficiency, developer satisfaction, and customer trust. 

    From Metrics to Board-Ready Outcomes 

    Agentic AI AppSec doesn’t just change how developers work; it changes how executives justify security investment. By reframing technical metrics into measurable outcomes like reduced rework, accelerated delivery, fewer false positives, and higher developer efficiency, Developer Assist gives both CISOs and CFOs a clear ROI narrative supported by real data. Security isn’t slowing you down anymore. It’s making every release faster, safer, and smarter. 

    How Checkmarx One Developer Assist Implements Agentic for ROI 

    Inline Prevention and Explainable Fixes

    The combination of IDE-native detection and explainable remediation shortens MTTR and reduces Change Failure Rate, two Google DORA metrics with direct Operating Expenses impact. 

    Fewer Tools to Juggle, Clearer Reporting Up the Stack 

    Because Developer Assist is powered by the Checkmarx platform, you get consistent detection across SAST/SCA/IaC/Secrets/Containers with in-IDE guidance, and unified reporting for execs. That reduces swivel-chair time and makes trend reporting credible. 

    Adoption That Sticks 

    If developers don’t trust a tool, it won’t move metrics. Checkmarx content emphasizes just-in-time, in-flow assistance that teaches while fixing, which is critical for sustained adoption and compounding ROI. 

    Your 30-Day Proof Plan (Feel Free to Copy/Paste) 

    Week 1 – Baseline 

    • Extract last-quarter DORA metrics (Lead Time, Change Failure Rate, MTTR). 
    • Pull counts for security-related build failures and average time per failure. 

    Week 2 – Pilot

    • Enable Developer Assist for 1–2 active teams in VS Code/Cursor/Windsurf. 
    • Track: inline fixes applied, PRs with fewer revisions, build failures avoided. 

    Week 3 – Compare

    • Contrast pilot teams vs. control on DORA metrics + failure counts. 
    • Capture anecdotal feedback on explainability and dev flow. 

    Week 4 – Roll-up 

    • Convert time deltas into dollar savings. 
    • Exec slide: “From IDE events → DORA improvement → cost avoided.” 

    FAQs Execs Will Ask (and Concise Answers) 

    Is this just SCA in the editor? 

     No. Developer Assist brings in-IDE guidance backed by the Checkmarx platform across code, dependencies, IaC, secrets, and container descriptors with explainable remediation, not just alerts. 

    How is this different from reactive scanning? 

     It prevents issues before they hit the repo/CI and annotates fixes with context developers understand, improving both MTTR and adoption.  

    Is there analyst alignment for this approach?  

    Yes! Gartner’s AI Coding Security Assistant (ACSA) concept describes exactly this: policy-aware assistants validating code at creation. 

    Close the Loop Between the IDE and the Boardroom 

    Agentic AppSec isn’t a cost center; it’s a throughput engine. With Developer Assist, leaders see cleaner sprints, fewer reruns, faster releases, and measurable MTTR gains, all traceable to in-IDE prevention and explainable remediation. 

    DownloadThe Agentic AI Buyer’s Guide 

    Read: The ROI of Agentic AI AppSec 

    ]]>
    The Agentic Future of AppSec: Measuring Impact and Securing the AI-Powered SDLC https://checkmarx.com/blog/the-agentic-future-of-appsec-measuring-impact-and-securing-the-ai-powered-sdlc/ Sun, 30 Nov 2025 12:28:25 +0000 https://staging.checkmarx.com/?p=105918 Generative AI has accelerated software creation beyond anything the modern SDLC was built to handle. Developers now produce functional code in seconds using assistants like GitHub Copilot, Cursor, and Replit AI. Yet the same speed that fuels innovation also multiplies exposure, sending hidden vulnerabilities, unverified dependencies, and insecure logic into production faster than security can keep up. 

    AppSec isn’t broken, it’s just outpaced. 

    The shift to AI-generated and hybrid code demands a new kind of security approach: one that protects during code creation, not just after it’s written. This is the foundation of Agentic Application Security (Agentic AppSec) powered by AI Code Security Assistance (ACSA). 

    The Modern Threat Landscape: Securing the Point of Creation 

    Risk now originates from the prompt, not necessarily the pipeline. As developers adopt AI tools, organizations are facing threats never seen before in traditional AppSec: 

    Prompt Injection: Malicious inputs that manipulate AI assistants to generate insecure or exfiltrating code. 

    Lies-in-the-Loop (LITL) Attacks: Poisoned training data causing assistants to hallucinate unsafe dependencies. 

    Hallucinated Logic: Code that appears valid syntactically but violates security or compliance policies. 

    Shadow AI: Unauthorized AI tools or assistants contributing code without oversight. 

    Model Poisoning & Insecure Defaults: Using public models or repos that embed unsafe logic patterns by design. 

    Legacy scanning tools can’t catch these. They analyze outputs, not origins. They flag vulnerabilities, not intent. This is why the future of AppSec can no longer rely on reactive scanning – it requires real-time reasoning. 

    From Scanning To Thinking: The Rise of Agentic AppSec 

    Agentic AppSec platforms, like Checkmarx One Assist, redefine software security by embedding autonomous, reasoning agents inside the development process itself. Instead of waiting for code to hit the CI/CD, AI lives directly inside the IDE, understanding context, enforcing policy, and reasoning through risk as developers type. 

    Three Pillars of Chemarx One Agentic Security 

    1. Developer Assist: Validates both AI-generated and human written code inline. Blocks unsafe completions and explains secure alternatives in real time. 
    2. Policy Assist: Applies governance dynamically before code ever leaves the local environment. Aligns AI-generated logic with enterprise and regulatory policies. 
    3. Insights Assist: Correlates developer behavior, policy enforcement, and security telemetry to generate business-level ROI metrics. 

    Together, these agents form a reasoning loop that doesn’t just find problems, but learns from them too. 

    Why Traditional Metrics No Longer Measure Success 

    In the AI era, security teams can’t rely on historical KPIs alone. “Number of vulnerabilities closed” or “SLA adherence” no longer capture true AppSec performance when code volume, origin, and intent have changed. Instead, leading organizations must adopt Agentic AppSec KPIs that quantify speed, adoption, and quality together. 

    Metric  Traditional Focus  Agentic Focus 
    MTTR (Mean Time to Remediate)  Time to patch known vulns  Time to prevent vulnerabilities pre-commit 
    Throughput  Releases per month  Secure releases per month 
    Developer Adoption  Policy compliance  IDE engagement + fix acceptance rate 
    Cost-per-Vulnerability  Average fix cost  Cost avoided through inline prevention 
    Security Drift  # of open CVEs  % of AI-generated code validated at creation 

    Quantified Impact: What Agentic AppSec Delivers 

    • 30–40% faster remediation, thanks to inline prevention and safe refactor guidance. 
    • 20–25% throughput gain due to fewer pipeline breaks and reduced rework. 
    • 35% reduction in cost-per-vulnerability by stopping issues before commit. 
    • 60–70% reduction in dependency-upgrade effort as a result of intelligent package versioning and blast-radius analysis. 
    • 90%+ developer satisfaction driven by real-time explainability and low-noise UX. 

    These metrics aren’t theoretical. They’re based on data collected from early deployments of Developer Assist in enterprise-scale SDLC environments. 

    Linking ROI to Business Outcomes 

    1. Faster Delivery, Lower Risk 

    Preventing vulnerabilities pre-commit means fewer broken builds and faster merges without compromising safety. Organizations using Checkmarx Assist reported up to 2x faster release cadence in AI-assisted projects while maintaining full compliance coverage. 

    2. Measurable Security Efficiency 

    By quantifying avoided rework, failed pipeline reruns, and unplanned incident response hours, teams can translate effective security into business terms. Each prevented vulnerability saves roughly $300–$500 in remediation cost, not counting the downstream CI/CD impact. 

    3. Executive Visibility 

    Agentic AppSec solutions tie developer telemetry directly to business outcomes. Metrics like “AI code validated before commit” and “MTTR reduction  per release” give CISOs, CTOs, and CFOs a shared language for ROI, transforming AppSec from a cost center into a performance driver. 

    The Future: AppGenSec Powered by ACSA 

    Industry analysts have begun defining the next era of secure software development through two complementary lenses: 

    • Forrester’s AppGenSec: Proactive, generative security that embeds protection into the act of code creation. 
    • Gartner’s ACSA (AI Code Security Assistance): Agentic, real-time systems that validate AI and human authored code inline. 

    Together, they redefine the model: AppGenSec powered by ACSA, where autonomous agents reason, remediate, and report in real time. Security no longer just follows code; it needs to think with it. 

    Action Steps: Preparing for the Agentic SDLC 

    1. Map AI usage in your SDLC. Identify where code is being generated by assistants – and where validation doesn’t yet exist. 
    2. Adopt pre-commit validation. Shift security left of CI/CD by integrating IDE-native scanning. 
    3. Correlate AppSec metrics with developer telemetry. Align MTTR and throughput with adoption and release velocity. 
    4. Pilot Agentic Assistants. Start with Developer Assist in a high-velocity team and measure real-world ROI within 90 days. 
    5. Scale governance. Use Policy Assist to enforce assistant-level compliance before code ever merges. 

    If you’re starting to plan your adoption of Agentic AI, make sure to download The Agentic AI Buyer’s Guide to understand how leading organizations are securing AI-assisted development today, and what to look for as this market evolves.

    The Bottom Line: Security Must Evolve as Fast as AI 

    AI won’t wait for security to catch up, and neither will the market. Successful organizations will be those who anticipate vulnerbilities instead of chasing them. By embedding agentic reasoning, policy awareness, and developer-centric UX into every stage of coding, Checkmarx One Assist transforms AppSec from reactive gatekeeping into proactive enablement. Build faster, safer, and smarter with agentic AI. 

    Continue Learning: 

    ]]>
    Confronting Insecure Shadow AI: Six Must-Have Capabilities https://checkmarx.com/blog/confronting-insecure-shadow-ai-six-must-have-capabilities/ Sun, 30 Nov 2025 12:28:01 +0000 https://staging.checkmarx.com/?p=105916 The speed of software delivery is no longer set by pipelines or processes; it’s driven by prompts. Generative AI has transformed how code is created, shared, and deployed, dramatically improving developer productivity. Yet, visibility and governance haven’t kept pace. 

    Developers across enterprises are using GitHub Copilot, Cursor, and Replit AI to generate production code – often outside approved workflows. This invisible layer of AI-authored logic known as Shadow AI, untracked, AI-generated code entering production systems without policy enforcement or security validation. 

    The problem isn’t intent, it’s infrastructure. Traditional AppSec tools were built for pipelines, not for prompts. They see only the output of the development process, never the influence of the assistant that helped shape it. To secure the AI-powered SDLC, organizations need a new kind of platform that’s agentic, context-aware, and developer-native. 

    The Shift From Reactive to Agentic AppSec 

    Legacy AppSec tools scan static artifacts long after code is written. Agentic AppSec tools, live inside the development experience. 

    Agentic AppSec analyzes during the coding process and adapts to developer intent, enforcing organizational policies in real time. This process helps prevent insecure logic before it leaves the IDE and is pushed to production. 

    The distinction is simple but profound: 

    Traditional AppSec  Agentic AppSec 
    Post-commit scanning (SAST, DAST, SCA)  Pre-commit validation and guidance 
    Operates on repositories  Operates inside the IDE 
    Detects known patterns  Understands intent and origin 
    Alerts after merge  Prevents vulnerabilities before merge 
    Static policies  Context-adaptive governance 

    Checkmarx One Developer Assist, powered by AI Code Security Assistance (ACSA), embodies this shift. Its developer-side agents analyze code as it’s written (both human and AI-generated), providing inline fixes, safe refactors, and contextual reasoning without exposing source code outside the customer environment. 

    Evaluating an Agentic AppSec Platform: Six Dimensions That Matter 

    Choosing the right platform means understanding what differentiates “agentic” from “automated.” Below is a practical framework drawn from real Checkmarx deployments and independent buyer evaluations: 

    1. Real-Time, Intent-Aware Validation 

    Agentic systems don’t just parse syntax; they interpret intent. 

    • Do they run continuously as developers write and modify code (not only at save or commit)?  
    • Can they correlate completions to assistant influence and block insecure logic inline?  
    • Do they explain why a fix is necessary, linking to policy, CVE, or data-flow context?  
    • Are unsafe suggestions from AI assistants intercepted before PR submission? 

    Example: Developer Assist recognizes when Copilot-generated code inserts an outdated encryption algorithm. It blocks the suggestion, explains the risk, and recommends a compliant alternative, all within the IDE. No context switching required. 

    2. Developer-Centric UX and Trust 

    Adoption is critical. A technically strong tool that developers ignore provides zero ROI. 

    • Is setup frictionless across IDEs like VS Code, JetBrains, Cursor, Windsurf, or Eclipse?  
    • Are results explainable, with clear diffs and one-click safe refactors?  
    • Can developers adjust noise levels, suppress false positives, or override with justification?  
    • Is latency low enough (<200 ms feedback) to maintain flow? 

    When developers realize that security can accelerate rather than interrupt their work, adoption skyrockets. 

    3. Governance, Explainability, and Auditability 

    Agentic AppSec doesn’t governance, it embeds it instead. 

    • Can roles, policies, and severities be defined per team, repo, or language?  
    • Are AI actions logged and explainable ( e.g. “flagged due to unsafe deserialization pattern; see rule 143”)?  
    • Can leaders audit overrides and monitor security drift over time?  
    • Does the system provide policy compliance dashboards for SOC 2, FedRAMP, or ISO 27001 mapping? 

    Governance isn’t a separate console anymore; it’s a continuous feedback layer between the developer and the enterprise. 

    4. Shadow AI Detection and Control 

    Every AI assistant represents a new integration surface and potential risk vector. Shadow AI occurs when developers use GenAI tools that generate or insert code outside sanctioned workflows. So even if the final code passes syntax checks, it may contain hidden dependencies, unvetted packages, or logic trained on insecure repositories. 

    Key capabilities to demand: 

    • Detection: Identify AI-authored snippets by token pattern, prompt signature, or model fingerprint. 
    • Attribution: Map completions to the tool of origin (Copilot, Replit, Cursor, Windsurf). 
    • Risk Scoring: Flag AI-influenced logic that bypasses review or policy validation. 
    • Policy Enforcement: Block commits from unapproved assistants or require inline re-validation. 
    • Reporting: Provide dashboards showing AI usage by team, repo, or project. 

    Given the ubiquitous adoption of Gen AI coding practices, shadow AI isn’t a hypothetical risk anymore. 

    5. ROI and Throughput Gains 

    Agentic AppSec doesn’t just shift when vulnerabilities are found; it also changes how much they cost to fix. 

    According to Checkmarx’s internal ROI analysis (2025): 

    • MTTR improved by 30–40% with inline remediation versus post-merge fixes. 
    • Development throughput increased by 20–25% due to fewer broken builds and CI/CD reruns. 
    • Cost-per-vulnerability dropped by 35%, with early detection eliminating redundant rework cycles. 
    • Safe Refactor capabilities cut dependency-upgrade effort by up to 60–70%, reducing technical debt at scale. 

    These metrics correlate directly with improved DORA outcomes, including faster lead time for changes, reduced change-failure rate, and higher deployment frequency. 

    6. Ecosystem and Integration Fit 

    No AI agent operates in isolation. An effective platform must connect seamlessly across your engineering stack: 

    • IDEs: VS Code, JetBrains, Cursor, Eclipse, Windsurf. 
    • Version Control: GitHub, GitLab, Bitbucket. 
    • CI/CD: Jenkins, Azure DevOps, CircleCI, GitHub Actions. 
    • Package Managers: npm, PyPI, Maven, and Go modules with real-time SCA policy checks. 
    • SIEM/SOAR: Splunk, ServiceNow for alert ingestion and incident correlation. 

    Checkmarx’s open APIs enable these integrations while maintaining strict data sovereignty. No source code leaves the customer’s environment. 

    These six dimensions align closely with the broader evaluation framework outlined in The Agentic AI Buyer’s Guide, which covers Agentic AppSec, Shadow AI governance, and AI Code Security Assistance (ACSA) in depth. This free guide helps benchmark vendors, capabilities, and ROI expectations when adopting AI.

    The Shadow AI Reality: Unseen Code, Unscanned Risk 

    Picture this scenario: a backend developer experimenting with Cursor generates a new authentication handler. Cursor auto-imports an outdated JSON-web-token package containing a known CVE. Because the commit passes linting and functional tests, it merges successfully, but the vulnerability isn’t caught until weeks later, when CI/CD scanning reveals it post-deployment.  

    That’s the shadow AI gap. Developers weren’t careless – the tooling chain wasn’t built to recognize intent or origin. Agentic AppSec platforms close that gap by embedding reasoning at the moment of creation – before commit, before merge, before any exposure. 

    Building an Evaluation Shortlist 

    When comparing vendors, prioritize the following questions: 

    1. Does the platform operate natively within the IDE and correlate assistant influence?  
    2. Can it enforce pre-commit policy gates without sending code externally?  
    3. Does it quantify throughput and MTTR gains with customer-verified data?  
    4. Is explainability built in, can every decision be traced and justified? 

    Ask for proof, not promises – real customer metrics, not theoretical benchmarks. 

    The Business Case: Why It Matters Now 

    The economics of software delivery are shifting fast. AI has removed the bottleneck of creation, but not the cost of correction. Every vulnerability found after commit costs exponentially more to fix, and the gap only widens with each assistant-authored line of code. 

    By shifting validation left of commit, agentic AppSec platforms deliver measurable ROI: 

    • Security Debt Reduction: Early prevention reduces accumulated risk.  
    • Velocity Retention: Inline fixes avoid blocking developers mid-flow.  
    • Regulatory Alignment: AI governance satisfies evolving compliance mandates.  
    • Cross-Team Synergy: Security, DevOps, and compliance work from shared telemetry. 

    In other words, security finally scales with speed. 

    Closing the Loop: Visibility, Velocity, and Verification 

    Shadow AI isn’t going away. If anything, the next generation of AI assistants will be more autonomous, creative, and capable of introducing even more subtle vulnerabilities that bypass traditional defenses. 

    Agentic AppSec turns risk into resilience. By validating intent, governing policy, and embedding reasoning directly inside the developer’s workspace, platforms like Checkmarx One Assist transform AppSec from a reactive gate into a proactive guide. 

    The result: fewer vulnerabilities, faster releases, and a measurable reduction in AppSec overhead without slowing innovation. 

    Next in the series: Measuring Impact and Securing the AI-Powered SDLC.

    ]]>
    Why Context Is the New Code: Building AI-Resilient AppSec From the IDE https://checkmarx.com/blog/why-context-is-the-new-code-building-ai-resilient-appsec-from-the-ide/ Sun, 30 Nov 2025 12:27:11 +0000 https://staging.checkmarx.com/?p=105914 AI doesn’t just speed up software delivery; it rewires it. 

    Code is no longer meticulously handcrafted line by line. It’s assembled through prompts, completions, refactors, and pattern reuse across dozens of tools that rarely speak the same language. This transformation has made context, not code, the new foundation of application security.  

    Agentic AppSec, powered by AI Code Security Assistance (ACSA), works autonomously alongside developers, safeguarding what legacy scanners miss: the origin, intent, and policy context behind each generated line of code. 

    Legacy Scanning Fails to Track Origin and Intent 

    In the age of AI, it is problematic that traditional scanners can’t answer three critical questions:

    • Who wrote this line? 
    • Why was it added? 
    • Under what policy? 

    Without this context, AppSec is reactive, not preventive. Traditional AppSec tools were designed for static repositories, not dynamic, AI-driven co-creation. They excel at scanning what code does but miss why it exists and how it was created. When developers rely on Copilot, Replit, or Cursor, those assistants generate logic that may appear flawless, may also subtly violate architectural or security assumptions. 

    Post-commit scanning introduces three fundamental security blind spots: 

    • No visibility into origin or assistant influence
      Was the logic written by a developer, generated by AI, or blended? Without this metadata, scanners can’t differentiate trustworthy code from potentially hallucinated logic. 
    • No detection of unapproved tool usage (shadow AI)
      Teams often don’t know which completions were accepted, rejected, or modified, leaving the risk of blind spots and compliance gaps. 
    • No contextual correlation to developer intent
      A scanner can flag an unsafe crypto library, but it can’t tell you that the developer was prompted by an AI suggestion that bypassed organizational policy. 

    AI has fundamentally shifted the attack surface “from code to context.” Security must now validate more than just syntax and patterns, but also intent, assistant behavior, and adherence to secure-by-design principles as code is created, not weeks later in CI. 

    Defining Context-Aware Validation 

    Modern AppSec can no longer rely on static rules or pattern-matching alone. With AI assistants contributing to live code, security must understand the “why” and “how” behind every line, not just the “what.” Context-aware validation bridges that gap by connecting code behavior with its origin, purpose, and policy alignment. 

    So, what exactly does “context” mean inside a modern, AI-assisted SDLC? 

    The Five Dimensions of Context 

    Context Dimension  What It Captures  Why It Matters 
    Origin  Whether code was written by a human, AI-generated, or hybrid  AI completions can replicate unsafe patterns or hallucinate insecure dependencies 
    Intent  The purpose behind the change (new feature, refactor, package upgrade)  Security risk varies dramatically based on intent and data flow 
    Dependencies  Libraries recommended or generated by the assistant  AI-suggested packages can introduce malicious or outdated components 
    Policy Alignment  Organizational, regulatory, and licensing rules  Prevents policy drift and compliance risk before merge 
    Assistive Behavior  How AI tools were prompted and used  Enables traceability, drift detection, and governance 

    Together, these dimensions create a live map of how code evolves; connecting human intent, AI influence, and organizational policy in real time. This means your AppSec posture must interpret how and why a completion entered the codebase, connecting origin to outcome. 

    Why Context Matters for Security 

    Let’s take a common example: two different uses of the same API. 

    A human developer safely implements a crypto function, validating inputs, and managing key rotation. An AI assistant, trained on uncurated code, inserts the same API with insecure defaults. To a static scanner, both look nearly identical. A context-aware engine would recognize that the second instance came from an AI completion, correlate it with dependency history, and flag the lack of policy-aligned handling. It would also explain the reason for the alert: “This pattern originated from an AI assistant and violates internal crypto policy XYZ.” This example highlights the key difference between noisy alerts and actionable intelligence. 

    Evaluating Tools for Context-Aware Validation 

    If you’re assessing solutions that claim to provide “real-time AI security,” use the following framework to separate genuine agentic AppSec systems from traditional scanners with new labels: 

    1. Intent-Aware Validation 

    • Does it run during code creation, not just after commit?  
    • Can it block or guide unsafe completions pre-PR?  
    • Does it treat AI-generated code as a first-class input, not just text?  
    • Can it detect differences in “same API, different posture”?  
    • How much latency does it add inside large IDE projects? 

    Developer Assist Example: The Checkmarx One Developer Assist engine operates inline within VS Code and JetBrains. It flags insecure logic before commit, explains why, and suggests an inline fix – all without sending source code outside your environment. 

    2. Developer Experience and Trust 

    Security only works when it fits developer flow:

    • Are alerts contextual and editable inline?
    • Do explanations make sense in plain English, not scanner jargon?
    • Can teams tune signal-to-noise by severity, language, or repo?
    • Are suggested diffs small, testable, and reversible?

    A trusted agent feels like a senior engineer reviewing your PR, not a tool for scolding your IDE. 

    3. Governance and Explainability 

    Real security requires visibility for both developers and leadership:

    • Can roles and policies be scoped to repos, teams, or languages?
    • Are AI actions traceable and auditable?
    • Does it explain why a completion was blocked or altered?
    • Are overrides logged with justification and timestamp?
    • Does it provide drift detection to highlight shifts in behavior or coverage?

    4. Shadow AI Management 

    Unapproved AI use introduces real governance risk:

    • Can it detect Copilot, Cursor, or Replit AI usage across repos?  
    • Can it identify patterns that match known AI code templates?  
    • Does it surface hidden dependencies or untracked logic chains?  
    • Are shadow AI trends visible per team, repo, or language?  
    • Does it provide compliance-ready reports? 

    5. ROI and Throughput 

    Inline remediation produces measurable results when done right. In 2025 production pilots, Developer Assist has shown: 

    • Up to 30% reduction in mean time to remediate (MTTR) through inline, explainable fixes. 
    • 20–25% improvement in development throughput by reducing broken builds and CI reruns. 
    • ~35% drop in cost per vulnerability when issues are prevented pre-commit versus post-merge. 

    These aren’t abstract metrics but proven reclaimed developer time and avoided security debt. For more detailed evaluation, download the The Agentic AI Buyer’s Guide for further criteria, governance requirements, and ROI metrics that distinguish truly agentic, context-aware AppSec platforms from traditional scanners with new labels.

    Why Developers Should Care 

    For developers and engineering managers, context-aware validation isn’t about checking boxes; it’s about regaining control over your workflow:

    Fewer broken builds. Catch security flaws and dependency risks before they enter CI/CD. 

    Less context switching. No need to jump between IDE and scanner portals. 

    Smarter dependency management. Know the blast radius of each import before you commit. 

    Faster delivery, safer outcomes. AI-assisted velocity without rework or regressions. 

    The fastest teams aren’t the ones that skip security; they make it invisible, intuitive, and built into the moment of creation. 

    Real-World Example: From Completion to Compliance 

    Consider this example. 

    A developer prompts Cursor to build a REST endpoint. Cursor suggests a handler and a convenience parser package. Developer Assist immediately identifies the package’s vulnerable version, explaining: “This version contains CVE-2024-XXXX and violates dependency policy. Recommend v3.2.1.”  

    The developer accepts the fix. Then the engine flags an eval() call inserted by the assistant’s template, explaining: “Dynamic evaluation may expose unsanitized inputs. Replace with validated input flow.” A one-click fix securely rewrites the function, clean, compliant code is committed, and ready to merge with zero rework. The Assist workflow demonstrates context-aware security in action: the agent understood context, prevented risk and accelerated delivery 

    Preparing Your Organization for Context-First AppSec 

    Here are five strategic steps to implement  with your DevSecOps teams: 

    1. Map your AI workflows. Identify where assistants influence code generation. 
    2. Define policies inline. Example: “AI-generated code must pass in-IDE validation before commit.” 
    3. Start with a pilot. Measure rework avoided, MTTR, and build success rates. 
    4. Correlate with DORA metrics. Show faster, safer releases, not just fewer findings. 
    5. Scale with trust. Keep latency low, explanations local, and developer autonomy intact. 

    Security is no longer just shifting left but shifting into the act of coding. The IDE has become the new security perimeter, and context is the new code. By embedding Agentic AppSec through Developer Assist, teams can code at AI speed without losing control.  This approach closes the AI code security gap not by slowing developers down, but by empowering them to move fast while staying secure. 

    Next in the series: Six Must-Have Capabilities in an AppSec Platform to Confront the Rise of Insecure Shadow AI.

    ]]>
    GenAI Software Supply Chain Security Gap: Why Traditional AppSec Can’t Keep Up https://checkmarx.com/blog/genai-software-supply-chain-security-gap-why-traditional-appsec-cant-keep-up/ Sun, 30 Nov 2025 12:26:23 +0000 https://staging.checkmarx.com/?p=105910 Every day, development teams are generating more code than ever, but a growing portion is now being written by generative AI. Generative AI assistants like GitHub Copilot and Replit  AI and others are writing boilerplate code, refactoring modules, and even generating entire functions. This surge in productivity also brings new risks. 

    Traditional application security (AppSec) tools follow a simple cycle: write code, commit, scan, and fix. This model works when humans write the code, scan the repository post-commit, and then security teams triaged. However, when AI starts writing significant portions of the application, that pipeline changes. 

    In this article we explore how GenAI changed the software supply chain and created a new “security gap,” that traditional scanners fail to protect. 

    How GenAI is Reshaping the Software Supply Chain 

    The modern software supply chain now begins at the prompt. In today’s workflow, this developer sequence is almost universal: 

    1. A developer prompts their AI assistant. 
    2. The model emits code almost instantaneously. 
    3. The developer accepts, modifies, and commits. 

    While this workflow feels frictionless and boosts velocity, code generation has quietly become the new entry point for risk. 

    Vulnerabilities that once emerged downstream in outdated dependencies, open-source packages, or legacy codebases, now originate upstream from creation time. GenAI is known for accelerating coding, but at the same time also accelerates the introduction of logic errors, insecure defaults, and unvetted dependencies that can slip into builds and production. 

    Checkmarx’s 2025 State of Code Security Survey found that while 99% of development teams now use AI-assisted coding tools, only 29% have implemented formal AI security controls. In other words, nearly every development team is using AI to generate or modify production code, but only a fraction is properly monitoring how that code is being produced, validated, and reviewed. 

    This lack of oversight has redefined the security perimeter. Code that once flowed through structured write and review cycles is now written in seconds by a non-human contributor, without the typical visibility that developers and AppSec teams previously relied on. 

    The result is a new class of vulnerabilities: syntactically correct, functionally sound, yet logically unsafe. These flaws compile, pass tests, and even ship to production, but they can violate security policies or expose sensitive logic that scanners miss until it’s too late. 

    The Software Supply Chain Has a New Upstream Input 

    Traditionally, the software supply chain referred to open-source packages and dependencies entering a build, following a predictable sequence: source, build, dependencies, and deploy. Security programs were designed around this flow – scanning code as it moved downstream, validating open-source components, and monitoring what entered the build. But now, AI-generated code has become a new upstream source in the supply chain. 

    The software supply chain now begins long before a build ever runs. Each AI-assisted coding session can introduce a new upstream input that never existed in traditional development models: 

    • Code completions and scaffolding created by large language models (LLMs) 
    • AI-suggested dependencies and third-party packages added automatically 
    • Automated refactoring and template rewrites performed by assistants such as Copilot, Cursor, or Windsurf 

    Every one of these elements has become part of the modern software supply chain, but they often bypass manual review and existing AppSec gates. This creates a blind spot at the point of code creation, inside the IDE itself. To close this gap, security coverage must begin earlier and extend all the way into the development environment where GenAI actually operates.  This gap at the creation layer is where the most critical risks now emerge. 

    The Emerging Security Gap

    Vulnerabilities Slip in Unseen  

    AI generated code may follow the correct syntax correctly, but it doesn’t always follow best security practices. Consider this: 

    • The AI scaffolds an endpoint with eval() or insecure default configs. 
    • It pulls in a package from npm or PyPI with known vulnerabilities because the assistant “thought it matched”. 
    • It ignores enterprise-specific security standards because the prompt didn’t specify them. 

    These scenarios highlight risk before the code even hits the repository. Checkmarx recognizes that large language models weren’t trained in secure-coding practices and that “slick-looking code” can still collapse under attack. Additionally, typical SAST tools only reviews code once committed, creating a security blind spot. 

    Why Post-Commit Scanning Fails 

    Traditional AppSec relies on the scan after commit model. In AI-assisted development, that’s like checking the locks after the intruder’s already inside: by the time a SAST or SCA tool runs, the vulnerable logic is merged or possibly even deployed. 

    There are several reasons: 

    • Latency: By the time a scan runs, the code is merged and maybe deployed. Fixing now is more expensive. 
    • Context loss: The prompt and both AI and developer edits often leave no trace. Scans only see the code, not how it was created. 
    • AI behavior: Code produced by AI may not map to traditional heuristics; it may use unusual patterns or mix libraries in novel ways. Static pattern scanners may miss these. 

    In a recent blog about the IDE as a critical attack surface, we argued that DevSecOps teams must “shift detection to the moment vulnerabilities are introduced.” This shift from reacting  to vulnerabilities after they appear to preventing them as they’re written,  defines the new standard of AI-aware, developer-first security. 

    New Attack Vectors: LITL, Prompt Injection, Shadow AI 

    As AI becomes a co-author in software creation, the attack surface is expanding in subtle and dangerous ways. Traditional vulnerabilities like SQL injection and buffer overflows haven’t disappeared, but they’ve been joined by new, AI-specific threats that exploit not only how generative systems think, but also how they code. 

    • Lies-in-the-Loop (LITL): manipulates how AI coding assistants learn from feedback, injecting unsafe logic that looks valid to both the developer and the model.  
    • Prompt Injection: Attackers manipulate inputs to assistants, so they generate insecure code or embed malicious logic.  
    • Shadow AI: Developers use unsanctioned assistants (Copilot, Replit AI) bypassing security review – introducing unvetted logic and packages. 

    Checkmarx has documented these attack vectors in depth and continues to warn that prevention must start at the coder’s cursor, not after deployment. Defending against these emerging risks means embedding security in real-time, where AI and human developers meet: the IDE. 

    The Organizational Risks of “Security Drift” 

    When developers use GenAI to generate code but continue choosing tools without the oversight this new workflow requires, the organization faces “security drift.” Practices begin to diverge from centralized policy, often without visibility. 

    Consider the data: 60% of organizations say GenAI usage is unapproved, yet still occurs. This means that developers are using unsanctioned assistants that introduce risky dependencies and create AI-generated code that bypass SCA (software composition analysis) and secrets scans. Security teams often only identify these vulnerabilities only after they’re deployed. 

    This drift isn’t just technical; it’s cultural. When the speed of delivery outpaces the speed of review, risk becomes embedded. 

    Why Traditional AppSec Tools Can’t Plug the Gap 

    1. Post-commit scanning is too late
      By the time a scanner runs, the damage is done. Control flow may be established; packages imported, secrets exposed. Fixing becomes more expensive.  
    2. Tool origin ignorance
      Most tools do not distinguish between manually written and AI suggested code. They treat all code equally, which means patterns unique to AI-generated code (e.g., hallucinated dependencies, repeated boilerplate, prompt-inference flaws) may go undetected.  
    3. Fragmented workflows
      Development workflows vary from IDE completions, CLI generators, AI notebooks, hybrid teams. Traditional scanners remain focused on repo or CI/CD, not the moment code is written. 

    Reframing What Secure Software Development Means 

    To fill this security gap, organizations must rethink: 

    • When security validation happens (shift far left). 
    • Where it happens (inside the editor/IDE, not just commit). 
    • Who and what it protects (humans and AI assistants). 

    The new development model, dotted with AI suggestions, demands security that understands intent, not just syntax. Validation has to happen during code creation,  regardless of whether it’s written by human or an AI assistant. 

    Tools like Checkmarx One Dev Assist address this new reality by analyzing dependencies, secrets, and logic inline, enforcing policy before code ever leaves the IDE. As code is authored, whether by human or AI, an intelligent engine validates intent, dependencies, secrets, API usage in real time. Checkmax One Developer Assist delivers what modern AppSec demands: real-time validation of logic and dependencies, inline fixes, and prevention rather than remediation.  

    Action Items for Developers Using AI

    Here are actionable steps for engineering leaders and developer teams: 

    Audit your use of AI assistants 

    • Which tools are being used? Are they sanctioned? 
    • Are completions being reviewed or audited? 

    Embed security in the authoring phase 

    • Choose tools that scan as code is typed or generated, not just at commit. 
    • Highlight unsafe API patterns (e.g., eval(), permissive configs) at generation time. 

    Extend SCA and secrets scanning into AI-generated code 

    • Ensure the platforms you use detect packages brought in via assistant suggestions, validate versions, assess “blast radius”. 

    Establish governance for GenAI use 

    • Define which assistants are allowed. 
    • Track usage, flag unapproved tools (shadow AI). 
    • Embed policy enforcement into the workflow. 

    Measure uplift and risk reduction 

    • Track vulnerabilities found pre-commit vs post-commit. 
    • Monitor time-to-remediate (MTTR) for AI-generated code. 
    • Use developer productivity metrics to justify investment in new tooling. 

    Want a structured framework to operationalize these steps across teams? The Agentic AI Buyer’s Guide includes evaluation questions, governance requirements, and ROI metrics to help standardize GenAI security controls across the SDLC.

    Next Steps 

    The software supply chain is no longer just about managing dependencies and deployment pipelines. Application security needs to begin at the cursor; the moment code is generated. GenAI has sped up development, but it also accelerated vulnerability introduction. And that needs to be addressed. 

    If your AppSec strategy still relies on post-commit scans, you’ll be stuck chasing issues that are already embedded in your codebase. Security must move upstream by validating intent, dependencies, secrets, and patterns in real time, inside the IDE.  

    Further Learning: Read the companion article “Why Context Is the New Code: Building AI-Resilient AppSec From the IDE” to discover a practical framework for securing your applications in this new world. 

    ]]>