Every technology market eventually reaches the point where its vocabulary hardens into something that can be measured, compared, and cited. Cybersecurity is no different. The term Secure Email Gateway emerged in the early 2000s and became the industry's shared reference point for two decades — a generation of technology with a recognisable architecture, a common set of limitations, and an agreed-upon replacement cycle.
We are now at the same inflection point for email security's third architectural generation. The term Gen 3 email security is beginning to appear in vendor documentation, analyst briefings, and procurement RFPs — but without a stable, agreed-upon definition. That ambiguity is a problem. Vendors retrofit the label onto Generation 2 platforms with a thin layer of AI marketing. Buyers cannot evaluate competing claims against consistent criteria. Analysts cannot distinguish genuine architectural advances from incremental feature updates.
This post is an attempt to fix that. What follows is a precise, criteria-based definition of Gen 3 email security — grounded in what actually changed about the threat environment, what prior generations failed to address, and what architectural characteristics separate genuine Gen 3 platforms from rebranded predecessors.
Generation 3 email security is not a product category. It is an architectural classification — defined by how a platform reasons about email threats, not by which features it offers.
The first generation of enterprise email security was built to solve a specific, well-understood problem: high-volume, indiscriminate attacks. The defining architecture was the Secure Email Gateway (SEG) — a system that sat inline between the internet and the corporate mail server, inspecting every inbound message before delivery.
Gen 1 platforms shared four core characteristics: inline deployment between the internet and the corporate mail server, signature-based matching of known malicious payloads, reputation blocklists for bad domains and URLs, and inspection focused on what each message contained.
The major vendors of the Gen 1 era built genuinely effective platforms for the threat environment they were designed for. That environment was characterised by mass-distributed, payload-carrying attacks: malicious attachments, known bad URLs, spam at scale. Signature matching worked because attacks were repetitive. Blocklists worked because malicious domains stayed malicious long enough to be catalogued.
Gen 1 began to fail when the threat environment changed. Business Email Compromise (BEC) — attacks that carry no payload, no malicious URL, and no signature — arrived in volume around 2015–2016. A gateway that looked for what an email contained was structurally blind to attacks that weaponised who an email appeared to come from. A message asking a finance controller to wire money to a vendor account contains nothing a signature-based system can flag. Gen 1 could not detect it.
The second generation was a direct architectural response to Gen 1's BEC blind spot. The foundational insight was this: if you cannot detect an attack by examining the content of a single message, you need to examine the context of that message — specifically, whether the sender, the request, and the recipient relationship make sense against an established baseline of normal behaviour.
Gen 2 platforms shared four core characteristics: behavioural baselines of normal sender activity, mapping of sender-recipient communication relationships, statistical anomaly detection against those baselines, and machine-learning models applied as supporting signals.
Gen 2 solved the BEC problem that Gen 1 could not. For organisations moving off legacy SEGs, the improvement in detection of socially engineered attacks was substantial.
Gen 2's limitation is structural, not a product failure. Behavioural baseline models answer the question: is this email consistent with what we have seen before from this sender? That question becomes unanswerable — and therefore meaningless — in three increasingly common scenarios: a first-contact sender with no prior communication history, a compromised legitimate account whose behaviour still matches its established baseline, and an AI-generated attack personalised enough that it deviates from no statistical norm.
Gen 2 asks: is this unusual for this sender? Gen 3 asks: what is this email trying to make the recipient do — and is that action consistent with legitimate business context? These are fundamentally different questions with fundamentally different answers.
Generation 3 email security emerged from a recognition that neither signature matching nor behavioural baselining could reliably detect the class of attacks that now dominates the enterprise threat landscape: AI-generated, zero-signal, high-personalisation campaigns that produce no known indicators of compromise and no statistical deviation from any baseline.
The architectural shift in Gen 3 is not incremental. It represents a different question being asked of every email. Rather than asking what does this email contain? (Gen 1) or is this email unusual for this sender? (Gen 2), Gen 3 asks: what is this email trying to accomplish — and is that action consistent with legitimate business purpose?
This shift from signal detection to intent reasoning requires a different underlying technology. It cannot be accomplished with signature databases or statistical anomaly models. It requires a system capable of semantic comprehension — understanding language the way a trained analyst would, in context, with awareness of social engineering patterns, financial fraud methodologies, and the specific organisational environment in which the email arrived.
1. LLM-native analysis at the core.
Gen 3 platforms use Large Language Models as the primary detection engine — not as a feature added to an existing architecture, but as the foundation. The LLM reads the email semantically, understanding tone, urgency, anomalous requests, impersonation cues, and the logical structure of the social engineering attempt. This is distinct from Gen 2 platforms that use BERT-class models as a supplementary signal within a broader behavioural anomaly pipeline.
2. Intent classification, not content matching or baseline comparison.
Every email is classified by its underlying intent: what action it is designed to produce in the recipient, and whether that action serves a legitimate business purpose. A tax-season phishing email from a brand-new domain asking a finance controller to scan a QR code to verify their M365 credentials has a classifiable intent — financial credential harvest — regardless of whether the domain appears on any blocklist or whether the sender has any prior communication history with the organisation.
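The shape of an intent-first verdict can be sketched in a few lines. This is a minimal illustration, not any vendor's implementation: the keyword heuristic below merely stands in for the LLM reasoning step, and every name here (`Intent`, `classify_intent`, the example domain) is hypothetical.

```python
from dataclasses import dataclass
from enum import Enum

class Intent(Enum):
    BENIGN = "benign"
    CREDENTIAL_HARVEST = "financial_credential_harvest"
    PAYMENT_REDIRECT = "payment_redirect"

@dataclass
class Email:
    sender_domain: str
    subject: str
    body: str

def classify_intent(email: Email) -> Intent:
    # Stand-in for the LLM step: a real Gen 3 pipeline would read the message
    # semantically; these keyword cues only illustrate the *shape* of the
    # output -- an intent label, independent of blocklists or sender history.
    text = (email.subject + " " + email.body).lower()
    if "qr code" in text and ("verify" in text or "credentials" in text):
        return Intent.CREDENTIAL_HARVEST
    if "wire" in text and "account" in text:
        return Intent.PAYMENT_REDIRECT
    return Intent.BENIGN

# The tax-season example from the text: the intent is classifiable even though
# the domain is brand-new and appears in no blocklist or communication graph.
msg = Email("tax-help.example", "Action required",
            "Scan this QR code to verify your M365 credentials before filing.")
print(classify_intent(msg).value)  # financial_credential_harvest
```

Note that nothing in the function consults reputation data or prior history; the verdict falls out of the message itself, which is the point of the intent-classification criterion.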
3. Dual-evidence reasoning.
Gen 3 platforms analyse both evidence of malicious intent and evidence of legitimate context simultaneously. The system does not simply ask whether an email is suspicious. It asks whether the sum of positive and negative evidence resolves to a coherent, explainable threat verdict. This dual-evidence approach is what enables low false positive rates in environments with high volumes of legitimate traffic that might otherwise trigger anomaly flags.
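One way to picture dual-evidence resolution is as a weighted sum over signed evidence, where legitimate context carries negative weight. This is a toy sketch under that assumption; the weights, threshold, and `resolve` function are invented for illustration and do not describe any real product's scoring.

```python
from dataclasses import dataclass

@dataclass
class Evidence:
    description: str
    weight: float  # positive supports malice, negative supports legitimacy

@dataclass
class Verdict:
    threat: bool
    rationale: list

def resolve(evidence: list[Evidence], threshold: float = 1.0) -> Verdict:
    # Sum both sides rather than scoring suspicion alone: strong legitimate
    # context can outweigh weak anomaly signals, which is what keeps false
    # positives low in noisy but benign traffic.
    score = sum(e.weight for e in evidence)
    return Verdict(threat=score >= threshold,
                   rationale=[e.description for e in evidence])

evidence = [
    Evidence("Requests credential entry via QR code", +0.9),
    Evidence("Sender domain registered this week", +0.6),
    Evidence("Thread references a real, ongoing vendor contract", -0.4),
]
print(resolve(evidence).threat)  # True: 0.9 + 0.6 - 0.4 clears the threshold
```

The design choice worth noticing is that the rationale list survives into the verdict, so the same structure that produced the decision can later explain it.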
4. Explainable, analyst-readable verdicts.
Because Gen 3 detection is built on reasoning rather than pattern matching, every verdict can be expressed in natural language. A security analyst reviewing a quarantined message sees not a risk score and a rule reference, but a plain-language explanation of what the email was attempting, which specific signals supported that classification, and what action was recommended. This explainability is architecturally native to Gen 3, not a reporting layer added on top.
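A hypothetical rendering step shows what "analyst-readable" means in practice: the verdict's components map directly onto sentences, with no risk score or rule ID in sight. The function name, arguments, and wording below are assumptions for illustration only.

```python
def explain(intent: str, signals: list[str], action: str) -> str:
    # Render the verdict as the plain-language summary an analyst would read,
    # built from the same intent label and evidence that produced the decision.
    lines = [f"This message attempts: {intent}.", "Supporting signals:"]
    lines += [f"  - {s}" for s in signals]
    lines.append(f"Recommended action: {action}.")
    return "\n".join(lines)

print(explain(
    "financial credential harvest",
    ["QR code leading to a credential prompt",
     "newly registered sender domain",
     "urgency framing around a tax deadline"],
    "quarantine and notify the recipient's security team"))
```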
5. Zero-history detection capability.
Gen 3 platforms detect threats from first-contact senders with no prior communication history, compromised legitimate accounts behaving normally, and entirely novel attack methodologies that have never appeared in any training dataset. The detection does not depend on what has been seen before. It depends on the reasoning capability applied to what is in front of the system right now.
The emergence of Gen 3 as a distinct category is not a product marketing decision. It is a response to a specific and measurable change in the threat environment.
Phishing-as-a-Service platforms now make it trivial for low-skilled threat actors to deploy adversary-in-the-middle (AiTM) attacks that bypass MFA entirely, generate unique QR codes per recipient to defeat URL reputation systems, host malicious payloads on legitimate infrastructure such as OneDrive and Eventbrite, and run multi-step kill chains specifically designed to defeat sandbox automation. These capabilities were specialised tradecraft in 2020. They are commodity features in 2026.
In this environment, Gen 1 produces no signal because there is nothing to match. Gen 2 produces no signal because there is no baseline to compare against. The attacks are specifically engineered to leave no footprint that either generation can detect.
The only detection layer that produces a signal is one that reasons about what the attack is trying to do — and Gen 3 platforms are the only ones architecturally capable of that reasoning.
Given the incentive for any email security vendor to claim Gen 3 positioning, buyers and analysts evaluating platforms should apply five tests, one per criterion above: Is the LLM the primary detection engine, or a supplementary signal bolted onto an older pipeline? Does the platform classify the intent of each message, rather than matching content or comparing against a baseline? Does it weigh evidence of legitimacy alongside evidence of malice before reaching a verdict? Can every verdict be read as a plain-language explanation rather than a risk score and a rule reference? And can it detect a threat from a first-contact sender with no prior communication history?
The history of enterprise security is a history of categories hardening into definitions after the market matures enough to demand them. Firewall, IDS, EDR, SIEM — each of these terms went through a period of definitional ambiguity before settling into shared criteria that buyers could use for evaluation and vendors could use for positioning.
Gen 3 email security is at that inflection point now. The threat environment has moved decisively beyond what Gen 1 and Gen 2 architectures were designed to handle. The architectural response — LLM-native intent reasoning — is distinct enough from its predecessors to warrant a stable, agreed-upon category definition.
The definition proposed here is not proprietary. It is an attempt to give the market a shared vocabulary for a real architectural distinction. Vendors who meet the criteria should own the label. Vendors who do not should not be permitted to claim it.
The question worth asking your current vendor is a version of the one Gen 3 platforms answer about every email: what exactly is this system trying to accomplish — and is the architecture behind it actually capable of that?
Is Gen 3 just another name for 'AI email security'? Not exactly. Every email security generation has incorporated AI in some form — Gen 1 platforms use machine learning for spam classification, Gen 2 platforms use ML for behavioural baselining and anomaly detection. Gen 3 is a specific architectural classification: it means the platform uses Large Language Models as the primary detection engine, performing semantic intent reasoning rather than pattern matching or anomaly detection. 'AI email security' is a broad marketing term. Gen 3 is a precise architectural criterion.
Does adding LLM features to a Gen 2 platform make it Gen 3? No. Many Gen 2 platforms have added LLM-powered features as supplementary signals within an existing behavioural anomaly architecture. This does not make them Gen 3. The classification requires the LLM to be the primary reasoning engine — not a feature sitting on top of a signature database or a baseline comparison model. The test is whether the platform can detect a first-contact phishing email from a brand-new sender with no prior history, using only the content and context of the email itself.
Does Gen 3 make Gen 2 obsolete? Not obsolete, but increasingly insufficient as a standalone defence against the current threat landscape. Gen 2 platforms remain effective at detecting anomalies in sender behaviour and protecting against attacks on known communication relationships. Their structural blind spot is first-contact attacks, compromised legitimate accounts, and zero-signal campaigns — which now represent a growing proportion of enterprise email threats. Many organisations will run Gen 2 and Gen 3 layers simultaneously during the transition period.
Who defined the three-generation framework? The framework — Gateway era (Gen 1), Behavioural era (Gen 2), Intent Reasoning era (Gen 3) — was formalised by StrongestLayer as a way to describe the architectural distinction between its own platform and prior generations. The underlying architectural differences it describes are real and observable across the market, regardless of the terminology used to label them.