14.1.4 Bypassing Anti-Money Laundering (AML) Systems

Anti-Money Laundering (AML) systems represent one of the most data-intensive and high-stakes applications of AI in finance. They are no longer simple rule-based engines but complex ecosystems of machine learning models designed to detect needles of illicit activity in haystacks of legitimate transactions. As a red teamer, your objective is to determine if these sophisticated defenses can be systematically blinded, misled, or manipulated by an adversary who understands the underlying AI.

The Target Anatomy: Modern AI-AML Architectures

To bypass an AML system, you first need to understand its components. Modern systems are multi-stage pipelines, each presenting a potential attack surface. An adversary doesn’t need to break the entire system; compromising a single critical stage can be enough to create a reliable blind spot.

Do you have a question about AI security? You can reach us here:








Typical AI-AML System Architecture Data Ingestion (Transactions, KYC) Feature Engineering (Velocity, Network) Unsupervised Model (Anomaly Detection) Supervised Model (SAR Classification) Graph-Based Model (Network Analysis) Risk Aggregation & Alert Generation

  • Supervised Models: These are classifiers (e.g., Gradient Boosting, Neural Networks) trained on historical data, where transactions were labeled as suspicious or benign (often from filed Suspicious Activity Reports – SARs). Their weakness is their reliance on known patterns. They are poor at detecting novel laundering typologies.
  • Unsupervised Models: These systems (e.g., Isolation Forests, Autoencoders) hunt for outliers without pre-existing labels. They excel at finding novel anomalies but can be noisy, generating high false positives. Their vulnerability lies in an adversary’s ability to make illicit transactions appear statistically normal.
  • Graph-Based Models: Using Graph Neural Networks (GNNs), these models analyze the network of transactions, identifying suspicious structures like layering, smurfing rings, or mule accounts. They are powerful but can be confused by manipulating the graph’s topology.

Red Teaming Tactics for AML System Evasion

Your red teaming engagement should focus on emulating sophisticated adversaries who actively work to defeat the AI. This goes beyond simple rule-breaking and into the realm of adversarial machine learning.

Tactic 1: Adversarial Transaction Crafting (Evasion)

The goal here is to design a set of illicit transactions that, when processed by the feature engineering pipeline, produce a feature vector the model classifies as benign. This is a classic evasion attack tailored to the financial domain.

Imagine a model’s risk score is a function of `amount`, `frequency`, and `cross_border_flag`. In a white-box test, you could use gradient-based methods to find the smallest possible perturbation to a transaction to lower its risk score below the alert threshold. In a black-box scenario, this involves carefully probing the model.

# Pseudocode for black-box transaction optimization
function find_benign_transaction(initial_tx):
    tx = initial_tx.copy()
    max_iterations = 100
    
    for i in range(max_iterations):
        risk_score = aml_model.predict(tx)
        
        if risk_score < THRESHOLD:
            return tx  # Evasion successful
        
        # Propose a small, plausible change
        perturbed_tx = generate_perturbation(tx)
        
        # If the new transaction is less suspicious, keep it
        if aml_model.predict(perturbed_tx) < risk_score:
            tx = perturbed_tx
            
    return None # Failed to find a bypass

This attack simulates “structuring” or “smurfing” in an AI-aware manner, where the adversary isn’t just staying below a static $10,000 threshold, but actively optimizing against the model’s learned multi-dimensional decision boundary.

Tactic 2: Training Data Contamination (Poisoning)

A more insidious, long-term attack is data poisoning. This involves injecting carefully crafted data into the system’s training feedback loop. Most AML systems have a mechanism for analysts to label alerts as false positives. An adversary could exploit this to slowly teach the model that a specific type of malicious activity is actually benign.

Consider an attacker establishing a new money laundering typology using a novel cryptocurrency exchange. The initial transactions might trigger alerts, but if they are small and mixed with legitimate-looking activity, analysts might dismiss them. Over time, these “false positive” labels poison the retraining data, creating a blind spot for the model.

Example Poisoning Campaign Timeline
Phase Attacker Action Intended Analyst Response Model Impact
1. Seeding Execute numerous small, benign-looking transactions through a new shell corporation (ShellCorpX). Alerts are flagged as “low risk” or “false positives” due to low amounts. Model begins to associate ShellCorpX with normal activity.
2. Normalization Slowly increase transaction volume and amounts, mixing them with clean funds. Fewer alerts are generated as the model’s confidence in ShellCorpX’s legitimacy grows. The decision boundary shifts; the model’s feature space for ShellCorpX is now considered “benign.”
3. Exploitation Execute large, illicit transactions through the now-trusted ShellCorpX. No alert is generated. The poisoned model fails to detect the illicit flow, creating a durable bypass.

Tactic 3: Graph Topology Manipulation

Graph-based models are highly effective at spotting collusion and complex laundering networks. Therefore, a sophisticated adversary will focus on manipulating the graph structure itself. Instead of a simple chain (A → B → C), they create a complex, legitimate-looking network that obfuscates the flow of funds.

This attack involves creating many intermediary “distributor” accounts that break up large sums and mix them with other funds before reconsolidating them at the destination. From the GNN’s perspective, the core laundering path is hidden within a dense cluster of seemingly normal commercial activity.

Graph Manipulation Attack Before: Simple Chain (Detectable) A B C Illicit funds flow directly. After: Obfuscated Network (Evasive) A C M1 M2 M3 M4 Funds are diffused through mule accounts (M1-M4), hiding the link between A and C.

Defensive Countermeasures and Recommendations

Your role as a red teamer concludes with actionable recommendations. The goal is not just to break the system, but to provide a path toward hardening it.

  • Implement Ensemble Defenses: No single model is foolproof. A robust AML system should use an ensemble of diverse models. An attack that fools a supervised classifier might be caught by an anomaly detector, and a transaction that appears normal in isolation might be flagged by a GNN. The final risk score should be a weighted aggregate from multiple, independent detection mechanisms.
  • Introduce Adversarial Training: The blue team should proactively generate adversarial examples during the training phase. By using techniques like FGSM (Fast Gradient Sign Method) to create minimally perturbed suspicious transactions and explicitly labeling them as malicious, the model learns a more robust decision boundary, making it harder to bypass with simple evasion tactics.
  • Focus on Explainability (XAI): When an alert is generated, analysts should have access to tools (like SHAP or LIME) that explain *why* the model flagged the transaction. This serves two purposes: it helps analysts make better decisions, and it can reveal when the model is focusing on nonsensical or easily manipulated features, a potential sign of an ongoing or past poisoning attack.
  • Monitor for Concept Drift and Anomalous Data Patterns: An adversary’s attempt to poison or evade a model will change the statistical properties of the incoming data. Implement a meta-monitoring layer that watches for drifts in feature distributions. A sudden change in the average transaction patterns from a certain region, for example, could be an indicator of a coordinated attack and should trigger a model review.

Rácz-Akácosi Attila

AI Biztonsági Szakértő

Két évtized analitikai, elemzői háttérrel. 2017 óta foglalkozom mesterséges intelligenciával.
Az utóbbi években AI/LLM biztonságra és AI Red Teaming-re specializálódtam. 
Rendszerszintű gondolkozás hibalisták helyett.