Neuro-Symbolic Fraud Detection: Catching Concept Drift Before F1 Drops (Label-Free)

For three windows, every metric looked perfect.
RWSS = 1.000. Output probabilities unchanged. No labels moved.
Everything said “all clear.”

Then the alert fired anyway.

Window 3: severity=warning RWSS=1.000 fired=True ← FIDI Z fires here

The model’s predictions didn’t know anything was wrong yet.
But the symbolic layer did.

This is what actually happened in the experiment — and why it matters for anyone running fraud models in production.

Full code: https://github.com/Emmimal/neuro-symbolic-drift-detection

TL;DR: What You Will Get From This Article

  • FIDI Z-Score detects concept drift in 5 of 5 seeds, sometimes before F1 drops, with zero labels required
  • RWSS alone missed 3 of 5 seeds. A Z-score extension of FIDI is what makes it work
  • Covariate drift is a complete blind spot. It needs a separate raw-feature monitor
  • The alert system is ~50 lines of code and the difference between a scheduled retrain and an emergency one

The Story So Far

This is Part 3 of a series. New here? One paragraph is all you need.

A HybridRuleLearner trains two parallel paths: an MLP for detection and a rule path that learns symbolic IF-THEN conditions from the same data. The rule path found V14 by itself across two seeds, without being told to search for it. That learned rule (IF V14 < −1.5σ → Fraud) is now the thing being monitored. This article asks what happens when V14 starts behaving differently.
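As a sketch, that learned rule is a one-line predicate (in the real model the −1.5σ threshold is a learned parameter, not a constant):

```python
def learned_rule_v14(v14_sigma: float) -> bool:
    """The rule the model learned in Part 2: IF V14 < -1.5 sigma THEN fraud."""
    return v14_sigma < -1.5

print(learned_rule_v14(-2.0))  # True: flagged as fraud
print(learned_rule_v14(0.3))   # False: legitimate
```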

Can the rules act as a canary? Can neuro-symbolic concept drift monitoring work at inference time, without labels?

Three Ways Fraud Can Change

Concept drift fraud detection is harder than it sounds because only one of the three common drift types actually changes what the model’s learned associations mean. The experiment simulates three varieties of drift on the Kaggle Credit Card Fraud dataset (284,807 transactions, 0.17% fraud rate) across 8 progressive windows each [9].

Covariate drift. The input feature distributions shift. V14, V4, and V12 move by up to +3.0σ progressively. Fraud patterns stay the same. The world just looks a little different.

Prior drift. The fraud rate increases from 0.17% toward 2.0%. Features are unchanged. Fraud becomes more common.

Concept drift. The sign of V14 is gradually flipped for fraud cases across 8 windows. By the end, the transactions the model learned to flag as fraud now look like legitimate ones. The rule IF V14 < −1.5σ → Fraud is now pointing in the wrong direction.

That third one is the one that should worry you in production. With covariate and prior drift, there are external signals. Input distributions shift, or fraud rates visibly change. You can monitor those independently. Concept drift leaves no such footprint. The only thing that changes is what the model’s learned associations mean. You will not know until F1 starts falling.

Unless something sees it first.

The Problem With the First Three Metrics

The model from How a Neural Network Learned Its Own Fraud Rules: A Neuro-Symbolic AI Experiment produced three label-free monitoring signals as a by-product of the symbolic layer. The idea: if the rules are learning fraud patterns, changes in how those rules fire should reveal when fraud patterns are shifting.

I expected the first one to be the early warning. It was not.

The problem is specific to how this model trains. All five seeds converged between epochs 3 and 10 (Val PR-AUC: 0.7717, 0.6915, 0.6799, 0.7899, 0.7951), when temperature τ is still between 3.5 and 4.0. At that temperature, rule activations are soft. Every input produces a near-identical activation score regardless of its actual features. In plain terms: the rules were firing almost the same way on every transaction, clean or drifted. A similarity metric on near-constant vectors returns 1.000 almost all the time. The first signal only fired in 2 of 5 seeds for concept drift, and in both cases it was the same window as F1 or later.

Why hot temperature makes monitoring harder

The LearnableDiscretizer uses a sigmoid gated by temperature τ: σ((x − θ) / τ). At τ = 5.0 (epoch 0), that sigmoid is nearly flat — every feature value produces an activation near 0.5 no matter where it sits relative to the learned threshold. As τ anneals toward 0.1, the sigmoid sharpens into a near-binary step. Early stopping fires at τ ≈ 3.5–4.0 — before the rules have fully crystallised. The result: activation vectors are near-constant across all inputs, so any similarity metric between them stays near 1.000 even when fraud patterns are genuinely shifting.
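The flattening is easy to see numerically. A minimal sketch of the gate, with θ = 0 and illustrative feature values:

```python
import numpy as np

def soft_activation(x, theta=0.0, tau=1.0):
    """Sigmoid gate from the LearnableDiscretizer: sigma((x - theta) / tau)."""
    return 1.0 / (1.0 + np.exp(-(x - theta) / tau))

x = np.linspace(-3.0, 3.0, 7)        # feature values around the threshold

hot  = soft_activation(x, tau=4.0)   # roughly the early-stopped checkpoint
cold = soft_activation(x, tau=0.1)   # fully annealed

# Hot: every activation sits in a narrow band around 0.5.
# Cold: activations jump from ~0 to ~1 across the threshold.
print(hot.round(3))
print(cold.round(3))
```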

The second signal had the opposite problem. The absolute change in any feature’s contribution is tiny (values in the 0.001–0.005 range) because the rule weights themselves are small at an early-stopped checkpoint. In plain terms: the signal was real but invisible at the scale we were measuring it. A fixed absolute threshold of 0.02 never fires.

Here's what those three original signals are:

  • RWSS (Rule Weight Stability Score): cosine similarity between the baseline mean rule activation vector and the current one. In simple terms: are the rules still firing the same way they did on clean data?
  • FIDI (Feature Importance Drift Index): how much each feature’s contribution to rule activations has changed from the baseline. In simple terms: has any specific feature become more or less important to the rules?
  • RFR (Rule Firing Rate): what fraction of transactions fire each rule.
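A minimal sketch of the RWSS computation on synthetic activation matrices (the shapes and values are illustrative, not the repository implementation):

```python
import numpy as np

def compute_rwss(baseline_acts, current_acts):
    """RWSS sketch: cosine similarity between the mean rule-activation
    vector of the baseline window and that of the current window."""
    b = baseline_acts.mean(axis=0)
    c = current_acts.mean(axis=0)
    return float(np.dot(b, c) / (np.linalg.norm(b) * np.linalg.norm(c) + 1e-12))

rng = np.random.default_rng(0)
baseline = rng.uniform(0.4, 0.6, size=(1000, 8))  # soft activations near 0.5
same     = rng.uniform(0.4, 0.6, size=(1000, 8))  # statistically identical window
shifted  = baseline.copy()
shifted[:, 0] *= 0.5                              # one rule half-silenced

print(compute_rwss(baseline, same))     # ~1.000: pattern unchanged
print(compute_rwss(baseline, shifted))  # drops below 1: pattern changed
```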

That diagnosis led to the right question. Instead of asking “has FIDI changed by more than X?”, the right question is “has FIDI changed by more than X relative to its own history?”

That question has a different answer. And the answer is V14.

The Metrics: Building a Label-Free Drift Detection System

Three new metrics joined the original three.

RWSS Velocity measures the per-window rate of change: RWSS[w] − RWSS[w−1]. A sudden drop of more than 0.03 per window fires an alert even before the absolute value crosses the threshold. If RWSS is falling at −0.072 in one step, that is a signal regardless of where it started.

FIDI Z-Score is the one that actually worked. Rather than a brand-new signal, it is a simple extension of FIDI using Z-score normalisation against the feature’s own window history. Instead of asking whether the absolute change crosses a fixed threshold, it asks whether the change is anomalous relative to what that feature has been doing. Unlike traditional drift detection methods that rely on input distributions or output labels, this approach operates purely on the symbolic layer, which means it works at inference time, with no ground truth required. It builds on differentiable rule-learning work including ∂ILP [3], FINRule [4], RIFF [5], and Neuro-Symbolic Rule Lists [6], extending those representations with Z-score normalisation rather than fixed thresholds. V14’s contribution to rule activations during the clean baseline windows is small and flat. Near zero, stable, predictable. When concept drift begins at window 3, it shifts. Not by much in absolute terms. But by 9.53 standard deviations relative to the history it built during stable windows. That is an enormous relative anomaly, and no threshold calibration is required to catch it.

PSI on Rule Activations was designed to catch distributional shift in the symbolic layer before the MLP’s compensation masks it at the output level. It did not work here. The soft activations from early-stopped training (τ ≈ 3.5–4.0 at the saved checkpoint) cluster near 0.5, producing near-uniform distributions that PSI cannot distinguish. PSI_rules = 0.0049 throughout the entire experiment. PSI_rules never fired. It is in the codebase for when models with fully crystallised rules (τ < 0.5) are available. In this experiment it contributed nothing.

The intended detection order, from earliest to latest:

RWSS Velocity → FIDI Z-Score → PSI(rules) → RWSS absolute → F1 (label-based)

Here's what actually happened.

Results: What Each Metric Did

Concept Drift

Seed   F1 fires   RWSS fires     VEL fires      FIDIZ fires         PSIR fires
42     W3         W4 (1w late)   W4 (1w late)   W3 (simultaneous)   -
0      W3         -              -              W3 (simultaneous)   -
7      W4         W4 (simult.)   W4 (simult.)   W3 (+1w early)      -
123    W3         -              -              W3 (simultaneous)   -
2024   W4         -              -              W3 (+1w early)      -

FIDI Z-Score fires in 5 of 5 seeds, always at window 3. F1 fires at W3 in three seeds and W4 in two. The mean FIDIZ detection lag is +0.40 windows, meaning it leads F1 on average. In seeds 7 and 2024 it fires one full window before F1 drops. In the remaining three seeds it fires simultaneously. It never fires after F1 for concept drift. Not once.

Across all drift types, FIDI Z-Score is the only metric that detected concept drift in every seed and never lagged behind F1. For label-free drift detection in fraud systems, that is the headline result.

RWSS stays flat while F1 is already falling. The symbolic layer’s activation pattern holds steady through W0–W3 — it registers no change while F1 is already on its way down. The drop to 0.923 at W4 confirms the drift one window after F1 crosses its alert threshold. This is why RWSS alone is insufficient for early concept drift detection. Image by Author.

RWSS fires in 2 of 5 seeds and in both cases simultaneously with or after F1. Velocity matches RWSS exactly, same window, every time. PSI on rule activations never fires at all.

Concept Drift vs Covariate Drift: Why Symbolic Monitoring Has Blind Spots

Covariate drift is where the symbolic layer goes completely silent.

Every symbolic metric: 0 of 5 seeds. Not one signal. Not one window. F1 eventually fires in 4 of 5 seeds at W6 or W7, slowly and late, and the symbolic layer had nothing to do with it. This is not a gap that better tuning will close. It is a fundamental property of what the symbolic layer measures.

The reason is mechanical. When V14, V4, and V12 shift by +3.0σ, the shift is uniform across all samples. The learnable discretizer computes thresholds relative to the data. Each sample still lands in roughly the same threshold bin relative to its neighbours. Rules fire on roughly the same proportion of transactions. Nothing in the activation pattern changes. Cosine similarity of mean activations stays at 1.0.

In simple terms: if every transaction shifts by the same amount, the rules still see the same relative picture. Transaction A was above the threshold before. It is still above the threshold after. The fraud-vs-legitimate ordering is preserved. RWSS measures that ordering, not the absolute values. Think of it as a tide that lifts all boats equally. The boats stay in the same order. RWSS only measures the order.
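The tide analogy can be checked in a few lines (synthetic data standing in for V14):

```python
import numpy as np

rng = np.random.default_rng(42)
v14 = rng.normal(0.0, 1.0, size=1000)   # clean window
drifted = v14 + 3.0                     # uniform +3 sigma covariate shift

# The relative ordering of transactions is untouched by a uniform shift...
assert np.array_equal(np.argsort(v14), np.argsort(drifted))

# ...and standardised against its own window, the feature looks identical.
z_clean   = (v14 - v14.mean()) / v14.std()
z_drifted = (drifted - drifted.mean()) / drifted.std()
print(np.abs(z_clean - z_drifted).max())  # ~0, up to float error
```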

If covariate drift is a concern in your deployment, you need a separate input-space monitor: PSI on raw features, a KS test on V14, or a data quality check. The symbolic layer cannot help you there. Symbolic layer drift monitoring has one blind spot, and covariate shift is it.

RWSS and F1 scores across 8 windows under covariate drift. RWSS stays at exactly 1.000 throughout all windows. F1 mean gradually declines from W2 onward, crossing the alert threshold near W6–W7.
RWSS sees nothing. Shifting V14, V4, and V12 by up to 3σ has no effect on the symbolic layer — the green line never moves from 1.000. The rules still fire on the same relative proportion of transactions because the shift is uniform across all samples. F1 eventually drifts below threshold late in the sequence, but no symbolic metric was there to warn you. Image by Author.

Prior Drift

FIDIZ fires in 5 of 5 seeds, always at W3. But prior drift causes F1 to drop at W0 (seed 123) or W2 (seed 2024) in the two seeds where F1 fires at all. FIDIZ detection lag for prior drift: −2.00 windows. It fires two windows after F1.

This is not a calibration problem. FIDIZ needs at least three clean windows to build a history before its Z-score is meaningful. Prior drift that causes an immediate fraud rate jump is already visible in F1 before FIDIZ can even start computing. A rolling fraud rate counter will always be faster here.

RWSS and F1 scores across 8 windows under prior drift. RWSS stays at 1.000 throughout. F1 mean sits around 0.63–0.65, staying above the alert threshold for all 8 windows in the mean.
Prior drift — increasing the fraud rate from 0.17% to 2% — produces no symbolic signal whatsoever. RWSS stays at 1.000 because the rule activation patterns don’t change when only the class balance shifts. The mean F1 actually stays above the alert threshold here, though two individual seeds (visible in the faint traces reaching into the 0.70–0.85 range) do show performance degradation. This is the case where you need a label distribution monitor, not a rule activation monitor. Image by Author.

The Alert Demo: Window 3

Here is the moment the whole system was built for.

DriftAlertSystem is built once from the validation set immediately after training. It stores the baseline. Then .check() is called on each new window. No labels. No retraining. This is inference-only drift detection: the system reads the symbolic layer and nothing else.

Seed 42, concept drift, 8 windows:

Window 0: severity=none      RWSS=0.999  fired=False
Window 1: severity=none      RWSS=0.999  fired=False
Window 2: severity=none      RWSS=0.999  fired=False
Window 3: severity=warning   RWSS=1.000  fired=True   ← FIDI Z fires here
Window 4: severity=critical  RWSS=0.928  fired=True   ← RWSS absolute confirms
Window 5: severity=warning   RWSS=0.928  fired=True
Window 6: severity=warning   RWSS=0.928  fired=True
Window 7: severity=warning   RWSS=0.928  fired=True

At window 3, RWSS is exactly 1.000. The activation pattern is perfectly identical to baseline. Output probabilities have not changed. Nothing in the standard monitoring stack has moved.

And the alert fires at WARNING severity.

The reason is V14. Its Z-score is −9.53. That means V14’s contribution to rule activations has shifted to nearly 10 standard deviations below the baseline it established during clean windows. The model’s output does not know yet. The MLP is compensating. But the rule path cannot compensate. It was trained to express a fixed symbolic relationship. It is screaming.

One window later, the MLP stops holding. RWSS drops to 0.928. Velocity falls 0.072 in one step. Severity escalates to CRITICAL.

═══════════════════════════════════════════════════════
  DRIFT ALERT  |  severity: CRITICAL
  Earliest signal: VELOCITY
═══════════════════════════════════════════════════════
  ── Early-Warning Layer ─────────────────────────────
  RWSS Velocity : -0.0720  [threshold -0.03]  ⚠ FIRED
  FIDI Z-Score   : ⚠ FIRED
       V14  Z = -9.53
  PSI (rules)   : 0.0049  [moderate≥0.10]  stable
  ── Confirmed Layer ─────────────────────────────────
  RWSS absolute : 0.9276  [threshold 0.97]  ⚠ FIRED
  Rules gone silent: 0  OK
  Mean RFR change  : -0.001
  Recommended action:
    → Retrain immediately. Do not deploy.
═══════════════════════════════════════════════════════

The report names VELOCITY as the earliest layer. That is a priority order in the internal logic. In actual window timing, FIDI Z-Score fired one window earlier, at W3. The W3 WARNING is the earlier human-facing alert. The one that gives you time to act before the CRITICAL fires.

5×3 grid showing alert timelines for 5 seeds across covariate, prior, and concept drift. Green circles mark RWSS alerts; orange squares mark F1 alerts. Green circles appear only in the Concept column for seeds 7 and 42, at W4, one step to the right of the F1 alert.
Every cell in this grid is one seed-drift combination. Grey dots are silent windows. An orange square is an F1 alert. A green circle is an RWSS alert. Look at the Concept column: seeds 7 and 42 each show a green RWSS circle at W4, with the F1 square one position to the left at W3 — that is the 1-window lag. Seeds 0, 123, and 2024 show no green circle at all, only an orange F1 square. Covariate and prior columns show no RWSS circles anywhere. FIDI Z-Score alerts (not shown in this figure) fired at W3 across all five concept-drift seeds. Image by Author.

Why FIDI Z-Score Sees It Before F1 Does

The model has two paths running in parallel from the same input.

The MLP path carries 88.6% of the final output (mean α = 0.886 across seeds; α is the learned mix weight; 0.886 means the neural network does 88.6% of the prediction work and the symbolic rules do the remaining 11.4%). When concept drift gradually reverses V14’s relationship to fraud labels, the MLP, trained on 284,000 transactions, partially absorbs that change. Its internal representations shift. Output probabilities stay roughly stable for at least one window. That is the MLP compensating.

The rule path carries 11.4%. It was trained to express the MLP’s knowledge in symbolic form: V14 below a threshold means fraud [2]. That relationship is fixed and explicit. When V14 flips sign for fraud cases, the rule’s V14 contribution does not adjust. It simply stops working. The bit activations for V14 change direction. The rule starts firing on the wrong transactions.
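A toy sketch of that split, using the reported mean α = 0.886 (the individual scores below are made-up numbers for illustration, not model outputs):

```python
def hybrid_output(mlp_score, rule_score, alpha=0.886):
    """Blend of the two paths with the learned mix weight alpha.
    alpha = 0.886 is the mean reported across seeds."""
    return alpha * mlp_score + (1.0 - alpha) * rule_score

# Clean window: both paths agree the transaction is fraud.
clean = hybrid_output(mlp_score=0.90, rule_score=0.95)
# Drifted window: the MLP compensates and holds, the fixed rule collapses.
drifted = hybrid_output(mlp_score=0.90, rule_score=0.10)

print(round(clean, 3), round(drifted, 3))
# The blended output moves far less than the rule path does.
```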

The neural network adapts. The symbolic layer does not. And that is exactly why the symbolic layer detects the drift first.

That asymmetry is what FIDI Z-Score exploits.

The absolute change in V14’s contribution is tiny (values in the 0.001 to 0.005 range) because rule weights are small at an early-stopped checkpoint. A fixed absolute threshold never catches it.

FIDI heatmap for concept drift across 7 features (V14, V12, V4, V11, V10, V17, V3) and 8 time windows. All cells are uniform grey throughout, indicating near-zero absolute FIDI values across all features and all windows.
This blank heatmap is the whole point. Every cell is the same neutral grey — meaning the absolute FIDI values for all seven features across all eight windows are effectively zero. The colour scale runs from −0.30 to +0.30, but nothing registers. This is why a fixed absolute threshold (0.02) never fires, and why Z-score normalisation against each feature’s own history is necessary. V14’s contribution is collapsing — but only relative to itself, not in absolute terms the scale can show. Image by Author.

But V14’s history during the clean windows is just as flat. When concept drift moves it at window 3, the Z-score is −9.53. Same pattern: near-zero absolute change, extreme relative shift.
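A tiny numeric sketch of the same effect, with illustrative FIDI values in the 0.001–0.005 range mentioned above:

```python
import numpy as np

# Illustrative FIDI history for V14 during clean windows: tiny and flat.
history = np.array([0.0010, 0.0012, 0.0011, 0.0013])
current = 0.0050   # drift onset: still tiny in absolute terms

z = (current - history.mean()) / history.std()
print(z)  # well past the 2.5 alert threshold despite a ~0.004 absolute change
```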

The symbolic layer compensates less than the MLP, so it shows the drift first. FIDI Z-Score makes the signal visible by comparing each feature not to a fixed threshold, but to its own history.

But this only holds for one of the three drift types. The other two are a different story entirely.

What This System Cannot Do

A system that claims early warning invites overstatement. Here is what the data actually says. This is label-free anomaly-detection fraud monitoring, which means the limitations are structural, not tunable.

Covariate drift is a complete blind spot. 0 of 5 seeds. The mechanism is explained in the Results section above. Use PSI on raw features or a KS test on V14 instead.

FIDIZ fires late on prior drift by design. When the fraud rate jumps, F1 reacts at W0 or W2. FIDIZ structurally cannot fire before W3. It needs history that does not yet exist. A rolling fraud rate monitor responds faster.

PSI on rule activations produced nothing. PSI_rules = 0.0049 throughout every window of every seed. Soft activations from early-stopped training cluster near 0.5, and PSI on near-uniform distributions is insensitive regardless of what is actually happening. This metric is in the codebase and may work with fully annealed models (τ < 0.5). In this experiment it was silent.

Five seeds is evidence, not proof. FIDIZ fires at W3 for concept drift across all five seeds. That is consistent and encouraging. It is not the same as reliable in production across datasets, fraud types, and drift severities you have not tested. Five seeds is a starting point, not a conclusion. More seeds, more drift configurations, and real-world validation are needed before strong deployment claims.

Results Summary

The pattern is clearest when stated plainly first. Think of this as an early-warning concept drift system with three distinct modes depending on what is changing. Covariate drift: the symbolic layer saw nothing, F1 caught it slowly. Prior drift: the symbolic layer fired after F1, not before. Concept drift: FIDI Z-Score fired in every seed, always at or before F1, averaging +0.40 windows of lead time.

Drift type   F1 fired   RWSS fired   FIDIZ fired   FIDIZ mean lag
Covariate    4/5        0/5          0/5           -
Prior        2/5        0/5          5/5           −2.00w (late)
Concept      5/5        2/5          5/5           +0.40w (early)

Three-panel chart comparing RWSS versus F1 across covariate, prior, and concept drift. Left panel: RWSS flat blue line, F1 slowly declining red line. Centre panel: RWSS flat amber line, F1 stable. Right panel: RWSS drops sharply at W4, F1 collapses at W3.
Three drift types, three different stories. Covariate drift (left): RWSS never moves, F1 slowly degrades. Prior drift (centre): RWSS never moves, F1 stays stable on average despite high seed variance. Concept drift (right): RWSS holds until W4 then drops to ~0.91 — but F1 has already fallen sharply at W3. This panel is why RWSS alone is not enough, and why FIDI Z-Score — which fires at W3 in all five seeds — is needed as the leading signal. Image by Author.

Building It

The system is designed to be used in production, not just in a notebook.

# Once, immediately after training
X_val_t      = torch.FloatTensor(X_val)
alert_system = DriftAlertSystem.from_trained_model(model, X_val_t, feature_names)
alert_system.save("results/drift_alert_baseline_seed42.pkl")

# Every scoring run — weekly, daily, per-batch
alert_system = DriftAlertSystem.load("results/drift_alert_baseline_seed42.pkl")
alert = alert_system.check(model, X_this_week)

if alert.fired:
    print(alert.report())

No labels. No retraining. No infrastructure beyond saving a pickle file next to the model checkpoint. The .check() call computes RWSS velocity, FIDI Z-Score, PSI on activations, and RWSS absolute in that order, using PyTorch [7] and scikit-learn [8]. Severity escalates from none to warning to critical based on how many signals fire and how far RWSS has dropped.
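The escalation can be sketched from the demo trace above. This is one consistent reading of that trace, not the actual DriftAlertSystem internals:

```python
def severity(vel_fired, fidi_z_fired, rwss_abs_fired):
    """Illustrative escalation: a sharp velocity drop together with a
    confirmed RWSS breach is CRITICAL; any single signal is WARNING."""
    if vel_fired and rwss_abs_fired:
        return "critical"
    if vel_fired or fidi_z_fired or rwss_abs_fired:
        return "warning"
    return "none"

print(severity(False, False, False))  # W0-W2: none
print(severity(False, True,  False))  # W3: warning (FIDI Z only)
print(severity(True,  False, True))   # W4: critical (velocity + absolute)
print(severity(False, False, True))   # W5-W7: warning (absolute only)
```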

The three early-warning computations are each a few lines.

RWSS Velocity: rate of change per window.

from typing import List

def compute_rwss_velocity(rwss_history: List[float]) -> float:
    if len(rwss_history) < 2:
        return 0.0
    return float(rwss_history[-1] - rwss_history[-2])

# Alert fires when drop > 0.03 per window
vel_fired = rwss_velocity < -0.03

FIDI Z-Score: normalise each feature’s contribution change against its own history.

import numpy as np

def compute_fidi_zscore(fidi_history, current_fidi, min_history=3):
    if len(fidi_history) < min_history:
        return {k: 0.0 for k in current_fidi}
    z_scores = {}
    for feat_idx, current_val in current_fidi.items():
        history_vals = [h.get(feat_idx, 0.0) for h in fidi_history]
        mean_h = np.mean(history_vals)
        std_h  = np.std(history_vals)
        z_scores[feat_idx] = (current_val - mean_h) / std_h if std_h > 1e-8 else 0.0
    return z_scores

# Alert fires when any feature Z > 2.5
fidi_z_fired = any(abs(z) > 2.5 for z in z_scores.values())

PSI on Rule Activations: distributional shift in the symbolic layer (included for completeness).

import numpy as np

def compute_psi_rules(baseline_acts, current_acts, n_bins=10):
    bins = np.linspace(0, 1, n_bins + 1)
    psi_per_rule = []
    for r in range(baseline_acts.shape[1]):
        b = np.histogram(baseline_acts[:, r], bins=bins)[0] + 1e-6
        c = np.histogram(current_acts[:,  r], bins=bins)[0] + 1e-6
        b /= b.sum(); c /= c.sum()
        psi_per_rule.append(float(np.sum((c - b) * np.log(c / b))))
    return np.mean(psi_per_rule)

V14: Three Articles, One Feature

This is the part I did not plan. But V14’s concept-drift behaviour turns out to be the thread that ties all three articles together.

Guiding Neural Networks with Domain Rules: I wrote rules about large transaction amounts and anomalous PCA norms. Reasonable intuitions. Nothing to do with V14.

How a Neural Network Learned Its Own Fraud Rules: The model found V14 anyway. Given 30 anonymised features and no guidance, the gradient landed on the one feature with the highest absolute correlation to fraud. Twice, across two independent seeds.

In this article, I deliberately made V14 break. I flipped its sign for fraud cases, gradually, across 8 windows. And FIDI Z-Score registered the collapse at −9.53 standard deviations while RWSS was still 1.000 and F1 had not moved.

V14 FIDI absolute values across 8 time windows under concept drift. Mean line (dark green) runs flat near zero throughout. FIDI alert threshold at 0.20 is shown as a dotted line far above the data.
V14’s FIDI absolute value stays within 0.002 of zero across all 8 windows and all 5 seeds. The alert threshold of 0.20 sits completely out of reach at the top of the chart — the gap between the data and the threshold is the reason the original FIDI monitor never fires. What the Z-score catches instead is that even this near-zero line has a stable history, and when V14’s contribution shifts by a tiny absolute amount at W3, that shift is −9.53 standard deviations from the baseline. The anomaly is in the relative change, not the absolute value. Image by Author.

The same feature, three different roles: ignored, discovered, then monitored as the first thing to fail. That coherence was not engineered. It is what reproducible multi-seed evaluation on a consistent dataset keeps producing.

What to Do With This

Use FIDI Z-Score for concept drift detection without labels. It fires in 5 of 5 seeds, requires only 3 windows of history, never fires after F1, and needs no labels. Keep the Z-score threshold at 2.5 and minimum history at 3 windows.

Add a separate input-space monitor for covariate drift. PSI on raw features or a KS test on critical features like V14. The symbolic layer is blind to distributional shifts that preserve relative activation order.
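A raw-feature monitor of the kind suggested here is a few lines with scipy (synthetic data standing in for V14):

```python
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)
baseline_v14 = rng.normal(0.0, 1.0, size=5000)   # clean reference window
drifted_v14  = rng.normal(3.0, 1.0, size=5000)   # +3 sigma covariate shift

stat, p_value = ks_2samp(baseline_v14, drifted_v14)
print(stat, p_value)  # large statistic, tiny p-value: obvious in input space
```

The same shift that leaves RWSS at 1.000 is trivially visible to a two-sample KS test on the raw feature.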

Use a rolling fraud rate counter for prior drift. FIDIZ structurally cannot fire before W3. A label-based rate counter fires at W0.
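A rolling fraud rate counter is equally small. This is a hypothetical helper, not part of the repository; the window, factor, and warmup values are illustrative:

```python
from collections import deque

class RollingFraudRate:
    """Label-based prior-drift monitor: alert when the rolling fraud
    rate exceeds a multiple of the baseline rate."""
    def __init__(self, baseline_rate=0.0017, window=10_000, factor=3.0, warmup=1_000):
        self.labels = deque(maxlen=window)
        self.count = 0
        self.baseline_rate = baseline_rate
        self.factor = factor
        self.warmup = warmup

    def update(self, label: int) -> bool:
        if len(self.labels) == self.labels.maxlen:
            self.count -= self.labels[0]          # drop the oldest label
        self.labels.append(label)
        self.count += label
        if len(self.labels) < self.warmup:        # avoid cold-start noise
            return False
        return self.count / len(self.labels) > self.factor * self.baseline_rate

monitor = RollingFraudRate()
fired = [monitor.update(1 if i % 50 == 0 else 0) for i in range(10_000)]
print(any(fired))   # a 2% fraud rate trips the 3x-baseline alarm

steady = RollingFraudRate()
quiet = [steady.update(1 if i % 600 == 0 else 0) for i in range(10_000)]
print(any(quiet))   # ~0.17% stays silent
```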

Build the alert baseline immediately after training. Not after drift is suspected. Do it after training. If you wait, you have already lost your clean reference point. Save it alongside the checkpoint file.

One window of early warning is real. Whether it is one week or one day depends on your scoring cadence. For many production fraud teams, the difference between a scheduled retrain and an emergency one is measured in exactly those units.

Three Things That Will Catch You Using This Concept Drift Early Warning System

The three-window blind period. FIDIZ has no history to work with for the first 3 windows after deployment. You are monitoring with RWSS and RFR only during that time. Plan for it explicitly.

Soft activations will silence PSI_rules. If your best checkpoint arrives when τ ≥ 1.0 (which happens whenever early stopping fires before training is complete), rule activations cluster near 0.5 and PSI_rules returns noise. Check τ at your saved checkpoint. In this experiment τ was still 3.5–4.0 at convergence. That is why PSI_rules was silent throughout.

Retrain means re-audit. This system is a fraud model retraining trigger, not a retrain replacement. After retraining, the rules change. V14 may no longer dominate, or new features may have entered. The compliance sign-off from the previous model does not carry forward. Build the audit into the retrain process, not as a step after, but as the step that closes the loop.

Closing

Three articles. One feature kept appearing.

Guiding Neural Networks with Domain Rules: I ignored it. How a Neural Network Learned Its Own Fraud Rules: The gradient found it. Article 3: When it broke, the symbolic layer noticed before the output layer did.

The experiment has a specific, honest scope: FIDI Z-Score detects concept drift in 5 of 5 seeds, sometimes one window before F1, never after it, entirely without labels. For covariate drift it is blind. For prior drift it is late. Those are not caveats added at the end to soften the claim. They are findings that tell you exactly where to use this and where not to.

A neuro-symbolic model gives you two channels. The MLP is better at prediction. The symbolic layer is better at knowing when prediction is about to go wrong. They are not redundant. They are watching different aspects of the same problem.

The MLP compensates. The symbolic layer cannot. That is its weakness. In this experiment, it turned out to also be its earliest warning.


Disclosure

This article is based on independent experiments using publicly available data (Kaggle Credit Card Fraud dataset, CC-0 Public Domain) and open-source tools (PyTorch, scikit-learn). No proprietary datasets, company resources, or confidential information were used. The results and code are fully reproducible as described. The views and conclusions expressed here are my own and do not represent any employer or organisation.

References

[1] Dal Pozzolo, A. et al. (2015). Calibrating Probability with Undersampling for Unbalanced Classification. IEEE SSCI. Dataset: https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud (CC-0)

[2] Alexander, E. P. (2026). Hybrid Neuro-Symbolic Fraud Detection: Guiding Neural Networks with Domain Rules. Towards Data Science. https://towardsdatascience.com/hybrid-neuro-symbolic-fraud-detection-guiding-neural-networks-with-domain-rules/

[3] Evans, R., & Grefenstette, E. (2018). Learning Explanatory Rules from Noisy Data. Journal of Artificial Intelligence Research, 61, 1–64. https://arxiv.org/abs/1711.04574

[4] Wolfson, B., & Acar, E. (2024). Differentiable Inductive Logic Programming for Fraud Detection. arXiv:2410.21928. https://arxiv.org/abs/2410.21928

[5] Martins, J. L., Bravo, J., Gomes, A. S., Soares, C., & Bizarro, P. (2024). RIFF: Inducing Rules for Fraud Detection from Decision Trees. In RuleML+RR 2024. arXiv:2408.12989. https://arxiv.org/abs/2408.12989

[6] Xu, S., Walter, N. P., & Vreeken, J. (2024). Neuro-Symbolic Rule Lists. arXiv:2411.06428. https://arxiv.org/abs/2411.06428

[7] Paszke, A. et al. (2019). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems 32. https://pytorch.org

[8] Pedregosa, F. et al. (2011). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825–2830. https://scikit-learn.org

[9] Gama, J. et al. (2014). A Survey on Concept Drift Adaptation. ACM Computing Surveys, 46(4). https://dl.acm.org/doi/10.1145/2523813

Code: https://github.com/Emmimal/neuro-symbolic-drift-detection
