CPP filter blind to distributed (jointly-strong) signals

Part of #336 (usability epic).

## Problem
CPP's feature filter ranks features by their **individual** `mean_dif`/`abs_auc`, so it is blind to
**distributed** signals — feature blocks that are individually weak but jointly decisive. Concrete case
from this project (iBCE-EL linear epitopes):

- Amino-acid composition: each amino acid's `abs_auc` ≈ 0.03 (≈ random), yet the 20 together give
  ROC-AUC ≈ 0.75.
- When CPP was given a **combined identity + physicochemical** scale set, the filter selected **0%
  identity** features (physicochemical scales score higher individually) and performance **collapsed
  0.75 → 0.57** — the winning signal was filtered out.

A diagnostic that catches this is the **marginal-vs-joint "lift"**: full-block model AUC minus the best
single-feature AUC. A large lift flags a distributed signal (AAC +0.21 vs. physicochemical +0.04).
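The lift could be computed along these lines. This is a minimal standard-library sketch, not CPP's implementation: `roc_auc`, `block_lift`, and the toy data are hypothetical names and values, and it assumes `abs_auc` means |AUC − 0.5|, so a single feature's AUC is taken as 0.5 + `abs_auc`. The joint scores are expected to come from any model fitted on the whole block, ideally as out-of-fold predictions.

```python
def roc_auc(scores, labels):
    """ROC-AUC via pairwise comparison of positives and negatives (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def block_lift(feature_columns, joint_scores, labels):
    """Joint AUC of the block minus the best direction-agnostic single-feature AUC."""
    # Assumption: abs_auc = |AUC - 0.5|, so the best single AUC is 0.5 + max(abs_auc).
    best_single = max(0.5 + abs(roc_auc(col, labels) - 0.5) for col in feature_columns)
    return roc_auc(joint_scores, labels) - best_single


# Toy XOR block: each feature alone is uninformative, both together are decisive.
labels = [1, 1, 0, 0]
f1 = [0, 1, 0, 1]
f2 = [1, 0, 0, 1]
joint = [a ^ b for a, b in zip(f1, f2)]  # stand-in for a joint model's predictions

lift = block_lift([f1, f2], joint, labels)
print(f"lift = {lift:.2f}")
```

In the toy case both features have `abs_auc` = 0 while the joint score separates the classes perfectly, which is the pattern a per-feature filter cannot see.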

## Suggestion
- **Document** this failure mode (a scale set can be jointly strong yet fully filtered out).
- Offer an optional **model-based / multivariate** selection (permutation or embedded importance) or a
  **block-level** evaluation (ties to #309), and/or **warn** when a scale group has high joint-vs-marginal
  lift but low per-feature scores.
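One possible shape for the suggested warning is sketched below. Everything here is hypothetical: the function name, its signature, and the default thresholds (`min_lift`, `max_abs_auc`) are illustrative assumptions, not part of the existing CPP API, and it again assumes `abs_auc` = |AUC − 0.5|.

```python
import warnings


def distributed_signal_warning(block_name, per_feature_abs_auc, joint_auc,
                               min_lift=0.10, max_abs_auc=0.05):
    """Warn when a scale group is jointly strong but every feature is individually weak.

    Returns the warning message, or None if the block does not look distributed.
    Thresholds are illustrative defaults and would need tuning.
    """
    best_abs_auc = max(per_feature_abs_auc)
    lift = joint_auc - (0.5 + best_abs_auc)  # joint AUC minus best single-feature AUC
    if lift >= min_lift and best_abs_auc <= max_abs_auc:
        msg = (f"Scale group '{block_name}' has joint-vs-marginal lift {lift:+.2f} "
               f"but max per-feature abs_auc {best_abs_auc:.2f}; the filter may "
               f"discard a distributed signal.")
        warnings.warn(msg, UserWarning, stacklevel=2)
        return msg
    return None


# Values rounded from the AAC case described above: 20 weak features, joint AUC ~0.75.
message = distributed_signal_warning("AAC", [0.03] * 20, 0.75)
```

Returning the message as well as emitting it keeps the check usable both interactively and inside a block-level evaluation report.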

