Loss-based

Model has lower loss on training data. Compute loss on candidate → below threshold → member. Simple but effective on undertrained models.

Advertisement

Shadow models

Train shadow models with known membership. Learn classifier on loss patterns. Apply to target model. Shokri et al 2017.

Advertisement

LLM-specific

Test perplexity on candidate. Compare to same-length random text. Lower relative → likely in training.