[Paper] SERSEM: Selective Entropy-Weighted Scoring for Membership Inference in Code Language Models
As Large Language Models (LLMs) for code increasingly utilize massive, often non-permissively licensed datasets, evaluating data contamination through Membershi...