doi.org
Graph-Based Model-Agnostic Data Subsampling for Recommendation Systems
August 2023 • Xiaohui Chen, Jiankai Sun, Taiqing Wang, Ruocheng Guo, Liping Liu, Aonan Zhang
Data subsampling is widely used to speed up the training of large-scale recommendation systems. Most subsampling methods are model-based and often require a pre-trained pilot model to measure data importance via e.g. sample hardness. However, when the pilot model is misspecified, model-based subsampling methods deteriorate. Since model misspecification is persistent in real recommendation systems, we instead propose model-agnostic data subsampling methods by only exploring input data structure represented by graph…