Projects per year
Abstract
A standard network pretrained on in-distribution (ID) samples could make high-confidence predictions on out-of-distribution (OOD) samples, leaving the possibility of failing to distinguish ID and OOD samples in the test phase. To address this over-confidence issue, the existing methods improve the OOD sensitivity from modeling perspectives, i.e., retraining it by modifying training processes or objective functions. In contrast, this paper proposes a simple but effective method, namely Weighted Non-IID Batching (WNB), by adjusting batch weights. WNB builds on a key observation: increasing the batch size can improve the OOD detection performance. This is because a smaller batch size may make its batch samples more likely to be treated as non-IID from the assumed ID, i.e., associated with an OOD. This causes a network to provide high-confidence predictions for all samples from the OOD. Accordingly, WNB applies a weight function to weight each batch according to the discrepancy between batch samples and the entire training ID dataset. Specifically, the weight function is derived by minimizing the generalization error bound. It ensures that the weight function assigns larger weights to batches with smaller discrepancies and makes a trade-off between ID classification and OOD detection performance. Experimental results show that incorporating WNB into state-of-the-art OOD detection methods can further improve their performance.
Original language | English |
---|---|
Pages (from-to) | 7371-7391 |
Number of pages | 21 |
Journal | Machine Learning |
Volume | 113 |
Issue number | 10 |
DOIs | |
Publication status | Published - Oct 2024 |
Bibliographical note
Copyright the Author(s) 2024. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.Keywords
- dataset discrepancy
- non-IID
- out-of-distribution detection
Projects
- 3 Finished
-
DP19 Transfer to MQ: Deep analytics of non-occurring but important behaviours
Cao, L. & Kumar, V.
7/06/23 → 30/06/24
Project: Research
-
Deep Interaction Learning in Unlabelled Big Data and Complex Systems (FT190100734)
1/06/20 → 31/05/24
Project: Research
-
Deep analytics of non-occurring but important behaviours (DP190101079)
1/01/20 → 30/09/23
Project: Research