Abstract
The counting task, which plays a fundamental role in numerous
applications (e.g., crowd counting, traffic statistics), aims to predict
the number of objects with various densities. Existing object counting
tasks are designed for a single object class. However, it is inevitable
to encounter newly coming data with new classes in our real world. We
name this scenario as
evolving object counting
. In this paper, we build the first evolving object counting dataset and
propose a unified object counting network as the first attempt to
address this task. The proposed network consists of two key components: a
class-agnostic mask module and a class-incremental module. The
class-agnostic mask module learns generic object occupation prior by
predicting a class-agnostic binary mask (e.g., 1 denotes there exists an
object at the considering position in an image and 0 otherwise). The
class-incremental module is used to handle new classes and provides
discriminative class guidance for density map prediction. The combined
outputs of the class-agnostic mask module and image feature extractor
are used to predict the final density map. When new classes arrive, we
first add new neural nodes to the last regression and classification
layers of the class-incremental module. Then, instead of retraining the
model from scratch, we utilize knowledge distillation to help the model
retain and consolidate what it has previously learned. We also employ a
support sample bank to store a small number of typical training samples
for each class, which are used to prevent the model from forgetting key
information from old data. With this design, our model can efficiently
and effectively adapt to new classes while maintaining good performance
on already-seen data without large-scale retraining. Extensive
experiments on the collected dataset demonstrate favorable performance.
The dataset and code will be available at:
https://github.com/Tanyjiang/EOCO.
| Original language | English |
|---|---|
| Pages (from-to) | 1147-1158 |
| Number of pages | 12 |
| Journal | IEEE Transactions on Circuits and Systems for Video Technology |
| Volume | 34 |
| Issue number | 2 |
| Early online date | 3 Jul 2023 |
| DOIs | |
| Publication status | Published - Feb 2024 |
| Externally published | Yes |