Skip to main navigation Skip to search Skip to main content

KMCT: k-Means Clustering of Trajectories efficiently in location-based services

Yuanjun Liu, Guanfeng Liu, Qingzhi Ma*, Zhixu Li, Shiting Wen, Lei Zhao, An Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

Abstract

With the widespread use of GPS devices and the advancement of location-based services, a vast amount of trajectory data has been collected and mined for various applications. Trajectory clustering, which categorizes trajectories into distinct groups, is the fundamental functionality of trajectory data mining. The challenge is how to cluster on a mass of trajectory data efficiently and universally with satisfying results. The raw trajectory clustering algorithms are universal, but trapped in the dilemma between efficiency and desirable results. Other approaches, such as density-based, road network-based, and deep learning-based algorithms, encounter issues like high time complexity, loss of trajectory integrity, reliance on road networks, and data quality during training. To tackle these challenges, we first propose the efficient KMCT (k-Means Clustering of Trajectories) algorithm based on a semantic interpolation transformation to cluster raw trajectories and achieve satisfying results. Additionally, we introduce the DA-KMCT (Density Accelerated k-Means Clustering of Trajectories) algorithm to further boost the clustering process based on trajectory densities and an optimized centroid selecting strategy. Moreover, we present a novel clustering evaluation method called IOD, which efficiently estimates clustering results on large-scale datasets with linear time complexity. Experimental results on real-world datasets demonstrate that KMCT and DA-KMCT outperform five related methods in terms of clustering quality and time efficiency, and the proposed IOD evaluation shows a strong correlation with the Silhouette Coefficient, offering a reliable and efficient alternative for evaluating clustering results.
Original languageEnglish
Title of host publicationCIKM ’24
Subtitle of host publicationproceedings of the 33rd ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery (ACM)
Pages1523-1532
Number of pages10
ISBN (Electronic)9798400704369
ISBN (Print)New York, NY
DOIs
Publication statusPublished - 2024
EventACM International Conference on Information and Knowledge Management (33rd : 2024) - Boise, United States
Duration: 21 Oct 202425 Oct 2024

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings
ISSN (Print)2155-0751

Conference

ConferenceACM International Conference on Information and Knowledge Management (33rd : 2024)
Abbreviated titleCIKM ’24
Country/TerritoryUnited States
CityBoise
Period21/10/2425/10/24

Keywords

  • spatial dataset
  • trajectory clustering
  • trajectory data mining

Fingerprint

Dive into the research topics of 'KMCT: k-Means Clustering of Trajectories efficiently in location-based services'. Together they form a unique fingerprint.

Cite this