MyJournals Home  

RSS FeedsAlgorithms, Vol. 12, Pages 29: A Distributed Execution Pipeline for Clustering Trajectories Based on a Fuzzy Similarity Relation (Algorithms)

 
 

24 january 2019 00:01:36

 
Algorithms, Vol. 12, Pages 29: A Distributed Execution Pipeline for Clustering Trajectories Based on a Fuzzy Similarity Relation (Algorithms)
 


The proliferation of indoor and outdoor tracking devices has led to a vast amount of spatial data. Each object can be described by several trajectories that, once analysed, can yield to significant knowledge. In particular, pattern analysis by clustering generic trajectories can give insight into objects sharing the same patterns. Still, sequential clustering approaches fail to handle large volumes of data. Hence, the necessity of distributed systems to be able to infer knowledge in a trivial time interval. In this paper, we detail an efficient, scalable and distributed execution pipeline for clustering raw trajectories. The clustering is achieved via a fuzzy similarity relation obtained by the transitive closure of a proximity relation. Moreover, the pipeline is integrated in Spark, implemented in Scala and leverages the Core and Graphx libraries making use of Resilient Distributed Datasets (RDD) and graph processing. Furthermore, a new simple, but very efficient, partitioning logic has been deployed in Spark and integrated into the execution process. The objective behind this logic is to equally distribute the load among all executors by considering the complexity of the data. In particular, resolving the load balancing issue has reduced the conventional execution time in an important manner. Evaluation and performance of the whole distributed process has been analysed by handling the Geolife project’s GPS trajectory dataset.


 
82 viewsCategory: Informatics
 
Algorithms, Vol. 12, Pages 30: An Exploration of a Balanced Up-Downwind Scheme for Solving Heston Volatility Model Equations on Variable Grids (Algorithms)
Algorithms, Vol. 12, Pages 28: FPGA Implementation of ECT Digital System for Imaging Conductive Materials (Algorithms)
 
 
blog comments powered by Disqus


MyJournals.org
The latest issues of all your favorite science journals on one page

Username:
Password:

Register | Retrieve

Search:

Informatics


Copyright © 2008 - 2024 Indigonet Services B.V.. Contact: Tim Hulsen. Read here our privacy notice.
Other websites of Indigonet Services B.V.: Nieuws Vacatures News Tweets Nachrichten