
Remote Sensing, Vol. 14, Pages 3853: One-Shot Multiple Object Tracking in UAV Videos Using Task-Specific Fine-Grained Features (Remote Sensing)

 
 

9 August 2022 13:44:46

 
 


Multiple object tracking (MOT) in unmanned aerial vehicle (UAV) videos is a fundamental task with applications in many fields. MOT consists of two critical procedures: object detection and re-identification (ReID). One-shot MOT, which incorporates detection and ReID in a unified network, has gained attention due to its fast inference speed. It significantly reduces the computational overhead by having the two subtasks share features. However, most existing one-shot trackers struggle to achieve robust tracking in UAV videos. We observe that the essential difference between detection and ReID leads to an optimization contradiction within one-shot networks. To alleviate this contradiction, we propose a novel feature decoupling network (FDN) that converts shared features into detection-specific and ReID-specific representations. The FDN searches for characteristics and commonalities between the two tasks to synergize detection and ReID. In addition, existing one-shot trackers struggle to locate small targets in UAV videos. Therefore, we design a pyramid transformer encoder (PTE) to enrich the semantic information of the resulting detection-specific representations. By learning scale-aware fine-grained features, the PTE enables our tracker to locate targets in UAV videos accurately. Extensive experiments on the VisDrone2021 and UAVDT benchmarks demonstrate that our tracker achieves state-of-the-art tracking performance.


 
Category: Geology, Physics
 
 
 

