TY - JOUR
T1 - Swarm intelligence for detecting interesting events in crowded environments
AU - Kaltsa, Vagia
AU - Briassouli, Alexia
AU - Kompatsiaris, Ioannis
AU - Hadjileontiadis, Leontios J.
AU - Strintzis, Michael Gerasimos
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/7/1
Y1 - 2015/7/1
N2 - This paper focuses on detecting and localizing anomalous events in videos of crowded scenes, i.e., divergences from a dominant pattern. Both motion and appearance information are considered, so as to robustly distinguish different kinds of anomalies, for a wide range of scenarios. A newly introduced concept based on swarm theory, histograms of oriented swarms (HOS), is applied to capture the dynamics of crowded environments. HOS, together with the well-known histograms of oriented gradients, are combined to build a descriptor that effectively characterizes each scene. These appearance and motion features are only extracted within spatiotemporal volumes of moving pixels to ensure robustness to local noise, increase accuracy in the detection of local, nondominant anomalies, and achieve a lower computational cost. Experiments on benchmark data sets containing various situations with human crowds, as well as on traffic data, led to results that surpassed the current state of the art (SoA), confirming the method's efficacy and generality. Finally, the experiments show that our approach achieves significantly higher accuracy, especially for pixel-level event detection compared to SoA methods, at a low computational cost.
AB - This paper focuses on detecting and localizing anomalous events in videos of crowded scenes, i.e., divergences from a dominant pattern. Both motion and appearance information are considered, so as to robustly distinguish different kinds of anomalies, for a wide range of scenarios. A newly introduced concept based on swarm theory, histograms of oriented swarms (HOS), is applied to capture the dynamics of crowded environments. HOS, together with the well-known histograms of oriented gradients, are combined to build a descriptor that effectively characterizes each scene. These appearance and motion features are only extracted within spatiotemporal volumes of moving pixels to ensure robustness to local noise, increase accuracy in the detection of local, nondominant anomalies, and achieve a lower computational cost. Experiments on benchmark data sets containing various situations with human crowds, as well as on traffic data, led to results that surpassed the current state of the art (SoA), confirming the method's efficacy and generality. Finally, the experiments show that our approach achieves significantly higher accuracy, especially for pixel-level event detection compared to SoA methods, at a low computational cost.
KW - anomaly
KW - crowd
KW - swarm intelligence
KW - traffic
UR - http://www.scopus.com/inward/record.url?scp=84927643408&partnerID=8YFLogxK
U2 - 10.1109/TIP.2015.2409559
DO - 10.1109/TIP.2015.2409559
M3 - Article
AN - SCOPUS:84927643408
SN - 1057-7149
VL - 24
SP - 2153
EP - 2166
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 7
M1 - 7055905
ER -