Conferencia

La versión final de este artículo es de uso interno de la institución.
Consulte el artículo en la página del editor

Abstract:

Action detection was formulated as a subvolume mutual information maximization problem in [8], where each subvolume identifies where and when the action occurs in the video. Despite the fact that the proposed branch-and-bound algorithm can find the best subvolume efficiently for low resolution videos, it is still not efficient enough to perform multiinstance detection in videos of high spatial resolution. In this paper we develop an algorithm that further speeds up the subvolume search and targets on real-time multi-instance action detection for high resolution videos (e.g. 320 × 240 or higher). Unlike the previous branch-and-bound search technique which restarts a new search for each action instance, we find the Top-K subvolumes simultaneously with a single round of search. To handle the larger spatial resolution, we downsample the volume of videos for a more efficient upperbound estimation. To validate our algorithm, we perform experiments on a challenging dataset of 54 video sequences where each video consists of several actions performed by different people in a crowded environment. The experiments show that our method is not only efficient, but also capable of handling action variations caused by performing speed and style changes, spatial scale changes, as well as cluttered and moving background. © 2010 IEEE.

Registro:

Documento: Conferencia
Título:Efficient search of Top-K video subvolumes for multi-instance action detection
Autor:Goussies, N.A.; Liu, Z.; Yuan, J.
Ciudad:Singapore
Filiación:DC - FCEyN, Univ. de Buenos Aires, Argentina
Microsoft Research, Redmond, WA, United States
School of EEE, Nanyang Technological University, Singapore, 39798, Singapore
Palabras clave:Action recognition; Branch-and-bound; Action recognition; Branch and bounds; Branch-and-bound algorithms; Data sets; High resolution; High spatial resolution; Low resolution video; Mutual information maximization; Search technique; Spatial resolution; Spatial scale; Subvolumes; Upper bound; Video sequences; Image resolution; Video recording; Algorithms
Año:2010
Página de inicio:328
Página de fin:333
DOI: http://dx.doi.org/10.1109/ICME.2010.5583547
Título revista:2010 IEEE International Conference on Multimedia and Expo, ICME 2010
Título revista abreviado:IEEE Int. Conf. Multimedia Expo, ICME
Registro:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_97814244_v_n_p328_Goussies

Referencias:

  • Laptev, I., On space-time interest points (2005) International Journal of Computer Vision, 64 (2-3), pp. 107-123
  • Bentley, J., Programming pearls: Algorithm design techniques (1984) Commun. ACM, 27 (9), pp. 865-873
  • Schuldt, C., Laptev, I., Caputo, B., Recognizing human actions: A local svm approach (2004) Proc. IEEE Conf. on Pattern Recognition
  • Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B., Learning realistic human actions from movies (2008) Proc. IEEE Conf. on Computer Vision and Pattern Recognition
  • Reddy, K.K., Liu, J., Shah, M., Incremental action recognition using feature-tree (2009) Proc. IEEE Intl. Conf. on Computer Vision
  • Shechtman, E., Irani, M., Space-time behavior based correlation (2005) Proc. IEEE Conf. on Computer Vision and Pattern Recognition
  • Ke, Y., Sukthankar, R., Hebert, M., Event detection in crowded videos (2007) Proc. IEEE International Conf. on Computer Vision
  • Yuan, J., Liu, Z., Wu, Y., Discriminative subvolume search for efficient action detection (2009) Proc. IEEE Conf. on Computer Vision and Pattern Recognition
  • Hu, Y., Cao, L., Lv, F., Yan, S., Gong, Y., Huang, T.S., Action detection in complex scenes with spatial and temporal ambiguities (2009) Proc. IEEE Intl. Conf. on Computer Vision
  • Lampert, C.H., Blaschko, M.B., Hofmann, T., Beyond sliding windows: Object localization by efficient subwindow search (2008) Proc. IEEE Conf. on Computer Vision and Pattern Recognition
  • Yuan, J., Liu, Z., Wu, Y., Zhang, Z., Speeding up spatio-temporal sliding-window search for efficient event detection in crowded videos (2009) ACM Multimeida Workshop on Events in Multimedia
  • Bobick, A.F., Davis, J.W., The recognition of human movement using temporal templates (2001) IEEE Trans. on Pattern Analysis and Machine Intelligence (PAMI), 23 (3), pp. 257-267
  • Rodriguez, M.D., Ahmed, J., Shah, M., Action mach a spatio-temporal maximum average correlation height filter for action recognition (2008) Proc. IEEE Conf. on Computer Vision and Pattern Recognition
  • Weinland, E.B.D., Ronfard, R., Free viewpoint action recognition using motion history volumes (2006) Computer Vision and Image Understanding, 104 (2-3), pp. 207-229
  • Ke, Y., Sukthankar, R., Hebert, M., Efficient visual event detection using volumetric features (2005) Proc. IEEE International Conf. on Computer Vision
  • Yang, M., Lv, F., Xu, W., Yu, K., Gong, Y., Human action detection by boosting efficient motion features (2009) IEEE Workshop on Video-oriented Object and Event Classification in Conjunction with ICCV, , Kyoto, Japan, Sept.29-Oct.2
  • Lin, Z., Jiang, Z., Davis, L.S., Recognizing actions by shape-motion prototype trees (2009) Proc. IEEE Intl. Conf. on Computer Vision
  • Jiang, H., Drew, M.S., Li, Z.-N., Successive convex matching for action detection (2006) Proc. IEEE Conf. on Computer Vision and Pattern Recognition

Citas:

---------- APA ----------
Goussies, N.A., Liu, Z. & Yuan, J. (2010) . Efficient search of Top-K video subvolumes for multi-instance action detection. 2010 IEEE International Conference on Multimedia and Expo, ICME 2010, 328-333.
http://dx.doi.org/10.1109/ICME.2010.5583547
---------- CHICAGO ----------
Goussies, N.A., Liu, Z., Yuan, J. "Efficient search of Top-K video subvolumes for multi-instance action detection" . 2010 IEEE International Conference on Multimedia and Expo, ICME 2010 (2010) : 328-333.
http://dx.doi.org/10.1109/ICME.2010.5583547
---------- MLA ----------
Goussies, N.A., Liu, Z., Yuan, J. "Efficient search of Top-K video subvolumes for multi-instance action detection" . 2010 IEEE International Conference on Multimedia and Expo, ICME 2010, 2010, pp. 328-333.
http://dx.doi.org/10.1109/ICME.2010.5583547
---------- VANCOUVER ----------
Goussies, N.A., Liu, Z., Yuan, J. Efficient search of Top-K video subvolumes for multi-instance action detection. IEEE Int. Conf. Multimedia Expo, ICME. 2010:328-333.
http://dx.doi.org/10.1109/ICME.2010.5583547