J. Aggarwal and M. Ryoo, Human activity analysis, ACM Computing Surveys, vol.43, issue.3, pp.1-43, 2011.
DOI : 10.1145/1922649.1922653

Y. Benezeth, B. Emile, H. Laurent, and C. Rosenberger, Vision-Based System for Human Detection and Tracking in Indoor Environment, International Journal of Social Robotics, vol.63, issue.10, pp.41-52, 2010.
DOI : 10.1007/s12369-009-0040-4

URL : https://hal.archives-ouvertes.fr/inria-00545469

M. Blank, L. Gorelick, E. Shechtman, M. Irani, and R. Basri, Actions as space-time shapes, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1395-1402, 2005.
DOI : 10.1109/ICCV.2005.28

J. Candamo, M. Shreve, D. Goldgof, D. Sapper, and R. Kasturi, Understanding Transit Scenes: A Survey on Human Behavior-Recognition Algorithms, IEEE Transactions on Intelligent Transportation Systems, vol.11, issue.1, pp.206-224, 2010.
DOI : 10.1109/TITS.2009.2030963

O. Chomat and J. Crowley, Probabilistic recognition of activity using local appearance, Proceedings of the 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1999.
DOI : 10.1109/CVPR.1999.784616

V. Delaitre, I. Laptev, and J. Sivic, Recognizing human actions in still images: a study of bag-of-features and part-based representations, Proceedings of the British Machine Vision Conference 2010, 2010.
DOI : 10.5244/C.24.97

URL : https://hal.archives-ouvertes.fr/hal-01060885

P. Dollar, P. Rabaud, G. Cottrell, and S. Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp.65-72, 2005.
DOI : 10.1109/VSPETS.2005.1570899

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.77.5712

A. Kläser, M. Marszałek, and C. Schmid, A spatiotemporal descriptor based on 3D-gradients, British Machine Vision Conference, pp.995-1004, 2008.

I. Laptev and T. Lindeberg, Space-time interest points, IEEE International Conference on Computer Vision, pp.432-439, 2003.

I. Laptev, M. Marszałek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756

URL : https://hal.archives-ouvertes.fr/inria-00548659

J. Liu and M. Shah, Learning human action via information maximization, IEEE Int. Conf. on Computer Vision and Pattern Recognition, 2008.

L. Meng, L. Qing, P. Yang, J. Miao, X. Chen et al., Activity recognition based on semantic spatial relation, IEEE International Conference on Pattern Recognition, 2012.

J. Niebles, H. Wang, and L. Fei-Fei, Unsupervised learning of human action categories using spatial-temporal words, International Journal of Computer Vision, vol.79, issue.3, 2008.

J. Niebles, H. Wang, and L. Fei-Fei, Unsupervised learning of human action categories using spatial-temporal words, British Machine Vision Conference, pp.1249-1258, 2006.

O. P. Popoola and K. Wang, Video-based abnormal human behavior recognition: a review, IEEE Trans. on Systems, Man, and Cybernetics, Part C, vol.42, issue.6, pp.865-878, 2012.

P. Matikainen, M. Hebert, and R. Sukthankar, Representing Pairwise Spatial and Temporal Relations for Action Recognition, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15549-9_37

M. Ryoo and J. Aggarwal, Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities, 2009 IEEE 12th International Conference on Computer Vision, pp.1593-1600, 2009.
DOI : 10.1109/ICCV.2009.5459361

M. S. Ryoo and J. K. Aggarwal, UT-Interaction Dataset, ICPR contest on Semantic Description of Human Activities (SDHA), IEEE International Conference on Pattern Recognition Workshops, 2010.

S. Savarese, A. Delpozo, J. Niebles, and L. Fei-Fei, Spatial-Temporal correlatons for unsupervised action classification, 2008 IEEE Workshop on Motion and Video Computing, pp.1-8, 2008.
DOI : 10.1109/WMVC.2008.4544068

C. Schuldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004), pp.32-36, 2004.
DOI : 10.1109/ICPR.2004.1334462

P. Scovanner, S. Ali, and M. Shah, A 3-dimensional SIFT descriptor and its application to action recognition, Proceedings of the 15th International Conference on Multimedia, MULTIMEDIA '07, 2007.
DOI : 10.1145/1291233.1291311

P. K. Turaga, R. Chellappa, V. S. Subrahmanian, and O. Udrea, Machine Recognition of Human Activities: A Survey, IEEE Transactions on Circuits and Systems for Video Technology, vol.18, issue.11, pp.1473-1488, 2008.
DOI : 10.1109/TCSVT.2008.2005594

A. Yilmaz and M. Shah, Recognizing human actions in videos acquired by uncalibrated moving cameras, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.150-157, 2005.
DOI : 10.1109/ICCV.2005.201

L. Zelnik-Manor and M. Irani, Event-based analysis of video, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, 2001.
DOI : 10.1109/CVPR.2001.990935

G. Zhao and M. Pietikäinen, Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.6, pp.915-928, 2007.
DOI : 10.1109/TPAMI.2007.1110