[article]
Title: |
Attention-based video streaming |
Document type: |
printed text |
Authors: |
Çağatay Dikici, Author; H. Işıl Bozma, Author |
Publication year: |
2012 |
Article on page(s): |
pp. 745–760 |
General note: |
Electronic |
Languages: |
English (eng) |
Keywords: |
Biologically motivated attentive vision; Foveation; Spatio-temporal pre-processing; Face tracking; Neural networks; Video streaming |
Abstract: |
This paper considers the problem of video streaming in low-bandwidth networks and presents a complete framework that is inspired by the fovea–periphery distinction of biological vision systems. First, an application-specific attention function that serves to find the important small regions in a given frame is constructed a priori using a back-propagation neural network that is optimized combinatorially. Given a specific application, the respective attention function partitions each frame into foveal and periphery regions, and then a spatial–temporal pre-processing algorithm encodes the foveal regions with high spatial resolution while the periphery regions are encoded with lower spatial and temporal resolution. Finally, the pre-processed video sequence is streamed using a standard streaming server. As an application, we consider the transmission of human face videos. Our experimental results indicate that even with a limited amount of training, the constructed attention function is able to determine the foveal regions, which have improved transmission quality, while the peripheral regions have an acceptable degradation. |
ISSN: |
0923-5965 |
Online: |
http://www.sciencedirect.com/science/article/pii/S0923596510000998 |
in Signal processing. Image communication > Vol. 25 No. 10 (November 2010). - pp. 745–760