

Large-scale Video Classification with Convolutional Neural Networks [45]

Original Abstract

Convolutional Neural Networks (CNNs) have been established as a powerful class of models for image recognition problems. Encouraged by these results, we provide an extensive empirical evaluation of CNNs on large-scale video classification using a new dataset of 1 million YouTube videos belonging to 487 classes. We study multiple approaches for extending the connectivity of a CNN in time domain to take advantage of local spatio-temporal information and suggest a multiresolution, foveated architecture as a promising way of speeding up the training. Our best spatio-temporal networks display significant performance improvements compared to strong feature-based baselines (55.3

Main points


Miquel Perello Nieto 2014-11-28