“School of Cognitive”

Back to Papers Home
Back to Papers of School of Cognitive

Paper   IPM / Cognitive / 17768
School of Cognitive Sciences
  Title:   Going beyond still images to improve input variance resilience in multi-stream vision understanding models
  Author(s): 
1.  A. Fadaei
2.  M. Abolghasemi Dehaqani
  Status:   Published
  Journal: Scientific Reports
  Vol.:  14
  Year:  2024
  Supported by:  IPM
  Abstract:
Traditionally, vision models have predominantly relied on spatial features extracted from static images, deviating from the continuous stream of spatiotemporal features processed by the brain in natural vision. While numerous video-understanding models have emerged, incorporating videos into image-understanding models with spatiotemporal features has been limited. Drawing inspiration from natural vision, which exhibits remarkable resilience to input changes, our research focuses on the development of a brain-inspired model for vision understanding trained with videos. Our findings demonstrate that models that train on videos instead of still images and include temporal features become more resilient to various alternations on input media.

Download TeX format
back to top
scroll left or right