Abstract
In this paper, a multimodal corpus for the study of cartoons as input for language acquisition is presented. After explaining what prompted us to compile the corpus, the stages of defining, collecting, and transcribing the data are briefly discussed. Eventually, we show how the information contained in the corpus can be used as a starting point for studies of the acoustic environment and the soundscape to which viewers of the cartoons considered are exposed.