Project introduction


Welcome to the website of our ADA project.

Choice of dataset and research question

Among all the proposed datasets, we chose the CMU Movie Summary Corpus. This dataset gathers information such as each movie's title, genre, release date, language, and cast, as well as a plot summary preprocessed with the Stanford CoreNLP pipeline. With all this information available, we chose to study the representation of stereotypes in cinema.