The Modal Corpus consists of three equivalent resources of English, French, and Italian dialogues annotated for epistemic modality. The dialogues are drawn from the Santa Barbara Corpus of Spoken American English (for English); the ESLO Corpus, the OTG Corpus, and the Accueil UBS Corpus (for French); and the VoLip Corpus (for Italian).
We annotated about 20,000 words per language, for a total of 2,824 epistemic constructions (833 in the English Corpus, 1,271 in the French Corpus, and 720 in the Italian Corpus).

