Abstract: International audience; We here describe the Rhapsodie resource, a syntactic and prosodic treebank of spoken French, composed of 57 short samples of spoken French (5 minutes long on average, amounting to 3 hours of speech and 33000 words), and an orthographic transcription. The transcription and the annotations are all aligned on the speech signal : phonemes, syllables, words, speakers, overlaps. The main objective of the Rhapsodie project is to define rich, explicit, and reproducible schemes for the annotation of prosody and syntax in differen...
(read more)
Topics: 
Natural language processing
Linguistics
Speech recognition