Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
2013-12
A review of the literature on prosody reveals a lack of corpora for prosodic studies in Catalan and Spanish. In this paper, we present a corpus intended to fill this gap. The corpus comprises two distinct data sets, a news subcorpus and a dialogue subcorpus, the latter containing either conversational or task-oriented speech. More than 25 h were recorded by twenty-eight speakers per language. Among these speakers, eight were professionals (four radio news broadcasters and four advertising actors). All the material presented here has been transcribed, aligned with the acoustic signal, and prosodically annotated. Two major objectives have guided the design of this project: (i) to offer wide coverage of representative real-life communicative situations that allow for the characterization of prosody in these two languages; and (ii) to conduct research studies that enable us to contrast the speakers' different speaking styles and discursive practices. All material contained in the corpus is provided under a Creative Commons Attribution 3.0 Unported License.
Peer Reviewed
Postprint (published version)
Article
English
UPC subject areas::Telecommunications engineering::Signal processing; UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing; Versification; Speech processing; Catalan corpus; Dialogue corpus; Prosodic corpus; Radio news corpus; Spanish corpus; Speech processing systems
http://link.springer.com/article/10.1007%2Fs10579-012-9213-0
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Restricted access - publisher's policy
Attribution-NonCommercial-NoDerivs 3.0 Spain