T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, Simultaneous modeling of spectrum, pitch and duration in hmm-based speech synthesis, Proc. of Eurospeech, pp.2347-2350, 1999.

C. Boidin and O. Boëffard, Generating intonation from a mixed cart-hmm model for speech synthesis, Proc. of Interspeech, 2008.

J. Latorre and M. Akamine, Multilevel parametric-base f0 model for speech synthesis, Interspeech, 2008.

Y. Morlec, Génération multiparamétrique de la prosodie du français par apprentissage automatique, 1997.

B. Holm, Sfc : un modèle de superposition de contours multiparamétriques pour la génération automatique de la prosodie -apprentissage automatique et application à l'énonciation de formules mathématiques, 2003.

H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, A Hidden Semi-Markov Model-Based Speech Synthesis System, Proc. ICSLP, pp.1397-1400, 2004.
DOI : 10.1093/ietisy/e90-d.5.825

B. Gao, Y. Qian, Z. Wu, and F. Soong, Duration refinement by jointly optimizing state and longer unit likelihood, Proc. of Interspeech, 2008.

J. Yamagishi, H. Kawai, and T. Kobayashi, Phone duration modeling using gradient tree boosting, Speech Communication, vol.50, issue.5, pp.405-415, 2008.
DOI : 10.1016/j.specom.2007.12.003

S. Chen, W. Lai, and Y. Wang, A new duration modeling approach for mandarin speech, IEEE Transactions on Speech and Audio Processing, vol.11, issue.4, pp.308-320, 2003.
DOI : 10.1109/TSA.2003.814377

K. Sreenivasa-rao and B. Yegnanarayana, Modeling durations of syllables using neural networks, Computer Speech and Language, vol.21, issue.2, pp.282-295, 2007.

P. Barbosa and G. Bailly, Generating segmental duration by p-centers, Proc. of the Fourth Workshop on Rhythm Perception and Production, pp.163-168, 1996.

F. Gachet and M. Avanzi, Les parenthèses en français : Etude prosodique, 2009.

F. Koopmans-van-beinum and M. Van-donzel, Relationship between discourse structure and dynamic speech rate, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996.
DOI : 10.1109/ICSLP.1996.607960

T. Mishra, J. Van-santen, and E. Klabbers, Decomposition of pitch curves in the general superpositional intonation model, Speech Prosody, 2006.

J. O. Dell, The use of context in large vocabulary speech recognition, 1995.

C. Veaux, B. Beller, D. Schwarz, and X. Rodet, Ircamcorpustools : an extensible plateform for speech corpora exploitation, Proc. of ELREC, 2008.

P. Lanchantin, A. Morris, X. Rodet, and C. Veaux, Automatic phoneme segmentation with relaxed textual constraints, Proc. of ELREC, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01161385

F. Béchet, Liaphon : un système complet de phonétisation de textes, pp.47-67, 2008.

N. Obin, X. Rodet, and A. Lacheret-dujour, A syllablebased prominence model based on discriminant analysis and context-dependency, Proc. of SPECOM, 2009.
URL : https://hal.archives-ouvertes.fr/halshs-00636518