Text to speech by cepstral

8/21/2023

Cepstral offers 8KHz and 16KHz voices at a reasonable cost (when compared with other non-free solutions) and are compatible with the Festival project (see the Festival script on the Cepstral Support Pages. (use the external links below to compare two systems). One might find that the most notable pro to using this commercial product, compared to a free solution, is the quality of the voices available. For general use, Swift's functions are comparable to that of Festival and other free solutions, allowing one to easily convert text to speech, and the ability to save that speech as an audio file (e.g.wav). Přibil, J., Přibilová, A.: Distributed Listening Test Program for Synthetic Speech Evaluation.Swift is a text-to-speech engine made by Cepstral allowing the user to convert text to computer generated speech. IEEE Signal Processing Magazine 16, 22–38 (1999)Īkande, O.O., Murphy, P.J.: Estimation of the Vocal Tract Transfer Function with Application to Glottal Wave Analysis. A Perfect Fit for Signal and Image Processing. Přibil, J., Madlová, A.: Two Synthesis Methods Based on Cepstral Parameterization. Ito, T., Takeda, K., Itakura, F.: Analysis and Recognition of Whispered Speech. In: Proceedings of the 2nd International Symposium on Brain Vision and Artificial Intelligence, Naples, pp. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-22, 207–217 (1974)Įsposito, A., Stejskal, V., Smékal, Z., Bourbakis, N.: The Significance of Empty Speech Pauses: Cognitive and Algorithmic Issues. Gray, A.H., Markel, J.D.: A Spectral-Flatness Measure for Studying the Autocorrelation Method of Linear Prediction of Speech Analysis. In: Proceedings of the 15th Biennial International EURASIP Conference Biosignal, Brno, pp. Vích, R.: Cepstral Speech Model, Padé Approximation, Excitation, and Gain Matching in Cepstral Speech Synthesis. In: Proceedings of the 13th International Conference on Systems, Signals and Image Processing, Budapest, pp. Přibil, J., Přibilová, A.: Voicing Transition Frequency Determination for Harmonic Speech Model. IEEE Transactions on Audio, Speech, and Language Processing 14, 1137–1144 (2006) Theune, M., Meijs, K., Heylen, D., Ordelman, R.: Generating Expressive Speech for Storytelling Applications. In: Proceedings of Eurospeech, Budapest, pp. House, D., Bell, L., Gustafson, K., Johansson, L.: Child-Directed Speech Synthesis: Evaluation of Prosodic Variation for an Educational Computer Program. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M.

Přibil, J., Přibilová, A.: Emotional Style Conversion in the TTS System with Cepstral Description.

IEEE Transactions on Audio, Speech, and Language Processing 14, 1145–1154 (2006) Tao, J., Kang, Y., Li, A.: Prosody Conversion from Neutral Speech to Emotional Speech. IEEE Transactions on Audio, Speech, and Language Processing 14, 1117–1127 (2006) Navas, E., Hernáez, I., Luengo, I.: An Objective and Subjective Study of the Role of Semantics and Prosodic Features in Building Corpora for Emotional TTS. Iida, A., Campbell, N., Higuchi, F., Yasumura, M.: A Corpus-Based Speech Synthesis System with Emotion. Přibilová, A., Přibil, J.: Non-linear Frequency Scale Mapping for Voice Conversion in Text-to-Speech System with Cepstral Description.

0 Comments

Text to speech by cepstral

Leave a Reply.

Author

Archives

Categories