Text-to-Speech Synthesis

Before delving into the details of how to perform text-to-speech conversion, we will first examine some of the fundamentals of communication in general. This chapter looks at the various ways in which people communicate and how communication varies depending on the situation and the means used. From this we can develop a general model, which will then help us specify the text-to-speech problem more exactly in the following chapter.
We experience the world through our senses, and we can think of this as a process of gaining information. We share this ability with most other animals: if an animal hears running water it can infer that there is a stream nearby; if it sees a ripe fruit it can infer that there is food available. This ability to extract information from the world via the senses is a great advantage in the survival of any species. Animals can, however, also cause information to be created: many animals make noises, such as barks or roars, or gestures, such as flapping or head nodding, which are intended to be interpreted by other animals. We call this deliberate creation of information, with the intention that it be interpreted, communication.
The prerequisites for communication are an ability in one being to create information, a means of transmitting this information, and an ability in another being to perceive the created information. All three of these prerequisites strongly influence the nature of communication; for example, animals...