SAMPA
The Speech Assessment Methods Phonetic Alphabet (SAMPA) is a computer-readable phonetic script using 7-bit printable ASCII characters, based on the International Phonetic Alphabet (IPA).
It was originally developed in the late 1980s for six European languages by the EEC ESPRIT information technology research and development program. As many symbols as possible have been taken over from the IPA; where this is not possible, other signs that are available are used, e.g. [@] for schwa, [2] for the vowel sound found in French deux and [9] for the vowel sound found in French neuf.
Today, officially, SAMPA has been developed for all the sounds of the following languages:
- Arabic
- Bulgarian
- Cantonese
- Czech
- Danish
- Dutch
- English
- Estonian
- French
- German
- Greek
- Hebrew
- Hungarian
- Italian
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Scots
- Serbo-Croatian
- Spanish
- Swedish
- Thai
- Turkish
Problems with SAMPA
SAMPA tables are valid only in the language they were created for. The tables of languages are not harmonised so there are conflicts between languages. The result of this problem is that SAMPA cannot be used as an ASCII representation of the general IPA alphabet. To solve this problem X-SAMPA was created, which provides one single table without language specific differences.
See also:
- A concise version of SAMPA chart for English sounds.
- A more complete SAMPA chart of the sounds found in most of the European languages.
- Kirshenbaum
- Unicode and HTML/IPA Extensions