- Prevent configure from picking up any stray /dev/dsp's and believing
it should use OSS
- In the play script, add proper arguments if ran on NetBSD
- Fix the $PATH setting in the play script
diphones, male voice.
The following phoneme symbols are assumed in the us3 diphone sets. It
slightly different than the SAMPA alphabet since american english is not
british english.
SYMBOL PRONOUNCED LIKE IN
p drop proxy
t plot tromp
4 later (flapped allophone of t)
k rock crop
b cob box
d nod dot
g jog gospel
f prof fox
s boss sonic
S wash shop
tS notch chop
T cloth thomp
v salve volley
z was zombie
Z garage jacques
dZ dodge jog
D clothe thy
m palm mambo
n john novel
N bong
l doll lockwood
l= litle
r star roxanne
j yacht
w show womble
h harm
r= her urgent
i even
A arthur
u oodles
I illness
E else
{ apple
V nut
U good
@ about
EI able
AI island
OI oyster
@U over
aU out
O all
-Julian Assange <proff@iq.org>
synthesis system.
This voice provides a American English male voice using the MBROLA
synthesis method. It uses a modified CMU lexicon for pronunciations.
Prosodic phrasing is provided by a statistically trained model using
part of speech and local distribution of breaks. Intonation is
provided by a CART tree predicting ToBI accents and an F0 contour
generated from a model trained from natural speech. The duration
model is also trained from data using a CART tree.
The quality of this voice is not as high as us1 and us2
This voice can be activated via (voice_us3_mbrola)
-Julian Assange <proff@iq.org>
synthesis system.
This voice provides a American English male voice using the MBROLA
synthesis method. It uses a modified CMU lexicon for pronunciations.
Prosodic phrasing is provided by a statistically trained model using
part of speech and local distribution of breaks. Intonation is
provided by a CART tree predicting ToBI accents and an F0 contour
generated from a model trained from natural speech. The duration
model is also trained from data using a CART tree.
This voice can be activated via (voice_us2_mbrola)
-Julian Assange <proff@iq.org>
synthesis system.
This voice provides a American English female voice using the MBROLA
synthesis method. It uses a modified CMU lexicon for pronunciations.
Prosodic phrasing is provided by a statistically trained model using
part of speech and local distribution of breaks. Intonation is
provided by a CART tree predicting ToBI accents and an F0 contour
generated from a model trained from natural speech. The duration
model is also trained from data using a CART tree.
This voice can be activated via (voice_us1_mbrola)
-Julian Assange <proff@iq.org>
wq
Festival 1.4.0 has the following improvements over the previous release (1.3.1 January 1999)
o distributed under a free X11-type licence
o generalization of stats modules, ngram, CART, wfst with viterbi so they
can be shard more easily
o Tidy up of Utterance/Relation/Item architecture
o Initial JSAPI support
o Three new us voices using MBROLA databases
o Tilt code overhaul
o XML load for Relations
o Fringe graphic display (ALPHA) released seperately
http://www.cstr.ed.ac.uk/projects/fringe.html