Calling VO-6 Masters

waftlord · January 6, 2026, 10:13am

I’m developing a Text to speech (song?!) process. But the system needs a few examples to help guide accuracy and options.

If anyone is willing to share some pattern/kit sysex (that have quality diphthong and phoneme use) here it will expedite the process.
even just 16 steps with one or two clear words will still be really helpful.

i’ve given it a go but VO-6 is quite subjective in output, so the more approaches/variations we can incorporate the more wide ranging the possibilities.

i want to incorporate the filter for mouth shape effectively alongside the core synthesis. (then LFO for vibrato/tremolo etc)

end game is: import a MIDI file and enter lyrics to make the mono sing. (potential for 1/2/3/4/5/6 voice arrangements (plock permitting)!
(actually in poly mode 6 voice unison arrangements should be far more effective as all voices will share same trigs.

waftlord · January 6, 2026, 10:36am

completely related

LyingDalai · January 6, 2026, 4:38pm

Are you going to interface with a TTS app?
I feel like what is needed is a table
Phonème => VO-6 values
No clue how phonème are encoded though, never looked into this.

waftlord · January 6, 2026, 4:49pm

everything’s in place, it’s its own system. just need a wide range of words/diphthongs/phonemes/morphemes in pattern (.syx) form to help steer.

waftlord · January 6, 2026, 4:51pm

example word vowel VOC1 setting VOC2 setting

lake A 93 118
leak E 40 127
like I - - not a
discreet sound (aah-y-uh)
oh O 99 48
you U - - not a
discreet sound (y-oo)
lack a 127 93
let e 124 98
lick i 91 109
lock (ah) o 127 60
luck (uh) u 112 53
luke u 71 51
look oo 93 61

waftlord · January 6, 2026, 4:51pm

but how these are timed and shaped are user taste.