Casio CT-S1000V: More tips

A few new Casio CT-1000V tips and observations. Hope they’re helpful.

Portamento

The CT-S1000V (CT-S500) has portamento. Control over portamento is quite flexible. Be prepared to experiment, however, as the interaction between portamento settings is not immediately obvious. Unfortunately, the User Guide is not super helpful as it refers to several terms with “portamento” in the name, e.g., “Upper Portamento,” “Part Portamento”, etc.

All of these tips apply to the CT-S500, too.

There are two different ways to access portameto-related settings: through the MENU button and through the Settings sub-menu.

The MENU parameters allow the following adjustments:

  • UPPER PORT: Turn on Upper Portamento.
  • PART PORT: Turn on Part Portmento for each part (Upper1, Upper2, Lower).
  • TIME: Change portamento time for Upper1, Upper2 and Lower, individually. Each part has its own time.

When Upper Portamento is enabled, you can enable/disable portamento on the Upper part using a front panel button.

Casio CT-S1000V/CT-S500 UPPER PORT soft button [Casio]

Ah, so which panel button is that? If you press the INSTRUMENT button, the CT-S1000V displays five soft buttons: SPLIT, LAYER, TOUCH, SUS and ARP. The SUS button controls sustain. If you want to control Upper Portamento instead, dive into the Settings sub-menu and scross to “SUS/UPPER PORT button”. Change the value from “SUS” to “UPPER PORT”. Now the INSTRUMENT button shows “UPPER PORT” instead of “SUS”. Pressing the “UPPER PORT” soft button applies portamento to the Upper part. This feature allows you to apply portamento selectively during a solo line.

I hope this brief overview helps when reading the User Guide. I recommend reading the fine print about Upper Portamento because Upper Portamento can override Part Portamento. (Surprise!)

Expression pedal

For some crazy reason, I didn’t hook up and configure an expression pedal on Day 1. In retrospect, one should probably tangle with pedal set-up early just in case pedal settings are saved in user memory locations like CT-S1000V registrations.

The User Guide gives good step-by-step directions concerning pedal set-up. The CT-S1000V is vendor agnostic, thankfully. I have three (!) Yamaha FC-7 pedals and didn’t want to buy another pedal. The CT-S1000V supports two TRS wiring schemes as shown in the diagrams below:

Casio CT-S1000V/CT-S500 pedal polarity types [Casio]

To make a long story short, the Yamaha FC-7 is polarity type 1. The FC-7 resistance is 50K ohms and be sure to go through the simple calibration steps in the User Guide. For reference, the FC-7 TRS signals are:

  • Tip: Reference voltage
  • Ring: Wiper
  • Sleeve: Ground

You wouldn’t believe how many forum posts get this wrong!

Roland, Kurzweil and Fata are polarity type 2. The User Guide confirms operation for the Roland EV-5, Kurzweill CC-1, Fatar VP-25 and Fatar VP-26. Type 2 TRS signals are:

  • Tip: Wiper
  • Ring: Reference voltage
  • Sleeve: Ground

The Roland FV-500L should work, too. Be aware that the EV-5 and FV-500L have a “minimum volume” potentiometer (variable resistor) in series with the main control potentiometer. Turn the minimum volume control to 0 before calibrating. The main control potentiometer resistance is 10K ohms; the minimum volume potentiometer resistance is 50K ohms.

Rant of the day: I have a nice, light-weight Boss EV-1-WL expression pedal. Wish I could use it with the CT-S1000V (and others). No device to device BLE, no 5-pin MIDI, no host compatibility. Arg.

Are pedal settings really stored?

I posted this question on the Casio Music Forums site. Even though the User Guide claims the pedal settings are stored in MY SETUP and registrations, I haven’t seen evidence. If you change the pedal settings and load a registration (or MY SETUP), the changed settings remain.

I hope that Casio will clarify.

Drawbar organ tones

As mentioned earlier, I pulled together a bank of drawbar organ registrations. I settled on the following tones:

    Gospel Organ 2    Mellow 
Rock Organ 2 Mellow
Elec.Organ 1 Bright church-y for hymns
Elec.Organ 6 Simmerin' for grease

In all four cases, I split the keyboard putting the “Organ Bass” tone in the left hand. I like the way Organ Bass holds down the low-end and doesn’t sound swirly. Split point is E4.

Movin’ on to pipe organs…

Copyright © 2022 Paul J. Drongowski

Casio CT-S1000V: Drawbar organ tones

I’m choosing and dialing in the Hammond B-3 organ tones for church tunes. The Casio CT-S1000V (and CT-S500) provides over thirty electric organ tones. Most of the organ tones incorporate an Active DSP effect.

Choosing tones is a process of selection and elimination. Some sounds just aren’t suitable for church, e.g., distorted rock organ, combo organ, or theatre organ. As to personal taste, I usually avoid voices with hard percussion. I often play in a group with piano, guitar or drums. There is already enough going on percussively that I don’t need to add another element to the on-going rhythm. In a typical situation, it’s more important to hold down or emphasize a left-hand bass.

Tone name        Description                     DSP effect 
--------------- ------------------------------ ------------
JS Organ Jimmy Smith, percussion AMP cab->Trem->3EQ
AMP Organ 1 First four, no percussion Phaser->Drive->AMP cab
Rock Organ 1 No percussion Phaser->Drive->AMP cab
Hard Rock Organ Chorus->Drive->AMP cab
Gospel Organ 1 Mellow-ish Rotary
Velo Organ Velocity brings in bars Drive rotary
F.Organ Farfisa combo organ Trem->AMP cab
V.Organ Vox combo organ Trem->AMP cab
RTF FD Organ Rotary fast, full drawbars Drive rotary
Rock OD Organ Drive rotary
Tremolo Organ Mellow, C3 sampled-in Auto pan
DP Organ Total Jon Lord Dist
Jazz Organ 1 Bright hard perc jazz Rotary
Jazz Organ 2 Darker hard perc jazz Rotary
Elec.Organ 1 Bright. church-y Rotary
Elec.Organ 2 Mellow-ish, key click Rotary
Elec.Organ 3 First four Rotary
Elec.Organ 4 Full drawbars Rotary
Elec.Organ 5 Mellow, soft perc Phaser
Perc.Organ 1 Bright hard perc Rotary
Perc.Organ 2 Darker soft perc Rotary
Gospel Organ 2 Mellow Rotary
Full Drawbar Full drawbar Rotary
Rock Organ 2 Church-y (swimming rotary) Drive rotary
Rock Organ 3 Mellow-ish very fast Drive rotary
Click Organ Low drawbars, key click Drive rotary
70's Organ Farfisa-like, key click Rotary
Organ Pad Weird No DSP effect
Theatre Organ No DSP effect
Perc.Organ 3 Hard perc, low drawbars Rotary
Elec.Organ 6 Low+high (16+1), click Rotary
AMP Organ 2 Rocky bars Tone->Drive->AMP cab->Trem
AMP Organ 3 Heavy OD Tone->Drive->AMP cab->Trem
Organ Flute Pipe organ flute No DSP effect
Puff Organ Chiffy organ flute No DSP effect
Reed Organ Portable reed organ Mono 3-band EQ
Rotary F-Organ Farfisa organ, rotary speaker Rotary
Rotary V-Organ Vox organ, rotary speaker Rotary

The table above are notes taken while auditioning CT-S1000V drawbar organ tones. The CT-S500 has the same complement of organ voices, so any comments made here apply to the CT-S500, too.

My goal is a registration bank of four tones that provide a range of timbres. I want to push a registration button while playing in order to match the current musical situation. So far, a few candidates stand out:

    Elec.Organ 6     Simmering 16+1 
Gospel Organ 2 Mellow, ballad-like sound
Gospel Organ 1 Brighter, hymn foundation
Rock Organ 2 Brighter still, hymn playing

I listened to all organ tones with Active DSP enabled and bypassed. “Rock Organ 2” has a church-y sound when the overdrive and rotary speaker are removed. I can knock the rock out of the preset by re-programming the Active DSP effect.

Most organ tones have a preset Active DSP effect as shown in the table. A few of the tones — like AMP Organ 1 — don’t use the rotary effect at all. (Surprise!)

As I mentioned in an earlier post, a few of the tones have a swirl-y or swimming fast rotary setting. Sometimes the rise acceleration and fall deceleration times are too short. The settings below are ballpark.

    Drive Rotary     PJD      Velo Org 
---------------- -------- --------
Type: 2 2
Overdrive Gain: 42 48
Overdrive Level: 42 31
Speed: Fast Fast
Brake: Rotate Rotate
Fall Accel: 40 10
Rise Accel: 60 18
Slow Rate: 40 17
Fast Rate: 100 111
Vib/chorus: C3 C2
Wet Level: 100 100
Dry Level: 0 44
Bypass: Off Off

I will use these parameter values as my starting point moving ahead. The wet/dry parameters could be helpful in a stereo mix avoiding hard left/right channel throb. BTW, the sim doesn’t have parameters for horn/rotor balance.

The Casio rotary speaker sims include a scanner vibrato/chorus option: Off, V1, V2, V3, C1, C2, C3. The presets assign overdrive to knob 2 (K2). I don’t usually adjust overdrive when I’m playing and will assign vibrato/chorus control instead. That way I can add or subtract chorus on the fly. Unfortunately, scanner off is at the knob’s full left position and C3 is at the full right position. That will make for a big gesture knob-spin when playing. Wish this could be assigned to a button…

As to reverb, I’ve settled on ROOM 2 with a send level of 20. This is just enough to add a little space when practicing, but it won’t muddy the sound too much when playing live.

Copyright © 2022 Paul J. Drongowski

Casio CT-S1000V: About them effects

Casio gave the CT-S Casiotone series a major boost with the new CT-S500 and CT-S1000V models. The new models include the acclaimed CT-S1 voices. Even better, Casio added a wide range of editable DSP effects. The DSP effects are in addition to the three system effect units: reverb, chorus and delay. Up to two DSP effect chains can be applied to tones (voice) and auto-accompaniment parts.

Casio CT-S500 and CT-S1000V simplified [Casio]

Casio have their own way of organizing effects. DSP effects are organized into chains, where each chain consists of one to four DSP modules. A DSP module is a unit executing a particular effect algorithm like EQ, tremolo, compression, etc. There are 29 DSP module types.

As a user, you don’t work directly in terms of DSP modules. Instead, you select a chain from one of 100 preset DSP chains. The first 25 or so preset chains have only a single DSP module, just about covering the 29 DSP module types individually. The remaining 75 presets have two, three or four DSP modules. Many chains are obviously guitar-oriented while others are slanted toward keyboards, percussion and ambience. When it comes to effect applications, who am I to judge? 🙂

The CT-S approach may appear to be too restrictive, but each DSP module has a BYPASS parameter which lets you turn off its stage. If you can find a preset chain with effects in the desired order, you’re good to go, thanks to DSP parameter editing.

Thank heavens for DSP parameter editing! The rotary speaker simulations are too fast right out of the box, for example. In addition to modifying effect parameters, you can assign a DSP parameter to each of the three front panel knobs (e.g., rotary speaker speed). Everything gets saved into a registration slot from which tone and effect settings are recalled.

Oh, yes, you have full access to Active DSP parameters. Unfortunately, this is not true for system effects (reverb, chorus, and delay). In the case of system effects, only the effect type (e.g., ROOM 3, HALL 2, etc.) can be changed. Minor bummer.

If I’m a little vague about effect routing, it’s because Casio have not published an effect routing diagram for the CT-S500 or CT-S1000V. I don’t feel like I’ve been held back in practice, but I would love to see a detailed signal flow. A diagram would clear up people’s questions about effect assignment (how many and where).

I’ve been pulling together a dozen of my most frequently used patches (layers and splits). I like to detune layers in order to add a bit more motion and interest. Individual tones, however, do not have a fine tuning parameter (i.e., plus or minus cents).

The orchestral voices are not pre-programmed with a DSP effect (thank goodness). Lacking a fine tuning control, an alternative method is to add a pitch shift DSP module to the tone. There are four DSP preset chains that contain pitch shift:

  #21 Pitch:        --> Pitch --> 
#87 Pitch Delay: --> Delay --> Pitch --> Phaser --> Auto Pan -->
#90 Pitch Mod 1: --> Tone --> Phaser --> Delay --> Pitch -->
#91 Pitch Mod 2: --> Pitch --> Delay --> Phaser --> Tone -->

The multi-module chains are fun, but waaay more complicated than necessary. Nonetheless, I jotted down the pitch-related parameters for reference:

    Parameter    Pitch  PitcDel1  PitcMod1  PitcMod2 
----------- ----- -------- -------- --------
Preset # 21 87 90 91
Pitch 0 0 +14 0
High Damp 127 127 127 127
Feedback 127 64 127 70
In Level 110 110 110 127
Wet Level 127 110 110 127
Dry Level 127 110 127 127
Fine +6 +12 +4 +6

Turns out, the stock pitch parameters (column 1) are ready to go, adding a pleasant chorus-like effect to layered tones.

Of course, I can’t let it rest and tried the six ambient enhancement presets. I got a nice, subtle feel with “AmbientEnh 4”. The chain breaks down like this:

    Piano Effect   -->      Delay     -->  Tone Control 
-------------- -------------- --------------
Lid: Full Open Time: 90 Low Freq: 50
Ref Level: 60 TmRatioL: 28 Low Gain: -12
TmRatioR: 64 Mid Freq: 100
Level L: 110 Mid Gain: -6
Level R: 110 High Freq: 16k
Fdbk: Cross High Gain: 0
Fdbk Level: 30
High Damp: 120
Tempo Syn: Off
In Level: 127 In Level: 100 In Level: 90
Dry Level: 110 Dry Level: 110 Dry Level: 0
Wet Level: 90 Wet Level: 110 Wet Level: 90
Bypass: Off Bypass: Off Bypass: Off

I used my ears first before diving into the details. Yep, I was surprised to see “Piano Effect” in the chain. Frankly, I don’t care what it’s called as long as it sounds good! The chain takes a little harshness off woodwinds while adding early reflections and cross delay. Cross delay isn’t too important to me as I will be playing in MONO at the job. Still, why not?

Hopefully, this look at CT-S1000V (CT-S500) DSP effects is helpful.

Copyright © Paul J. Drongowski

Casio Lyric Creator: Pronunciation

You might decide to fine tune the Casio CT-S1000V’s pronunciation of certain words or syllables. Crack open Casio Lyric Creator, select a lyric phrase, and long-press a word or syllable. Lyric Creator opens the “Edit Phonemes” display.

Casio Lyric Creator phoneme editing

If you’re working in English, Lyric Creator shows the word’s phonetic translation above a peculiar looking on-screen keyboard. The keys correspond to the major phonemes in the English language.

If you want to spend (waste) an entire afternoon, query English phonics. At the very least, you’ll finding articles about phonics for first readers. If you deep dive further, you will be down the infinite rabbit hole of linguistics. Beware!

Sometimes it’s worth it to check up on Lyric Creator. In one version of “Amazing Grace,” Lyric Creator produced the ‘G’ at the beginning of “Grace.” In another version, it didn’t produce the ‘G’ at all. YMMV. The C-MU Pocket Sphinx of 10,000 frequent words is a good resource when checking pronounciations. [Hope they didn’t ask anyone from Pittsburgh to make this list.]

There are roughly 44 common phonemes in English separated into vowels and consonants. Looking at the phoneme keyboard, most of the consonants make sense. There are a few two-letter combinations like “ng”, “tt”, “th”, “jh”, and “zh”. These cover sounds like the “ng” in “sing”.

The vowels, however, are not what they appear to be! The phonemes follow the International Phonetic Alphabet (IPA). [Not “India Pale Ale”.] The IPA has more special characters (glyphs) than Iverson’s APL (A Programming Language). The glyphs are mapped to two- and three-character sequences.

So, what reference phonetic alphabet does Lyric Creator use? CT-S1000V Vocal Synthesis was trained on the Carnegie-Mellon University (C-MU) Pronouncing Dictionary. The C-MU Pronouncing Dictionary employs the ARPAbet, which was first used in speech recognition research.

Here’s a quick and dirty correspondence table between symbols in the ARPAbet and English sounds:

    Vowels          Consonants      Consonants 
---------- ---------- ----------
aa bOt b Buy ng siNG
ae bAt ch CHin p Pie
ah bUt d Die r Rye
ax oracLE dd miDDle s Sigh
ao OUght dh THy sh SHy
aw bOUt f Fight t Tie
axr lettER g Guy tt auTumn
ay bIte hh High th THigh
eh bEt jh Jive v Vie
er bIRd k Kite w Wise
ey bAIt l Lie y Yacht
ih bIt m My z Zoo
iy bEAt mm zh diviSion
ow bOAt n Night cl
oy bOY nn wiNNer
uh bOOk
uw bOOt

You’ll notice several oddities among the English vowels, e.g., “ay” producing the long ‘i’ sound in “bite”. “ay”, really? No doubt, you saw the “ey” in “Grace” and thought that was strange, too.

The 44 common phonemes aside, there are a few phonemes in the Lyric Creator keyboard that required investigation. There are a few phonemes for which I have no clue! Next are a few special cases for your consideration.

There are two “th” sounds: voiced and unvoiced. A voiced sound is produced with the vocal chords; An unvoiced sound is produced solely by other components of the vocal tract. In Lyric Creator:

  • “dh” is the voiced “th” sound.
  • “th” is the unvoiced “th” sound.

Some vowel sounds are influenced by the letter ‘r’. The phoneme “axr” is a vowel influenced by ‘r’ as in “creator” or “letter”.

“ax” turns up in some interesting cases like “autumn” (ao,tt,ax,m), “middle” (m,ih,dd,ax,l) and “kindle” (k,ih,n,d,ax,l).

The word “middle” contains “dd” and “tt”. These phonemes lead to a discussion of alveolar flapping in English. [Yes, “flapping”.] We now stand at the maw of the rabbit hole and I will take my leave.

Have fun!

Copyright © 2022 Paul J. Drongowski

Casio Lyric Creator: First experience

Here are some observations after getting my feet wet with Casio Lyric Creator and Casio CT-S1000V.

Connection

As several other folks have mentioned, you must connect your iPad (Android device) to the CT-S1000V with a USB cable. Right now, Casio Lyric Creator cannot communicate with the CT-S1000V over Bluetooth.

It’s a little weird. Casio Music Space — recently released — does communicate with the CT-S1000V over Bluetooth. Casio Music Space could be a useful educational tool. However, it doesn’t fill a need for me.

Casio Music Space has a pairing dialog when initiating Bluetooth connection. Perhaps this is all that Lyric Creator needs? The IOS Bluetooth settings page does not find or show the CT-S1000V; I guess pairing is up to the app.

This is all new and perhaps we should give the Casio developers a little more time. At the moment, the WU-BT10 Bluetooth dongle is not much help. Glad it was included with the CT-S1000V. If I laid out $80USD for the dongle on top of the CT-S1000V, I’d be disappointed.

Memory capacity

The CT-S1000V comes factory-loaded with 100 dance floor (EDM) phrases. It’s fun to mess with these although they are not my cup of tea.

Casio Lyric Creator has an “Instrument Data Management” button at the bottom of the main screen. IDM displays the phrases loaded into a connected CT-S1000V.

There are 48MBytes of space of internal phrase memory allocated for phrases. Nearly all of the factory phrases are about 300KBytes, occupying roughly 30MBytes total. That leaves about 18MBytes free and available.

At some point, I will zap the factory phrases. Fortunately, Casio provide a file with the CT-S1000V Preset Lyric Tone Data. The file is in the Electronic Musical Instruments support area. You will need to scroll down to the “Digital Keyboards” section to find the file.

It’s a DAL file which resets all user data. Yikes! Be sure to save your own content before loading this file! It’s a factory reset.

The Lyric Creator User Guide describes how to restore individual lyric tones (phrases). Check out the Data Management section of the Guide for more details.

Just a phrase

Starting simple, I created a phrase from the first line of “Amazing Grace:”

Am -az -ing grace how sweet the sound

Then I added note values (durations), save the lyrics, and transfered the phrase to the CT-S1000V. Pretty smooth although I missed the need to SAVE the lyric file (with the cryptic name) to the CT-S1000V. The keyboard sat there in the “Preparing” state until the transfer request timed out.

Casio Lyric Creator

I played the phrase note by note. All good. Then I tried to find the most natural sounding Vocalist, settling on the Bossa Nova Vocalist.

This short phrase, BTW, took roughly 600KBytes of storage.

Prying eyes

Of course, the next thing is to save the lyrics file (extension lyj) on the PC and open the file with a text editor (Emacs). Sure enough, it’s all text. A lyrics file contains:

  • Header information about file and path names.
  • The lyric text as one continuous string.
  • A sequence of syllable data items.
  • Lyric Creator option settings, e.g., input language mode, auto split, auto conversion, etc.
  • More file paths including the Music XML file path.

It’s all one big string with delimiters dividing fields, attributes and values. Not the easiest thing for a human to read and we probably weren’t expected to poke around inside. (Ha!)

The sequence of syllable data items is the most interesting part. This is how Lyric Creator subdivided your lyric phrase and its note values. Here is a typical syllable data item:

{"text":"grace","phoneme":"g,r,ey,s","length":960,"note":60}

“grace” is the syllable. Normally, the phoneme property is empty. I choose to enter my own phonetic spelling for “grace”. [I’ll have a lot more to say about phonetic spelling in a future post.] If you don’t spell out your own phonemes, Casio will use the default spelling and leave the phoneme property blank. Length is the note value (duration) specified in tick units (480 ticks per quarter note). The note property is a MIDI note number (default 60 or middle C).

Importing Music XML

That was too easy. Let’s go for all the marbles and import a Music XML file. Now the road gets rougher. I think Casio need to do more testing.

I installed the latest version of MuseScore and created a simple chart for “Amazing Grace.” The Lyric Creator Music Guide says: “If the file being imported has multiple parts, only the first part will be imported.” I take this to mean “Lyric Creator will only import the first verse.” Thus, I only created one verse.

I exported an uncompressed Music XML file from MuseScore. Lyric Creator cannot import compressed Music XML; only .xml or .musicxml extensions are allowed.

OK, I should have saved the first attempt to show you. [I didn’t.] Lyric Creator brought in the lyrics — sort of. It missed the pick-up syllable (note) and it inserted a few extra syllables. On reflection, the extra syllables may be Do Re Mi Fa So La Ti Do, corresponding to the pitches of the C scale. Surprise! Lyric Creator also inserted a few rests where rests occurred in the score. Hyphen placement seemed a little random.

Overall, if the text is short, I wouldn’t bother with Music XML right now. Pressing on…

I cleaned up the lyric text: deleting Do Re Mi etc., adding the missing pick-up syllable, and so forth. The changed text saved OK.

Next, I edited the note values. Lyric Creator had generated note values based on the Music XML score. However, when I added the pick-up syllable, it looked like everything was now off by one place. Diligently, I changed the note values to match the original score. Went to save, and uh-oh, the text plus note values are too big to save. I deleted the rests and that was enough to trim the lyrics and make Lyric Creator happy again.

The Lyric Creator User Guide has a caveat: “You can enter lyrics for up to 100 syllables when the note value (note length) is eighth note. The number of syllables you can input depends on the note value.” Looks like I tripped the limit.

I transferred the lyrics for the first verse to CT-S1000V and successfully played them note by note. Hurray!

The entire first verse with note values required 1.6MBytes of storage. Yes, some of those factory phrases are gonna go!

My final observation has to do with the MIDI note numbers. Are they or what are they? I need to investigate the note property as the generated note numbers don’t match the melody. I question my own conjecture…

Summary

Well, Lyric Creator works within its limitations. I’m not sure that Music XML is ready for prime time. Unless I really, really wanted or needed to import Music XML, I would start simply with Lyric Creator’s editor.

Copyright © 2022 Paul J. Drongowski

Casio singing synthesis in pictures

I’m slowly immersing myself in the singing synthesis technology behind the Casio CT-S1000V. Heck, ya need somethin’ to do during TV advertisements while watching sports. 🙂

There are two major approaches to speech (singing) synthesis: unit-selection and statistical parametric.

Most people are familiar with unit-selection systems like Texas Instruments old Speak and Spell or the much more advanced Yamaha Vocaloid™. Unit-selection relies upon a large database of short waveform units (AKA phonemes) which are concatenated during synthesis. The real trick behind natural sounding singing (and speech) is the connective “tissue” between units. Vocaloid creates waveform data that connects individual phonetic units.

If you are familiar with Yamaha’s Articulation Element Modeling (AEM), a light should have lit in your mind. The two technologies have similarities, i.e., joining note heads, bodies, and tails. The Yamaha NSX-1 chip implements a stripped down Vocaloid engine and Real Acoustic Sound (AEM).

The content and size of the unit waveform database is a significant practical problem. The developers must record, organize and store a huge number of sampled phrases (waveform units). The Vocaloid 2 Tonio database (male, operatic English singer) occupies 750MBytes on my hard drive — not small and was a real challenge to collect, no doubt.

Statistical parametric systems effectively encode the source phonetic sounds into a model such as an hidden Markov model (HMM). During training, the source speech is subdivided into temporal frames and the individual frames are reduced to acoustic parameters. The model learns to associate specific text with the corresponding acoustic parameters. During synthesis, the model is fed text and acoustic parameters are recalled by the model. The acoustic parameters drive some form of vocoding. (“Vocoding” is used broadly here.)

Deep neural networks (DNN) improve on HMM. Sinsy is a DNN-based singing voice synthesis (SVS) system from Nogoya Institute of Technology. It is the culmination of many years of research by sensei Professor Keiichi Tokuda, his students and colleagues. It was partially supported by the Casio Science Promotion Foundation. Thus, adoption by Casio is hardly accidental!

Sinsy Singing Voice Synthesis System

The Sinsy block diagram is taken from their paper: Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System, by Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda, EEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 2803-2815, 2021. The method is quite complex and consists of several models. It’s not clear (to me, yet) if the Casio approach has all elements of the Sinsy approach. I recommend reading the paper, BTW; it’s well-written and highly technical.

Casio U.S. Patent 10,789,922 vocal synthesis

The next block diagram is taken from Casio’s U.S. Patent number 10,789,922 awarded September 29, 2020. Their approach is separated into a training phase and a synthesis (playing) phase. You’ll notice that Casio employ only an acoustic model. The patent discloses a “Voice synthesis LSI” unit, so their software may have a hardware assist. We’ll need to take a screwdriver to the CT-S1000V to find out for sure!

A picture is worth a thousand words. A technical diagram, however, requires a little interpretive context. 😉 Paraphrasing the Casio patent:

The text analysis unit produces phonemes, parts of speech, words and pitches. This information is sent to the acoustic model. The acoustic model unit estimates and outputs an acoustic feature sequence. The acoustic model represents a correspondence between the input linguistic feature sequence and the output acoustic feature sequence. Acoustic feature sequence includes:

  • Spectral information modeling the vocal tract (cepstrum MEL coefficients, line spectral pairs, or similar).
  • Sound source information modeling vocal chords (fundamental pitch frequency (F0) and power value).

The vocalization model unit receives the acoustic feature sequence. It generates singing voice inference data for a given singer. The singing voice inference data is output through a digital-to-analog converter (DAC). The vocalization model unit consists of:

  • A sound source generator:
    • Generates a pulse train for voiced phonemes.
    • Generates white noise for unvoiced phonemes.
  • A synthesis filter:
    • Uses the output signal from the sound source generator.
    • Is a digital filter that models the vocal tract based on spectral information.
    • Generates singing voice inference data (AKA “samples”).
Casio U.S. Patent 10,789,922 vocal synthesis process detail

This rather complicated diagram from U.S. Patent 10,789,922 shows the synthesis phase in more detail. It shows the lyric string decomposed into phoneme and frame sequences. Each frame is sent to an acoustic model which generates an acoustic feature sequence, that is, the acoustic parameters that were learned during training. The acoustic parameters are synthesized (vocoded) into 255 samples. Each frame is about 5.1 msec long.

Casio CT-S1000V Vocal Synthesis (User Guide)

Well, if the second patent diagram was TMI, here is the block diagram from the Casio CT-S1000V user guide. The simplified diagram is quite concise and accurate! You should be able to relate these blocks directly back to the patent.

I hope this discussion is informative. In a later post, I’ll take a look at a few practical details related to Casio CT1000V Vocal Synthesis.

Copyright © 2022 Paul J. Drongowski

Casio CT-S1000V: Quick tips

One feels “all thumbs” when starting in with a new keyboard. The Casio CT-S1000V has a lot of functionality and customization below the MENU button and within the SETTINGS item in the main MENU. I made a map to help me get around:

MENU                      SETTINGS 
My Set-up Transpose
Active DSP Touch Off Velocity
Balance Split Point
Octave Shift Rhythm Auto Set
Sustain Chord Finger Mode
Portamento Rhythm Controller Type
Pedal SUS/UPPER PORT Button
Pitch Band ARP/AH Button
Knob Rhythm Volume
Arpeggio Song Volume
Auto Harmony Tuning
Sampling Surround
Song Audio In Center Cancel
Metronome MIDI OUT Channel UPPER1, UPPER2, LOWER
System Effects Local Control
EQ MIDI SYNC Mode
Scale Auto Power Off
MIDI Control Battery
Wireless LCD Contrast
Media Button Long Press Time
Home Customization Speaker
Settings >>>>>>>>>>>> Phone Speaker
Demo Setting Initialize
Exit All Initialize
Version

The Casio CT-S500 is probably organized in the same way.

MY SETUP Power On Recall

Yesterday, I mentioned MY SETUP and how useful it is for establishing a global set-up for a given playing situation. It’s also useful for establishing an initial set-up during power-up. Simply enter MY SETUP, select one of the four set-up entries, and press the AT PW-ON soft button. The CT-S1000V will recall the selected set-up during power-ups.

Even though it’s cool to get kicked into Vocal Synthesis at power-up — a nice marketing/sales ploy — I’d rather have a B-3 at my fingertips. 🙂

Active DSP HOLD

Active DSP assigns effect parameters to the three front panel knobs: K1, K2 and K3. The “Amp Organ 1” tone, for example, assigns the knobs this way:

  • K1: M1 Speed
  • K2: M1 OD Gain
  • K3: M1 Brake

K1 is the rotary speaker speed, K2 is the overdrive and K3 is the speaker break which stops simulated rotor/horn.

That’s great until you hit HOME or MENU and — what the??? The knobs are re-assigned to cut-off, resonance and modulation. That’s when Active DSP HOLD comes into play.

If you press the Active DSP HOLD soft button before leaving the Active DSP screen, the CT-S1000V will remember the DSP knob assignments when you go HOME or whatever. Save this in your set-up, too.

Slow your roll, Sparky

The first time your spin up the rotary in “Amp Organ 1,” you’ll be appalled at the short ramp-up time (acceleration) and the final rotor/horn speed. The Active DSP screen is also your way into the DSP parameters. I have the Drive Rotary effect applied to the organ. It has the following parameters:

  • Rotary speaker type
  • Overdrive gain
  • Overdrive level
  • Speed
  • Brake
  • Fall acceleration (ramp down)
  • Rise acceleration (ramp up)
  • Slow rate
  • Fast rate
  • Vibrato/Chorus
  • Wet level
  • Dry level

I like the sound of a beat-up Leslie with slow motors and slipping belts. Feel free to adjust the acceleration and slow/fast rates down.

Nice to see the chorus/vibrato simulation (V1, C1, V2, C2, V3, C3). The Hammond had a unique chorus/vibrato scanner unit which is a necessary component of gospel organ registrations. I’d love to see more details about Casio’s rotary speaker emulation including the scanner and speaker types.

Oh, yeah, please let us assign rotary speaker speed to the foot pedal. Thanks.

Boing

I wish I could see more details about the reverb, chorus and delay effects, too.

CT-S1000V has three system-wide effects: reverb, chorus and delay. Usually you get only reverb and chorus, and don’t always see a separate delay unit. Cool.

Unlike the DSP effects, you do not get to tweak system-wide effect parameters. All you get are presets with rather uninformative names like Room 1, Hall 2, etc. I listened to the room reverbs and settled on Room 2 for church registrations. Although ears should be the final judge, I wish I could see the parameter values behind the presets in order to make good choices.

I recommend publishing an effect routing diagram like the one I found in the CT-S5000 manual (below). Thanks.

Casio CT-X3000/CT-X5000 effect routing diagram [Casio]

Copyright © 2022 Paul J. Drongowski

Casio CT-S1000V: Observations

Gonna post a few notes while I take an ear break.

I’m rather pleased with the sound and play-ability of the Casio CT-S1000V. For now, I’m focused on sound design and playing, having only dipped into the auto-accompaniment rhythms and vocal synthesis.

Patches

Sounds, not the song. (Sorry, Clarence Carter.)

My first order of business is building a bunch of sound combinations that are suitable for the contemporary and traditional church music that I play. Jazz, funk and pop will have to wait a little while…

The CT-S1000V provides two different means of storing a patch: My Set-up and registrations. My Set-up is accessed through the main MENU button. Up to four set-ups can be stored. As I quickly discovered, My Set-up stores everything but the kitchen sink including settings like speaker ON/OFF. I sacrificed the fourth location (SAX) and created a LINE OUT entry with the internal speaker turned off. This seems like a good use for My-Setup, namely, saving global configurations for different playing situations, e.g., home, gig, etc.

Registrations are more appropriate for tone (voice) programming. There’s more registrations than My-Setup locations: 16 banks with four registrations per bank, 64 registrations total. That may seem stingy by today’s standards, but I don’t need more than 8 to 16 locations to cover most of my gig needs. Plus, one can always save registrations to a USB flash drive and load them as playing situations arise.

Tone programming

Registrations can save most everything related to tone programming: split, layer, effects, and much more. Yes, you can edit CT-S1000V tones — one reason why I passed on the CT-S1 and waited.

Tone editing is similar to “quick edit” that you might find on a synth. You can tweak 21 parameters including cutoff, resonance, attack time, release, vibrato, volume, pan, effect sends and 4-band EQ. You don’t get synthesizer-level deep editing. Cutoff, etc. are offsets (-64:+64) from the preset value. If you want it all in front of you like MODX or Montage, this isn’t the droid you’re looking for.

With only three front panel knobs, you need to assign a tone parameter to a knob first, then tweak. The changed value is saved along with everything else in a registration (including the knob assignment).

DSP editing, on the other hand, is deep. Using a feature called “Active DSP”, you can choose an effect type, assign a parameter to a knob, and tweak effect parameters. I’m still experimenting with Active DSP, especially for controlling rotary speaker speed. I’ll have more to say when I have a better grip on Active DSP.

Splits and layers

The CT-S1000V works logically and supports two split zones: Lower and Upper. When Split is turned off, all’s you get is Upper. Upper supports two layers: Upper 1 and Upper 2; Lower cannot be layered. The split point is configurable and you can adjust the balance (level) between tones. This is just enough to be dangerous. If you’re looking for a pile of layers, move along.

Starting with my Roland days (circa 1995), I’ve kept notes about the most useful tone combinations for contemporary church music. Here are my favorite combinations:

               Tone 1       Tone 2       Tone 3        Tone 4 
----------- ----------- ------------ ------------
High School Tuba A Celesta Flute 1A Clarinet A
Warm Tp Sect Tbs mp C Tbs mp B Tbs mf C Brs LipNzl
CTp + Tb Sect C Tps mp A Tbs mp A C Tps f A Tbs f A
Horn+Wood Flute 1A Clarinet C Oboe mf A Horns mf A
NobleHornPop Horns f A Flugel C Tb Sect B Trumpet 1C
NobleHornPop French 1C Flugel C Tb Sect B Trumpet 1C

Orch Reeds Oboe mf A E.Horn C Oboe f A
Wind&Str1 Oboe mf A Flute 1B DolceStr.A JV Strings A
Wood Sect Oboe mf A Flute 1A Clarinet A E.Horn A
Flute/Clari Flute 1C Clarinet C

ChamberWinds Oboe mf B Oboe mf A Sop.Sax mf A Flute 1A
ChamberWoods Clarinet A Flute 1C Flute 1A

Warm Strings Soft Pad A F.Str mp A JP Strings2C JP Strings1A
ChmbrQuartet Violin C Violin 2 A Cello A Cello 2 A
ViolinCello Vc mp B Bassoon A Va mp A Oboe mf A

These combinations date back my old JV-90, XP-60 and XV-5050! You’ll find equivalents on my MODX and Genos. My task now is to build similar combinations (registrations) on CT-S1000V.

BTW, I’m also dialing back reverb where necessary. I try for a happy balance — not too dry for practice at home, but not so much as to murk up the sound in a reverberant church hall.

If I had one wish, I would like to give each registration a name. I have a running map of “which registration does what,” but wish the names appeared on the CT-S1000V screen, not “Bank 1-1”.

Tones for old bones

By and large, the CT-S1000V orchestral tones are decent; most of them are musically useful and sound good through both the built-in speaker and an external monitor. The tone parameters are enough to cure overly bright tones or sharp attacks. Although I haven’t worked with them yet, the chromatic percussion (celeste, glockenspiel, etc.) don’t have any obvious tuning issues and are musical.

Two layers aren’t much. Fortunately, CT-S1000V has a few preset combi tones — Brass & String, Violin Section, Chamber (orchestra), Flute & Oboe, Pipe Section — which provide another “layer” or two on the cheap.

Don’t forget about the ethnic voices. CT-S1000V has accordions, fiddle, and harmoniums. Harmoniums! Jazzers who want to get their Jon Batiste on should look to these for melodica.

Organs

The CT-S1000V pipe organs are decent. Yes, there is the usual over-done reedy sound, but there are three tone presets that are suitable for hymn-playing and congregational singing. Even though the keybed is squared-off and similar to piano keys, it has a nice resistance and allows the legato-like gestures one uses when playing a pipe organ.

The drawbar organ tones are serviceable. No, we’re not in clone territory here and you can’t change the drawbar settings. There are rotary speaker effect algorithms, but again, we’re not in Vent or clone territory quality-wise. The only way to change rotary speaker speed is via Active DSP and turning the knob to which rotary speed has been assigned. I wish there was a way to assign rotary speed to the foot pedal. (OK, I need two wishes from the Genie.)

On the up-side, the rotary effect has a brake setting. One could brake the rotary and put the LINE OUT into a Lester K, Vent, or whatever. I will be giving this a try and will post notes. With Lester on the floor, I could stomp on a foot switch and change rotary speed, too.

Summary

Well, I hope these observations are helpful! The Casio CT-S1000V has a lot of sound-making value for very little money. So many tones to try…

Copyright © 2022 Paul J. Drongowski

Casio CT-S1000V: First impressions

After test driving the Casio CT-S1 and CT-S410, I took the plunge and bought a Casio CT-S1000V (AiX Sound Source with Vocal Synthesis, $450USD street). The price was irresistible after making a trade-in. (The Yamaha SHS-500 Sonogenic retired.)

In terms of build quality, the Casio CT-S1000V is robust enough for light to moderate gigging. It feels solid. I miss the fabric speaker covering (Casio CT-S1) as it is a touch of class. I suspect that fabric would get dirty on gigs, however. I wouldn’t park any drinks on this keyboard (or any keyboard) with everything exposed! Yep, it weighs ten pounds, not bad for a keyboard with in-built speakers.

Casio CT-S1000V

The power supply is a small lump-in-the-middle brick. The mains lead is rather short with one of those “figure 8” IEC 60320 C7 plugs. Other accessories include a music stand and a Casio WU-BT10 Bluetooth dongle — don’t lose that tiny little bugger! The music stand isn’t super-robust and I’m not sure that I want to park a heavy binder o’tunes on it. It’s also too low for my reading glasses and I will probably stick to my usual tripod music stand.

The CT-S1000V keybed is rather nice for a keyboard in this price range. The keys are squared off and piano-like although there’s no hammer simulation, of course. The keys are evenly spaced, are level, and don’t wobble too much. The keys have a textured surface similar to the Roland GO:KEYS. The throw is a little bit light and soft, not unpleasant. (BTW, I couldn’t stand the Roland GO:KEYS and returned it due to keybed issues.)

I can hand-swipe without cutting my hands. I don’t know how the keys will stand up to this kind of abuse in the long run. Plus, this board is so light, I’m afraid of throwing it off the keyboard stand when swiping!

The speaker sound is OK. I regard the speakers as “courtesy speakers.” Sometimes it’s convenient to push only one switch and start playing. They’re loud enough for my studio room, maybe loud enough for the church gig where we don’t generate a lot of stage volume. They don’t get buzzy at loud volume. Since I don’t play at very loud volume at home, I’m good with that. Casio wisely blessed the CT-S1000V with 1/4″ stereo output jacks so I can send the CT to the church PA.

I read just enough of the manual to enable Active DSP, which assigns DSP parameters to the knobs. With an organ tone selected, turning knob 1 (K1) switched between slow and fast rotary speaker speed. Wish there was a way to assign rotary speed to a button or the foot switch… I need to experiment more with Active DSP. Gotta experiment with splits and layers, too. I guess everything is saved to a registration, but we’ll find out!

I played with Vocal Synthesis enough to know there are multiple Vocalists. Some Vocalists are more natural than others. One of the Vocalists is “Death Voice” and I would like to uncork that one in church. 🙂

Quite a playable instrument. I haven’t listened to any of the rhythms yet because I’m mostly interested in flat-out playing. Switching sections (intro, main, fill, etc.) with the buttons below the display reminds me of switching arpeggios on the Yamaha MOX/MOXF.

Hope these impressions help!

Copyright © 2022 Paul J. Drongowski

Ye olde Yamaha Dance Kit

Ya learn somethin’ every day. Thanks for to Mark — my neighborhood to the north in Vancouver — who looped me in.

As one might expect, Yamaha have updated their drum kit samples over the years. Who knew — the DanceKit circa 2000 is more heavy, punchy and analog than present-day DanceKit. According to Mark (and Musicnik), the Standard Kit had more punch back in the day.

The table below summaries the instruments in the Yamaha Standard Kit and Dance Kit:

                    Standard Kit      Dance Kit 
Keyboard MIDI 127/000/001 127/000/28
-------- -------- ---------------- ---------------
40 E 1 28 E 0 Brush Tap Swirl Reverse Cymbal *
41 F 1 29 F 0 Snare Roll Snare Roll
42 F# 1 30 F# 0 Castanet Hi Q 2 *
43 G 1 31 G 0 Snare Soft Snare Techno *
44 G# 1 32 G# 0 Sticks Sticks
45 A 1 33 A 0 Bass Drum Soft Kick Techno Q *
46 A# 1 34 A# 0 Open Rim Shot Rim Gate *
47 B 1 35 B 0 Bass Drum Hard Kick Techno L *
48 C 2 36 C 1 Bass Drum Kick Techno 2 *
49 C# 2 37 C# 1 Side Stick Side Stick Analog *
50 D 2 38 D 1 Snare Snare Clap *
51 D# 2 39 D# 1 Hand Clap Hand Clap
52 E 2 40 E 1 Snare Tight Snare Dry *
53 F 2 41 F 1 Floor Tom L Tom Analog 1 *
54 F# 2 42 F# 1 Hi-Hat Closed Hi-Hat Close Analog 1 *
55 G 2 43 G 1 Floor Tom H Tom Analog 2 *
56 G# 2 44 G# 1 Hi-Hat Pedal Hi-Hat Close Analog 2 *
57 A 2 45 A 1 Low Tom Tom Analog 3 *
58 A# 2 46 A# 1 Hi-Hat Open Hi-Hat Open Analog *
59 B 2 47 B 1 Mid Tom L Tom Analog 4 *
60 C 3 48 C 2 Mid Tom H Tom Analog 5 *
61 C# 3 49 C# 2 Crash Cymbal 1 Cymbal Analog *
62 D 3 50 D 2 High Tom Tom Analog 6 *
63 D# 3 51 D# 2 Ride Cymbal 1 Ride Cymbal 1
64 E 3 52 E 2 Chinese Cymbal Chinese Cymbal
65 F 3 53 F 2 Ride Cymbal Cup Ride Cymbal Cup
66 F# 3 54 F# 2 Tambourine Tambourine
67 G 3 55 G 2 Splash Cymbal Splash Cymbal
68 G# 3 56 G# 2 Cowbell Cowbell Analog *
69 A 3 57 A 2 Crash Cymbal 2 Crash Cymbal 2
70 A# 3 58 A# 2 Vibraslap Vibraslap
71 B 3 59 B 2 Ride Cymbal 2 Ride Cymbal 2
72 C 4 60 C 3 Bongo H Bongo H
73 C# 4 61 C# 3 Bongo L Bongo L
74 D 4 62 D 3 Conga H Mute Conga Analog H *
75 D# 4 63 D# 3 Conga H Open Conga Analog M *
76 E 4 64 E 3 Conga L Conga Analog L *
77 F 4 65 F 3 Timbale H Timbale H 7
8 F# 4 66 F# 3 Timbale L Timbale L
79 G 4 67 G 3 Agogo H Agogo H
80 G# 4 68 G# 3 Agogo L Agogo L
81 A 4 69 A 3 Cabasa Cabasa
82 A# 4 70 A# 3 Maracas Maracas 2 *
83 B 4 71 B 3 Samba Whistle H Samba Whistle H
84 C 5 72 C 4 Samba Whistle L Samba Whistle L
85 C# 5 73 C# 4 Guiro Short Guiro Short
86 D 5 74 D 4 Guiro Long Guiro Long
87 D# 5 75 D# 4 Claves Claves 2 *
88 E 5 76 E 4 Wood Block H Wood Block H
89 F 5 77 F 4 Wood Block L Wood Block L
90 F# 5 78 F# 4 Cuica Mute Scratch H *
91 G 5 79 G 4 Cuica Open Scratch L *

The starred (“*”) entries denote analog drum machine samples.

I decided to do a side-by-side comparison. I first recorded the DanceKit samples as dry as possible on the Yamaha PSS-A50 and the Yamaha QY-70 (circa 1997). Then I matched everything up, ignoring the toms and a few extraneous instruments.

You’ll hear all the PSS-A50 examples first followed by all of the QY70 examples. I’ll let you decide as to your personal preference. Although I tried to get the A50 dry, there seems to be a hint of reverb remaining.

Without further ado, here is a ZIP file containing the WAV for all of the Yamaha QY-70 Dance Kit instruments starting from the bottom of the keyboard to the top. Have fun! Slice and dice everything into audio mirepois.

If a drum machine plays in the forest and no one is around, does it still make a sound? 🙂

Copyright © 2022 Paul J. Drongowski