Listen to the evolution of speech synthesizers from 1939-1985

Originally published at: Listen to the evolution of speech synthesizers from 1939-1985 | Boing Boing

3 Likes

from 1939-1985…

Well, did evolution stop in 1985? Because that would answer a lot of questions.

2 Likes

mine did.

4 Likes

Here ya go, Andrea: just sing & dance around. I’m doing that Right Now!

4 Likes

In the '90s I worked for a speech and hearing research company in Cambridge co-founded by Ken Stevens, who had worked extensively with Dennis Klatt.

One of our products was a PC-based speech synthesizer based on Klatt’s KLSYN88. Programming the synthesizer could be difficult, because it took ~40 time-varying parameters as input and hand-tweaking that much data was a pain. Ken’s big innovation on the synthesizer was a system that reduced the number of parameters to ten, and used the constraints of the vocal tract to produce all of the Klatt parameters from those ten.
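For anyone curious what “ten parameters driving ~40” might look like, here’s a toy sketch of the idea. To be clear, this is my own illustration, not Ken’s actual mapping or the HLsyn/KLSYN88 code: the parameter names, the formulas, and the constraints are all invented just to show how a small articulatory frame can be expanded into a larger set of Klatt-style acoustic parameters.

```python
# Purely illustrative sketch (not the actual Stevens/Klatt code):
# a small "high-level" frame is expanded into a larger set of
# Klatt-style parameters using crude vocal-tract-inspired rules.
# All names and formulas here are hypothetical.

from dataclasses import dataclass

@dataclass
class HighLevelFrame:
    f0: float            # fundamental frequency, Hz
    jaw_open: float      # 0..1, rough degree of jaw/mouth opening
    tongue_front: float  # 0..1, tongue body front/back position
    glottal_area: float  # 0..1, more open glottis -> breathier, less voiced

def expand_to_klatt(frame: HighLevelFrame) -> dict:
    """Map a few high-level parameters to many low-level ones.

    A real system derives dozens of time-varying Klatt parameters
    (formant frequencies/bandwidths, source amplitudes, etc.) from
    vocal-tract constraints; this toy version only shows the shape
    of the problem.
    """
    # Formants rise with jaw opening (F1) and tongue fronting (F2/F3).
    f1 = 300 + 500 * frame.jaw_open
    f2 = 900 + 1300 * frame.tongue_front
    f3 = 2500 + 200 * frame.tongue_front
    # Wider glottal opening -> more aspiration, less voicing.
    av = max(0.0, 60 * (1.0 - frame.glottal_area))  # voicing amplitude, dB
    ah = 60 * frame.glottal_area                    # aspiration amplitude, dB
    return {
        "F0": frame.f0,
        "F1": f1, "B1": 60 + 40 * frame.jaw_open,
        "F2": f2, "B2": 90,
        "F3": f3, "B3": 150,
        "AV": av, "AH": ah,
        # ...a real Klatt frame has many more entries
    }

if __name__ == "__main__":
    # One frame of a rough /a/-like vowel.
    print(expand_to_klatt(HighLevelFrame(f0=120, jaw_open=0.8,
                                         tongue_front=0.3, glottal_area=0.1)))
```

The appeal of that kind of design is exactly what made hand-tweaking painful in the first place: the ten high-level values stay physically interpretable, and the synthesizer’s many knobs are guaranteed to move together in ways a real vocal tract could produce.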

We had a demo audio file of Ken speaking the first two lines of The Raven, and a file that used Ken’s high-level system to produce a synthesized version of Ken’s voice saying the same thing. IIRC at some point we mixed the two files up and nobody noticed for months. It was a really good system.

3 Likes

@AndreaJames
For me, one thing of interest is that the developers of the various speech engines skewed the speech to follow an accent! Some seem more British, others different.

Apart from the digital artifacts of the low bit rate, it’s the sliding or glissando between pitches that seems to be the main problem. I was wondering if this was on their radar when creating this magic.

Of course, just at the end of the video’s time frame, speech synthesis was beginning to be available on the desktop.

2 Likes

I still have my ancient RS232 text-to-speech card.

Maybe I’ll fire it up again, but it really was kind of awful. ETA: Besides, my Pis do better:

https://elinux.org/RPi_Text_to_Speech_(Speech_Synthesis)
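If anyone wants to try the Pi route, the simplest option on that page is eSpeak. Something like this sketch works for me, assuming the espeak package is installed (`sudo apt install espeak`); exact flags can differ a bit between espeak and espeak-ng builds:

```python
# Minimal sketch: drive eSpeak from Python on a Pi via subprocess.
# Assumes the espeak command-line tool from the linked eLinux page
# is installed and on the PATH.
import subprocess

def say(text: str, wpm: int = 150, pitch: int = 50) -> None:
    """Speak `text` aloud using the espeak command-line tool."""
    subprocess.run(["espeak", "-s", str(wpm), "-p", str(pitch), text],
                   check=True)

if __name__ == "__main__":
    say("It really was kind of awful, but the Pi does better.")
```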

2 Likes

They are emulating the Transatlantic accent!

This is some sort of malarkey of which you speak; does it cut the mustard?!

With regard to speech synthesis and digital speech recognition/production in general, this topic highlights a certain ‘Western’ bias as the ‘baseline’… I’m thinking of languages that rely on tonal and pitch inflections, for example Vietnamese.

Just now wondering about the impact of diverse cultures and languages on modern tech, and vice versa?

2 Likes

This topic was automatically closed after 5 days. New replies are no longer allowed.