VUX World


All about conversational commerce with Charlie Cadbury

Ep. 28

In this episode, we take a deep dive into conversational commerce: what it is, what's possible and how you can turn conversing strangers into paying customers.

Our guest

Charles Cadbury is the co-founder of Say It Now, a company that helps brands respond to the growing consumer need for immediacy. Charlie's history is impressive. He's seen more than 1,000 client briefs and delivered over 300 digital projects, many of them related to commerce. After working with Lola Tech to create the Dazzle platform, Charlie's attention remains focused on conversational interactions and helping brands convert conversations into commerce.

Check out the Say It Now website

Follow Charles on Twitter

More Episodes


What is text-to-speech and how does it work with Niclas Bergström

Every voice assistant needs three core components: Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Text-to-Speech (TTS). We've already covered what Automatic Speech Recognition is and how it works with Catherine Breslin and, in this episode, we're covering the last of the three, text-to-speech.

To guide us through the ins and outs of TTS, we're joined by Niclas Bergström, a TTS veteran and co-founder of one of the largest TTS companies on the planet, Readspeaker.

Text-to-speech is the technology that gives voice assistants a voice. It's the thing that produces the synthetic vocal sound that's played from your smart speaker or phone whenever Alexa or Siri speaks. It's the only part of a voice assistant that you'd recognise. The other core components, ASR and NLU, are silent.

And, given how we're hard-wired for speech (a baby can recognise its mother's voice from the womb), how your voice assistant or voice user interface (VUI) sounds is one of the most important parts of it.

A voice communicates so much information without us necessarily being aware. Just from the sound of someone's voice, you can infer gender, age, mood, education, place of birth and social status. From the sound of someone's voice, you can decide whether you trust them.

With voice assistants, voice user interfaces, or any hardware or software that speaks, choosing the right voice is imperative.

Some companies decide on a stock voice: one of Readspeaker's 90 voices or perhaps Amazon Polly. Others create their own bespoke voice that's fit for their brand.

We see examples of Lyrebird's voice cloning and we hear Alexa speak every day, so it's easy to take talking computers for granted. Because speaking is natural and easy for us, we assume that it's natural and easy for machines to talk. But it isn't.

So in this episode, we're going to lift the curtain on text-to-speech and find out exactly how it works. We'll look at what's happening under the hood when voice assistants talk and see what goes into creating a TTS system.

Readspeaker is a pioneering voice technology company that provides lifelike Text to Speech (TTS) services for IVR systems, voice applications, automobiles, robots, public service announcement systems, websites or anywhere else.

It's been in the TTS game for over 20 years and has in-depth knowledge and experience in AI and Deep Neural Networks, which it puts to work in creating custom TTS voices for the world's biggest brands.

Links

Visit Readspeaker.com to find out more about TTS services

And Readspeaker.ai for more information on TTS research and samples

Voice technology and music with Dennis Kooker and Achim Matthes

Music has been the top use case on smart speakers pretty much from the beginning. Having any song you like at your beck and call makes playing music around the house easier than ever. And households that play music out loud are, apparently, happier households.

It doesn't require too much thought, either. So, discoverability isn't as much of a challenge as with skills, actions and services. If you want to play some Michael Jackson, just ask.

Having said that, music consumption habits are advancing. According to Pandora, more people are listening to up-beat exercise music during lockdown, presumably to work out to, given that the gyms are shut. And more people are listening to ambient music, too, as well as child-friendly playlists. People are spending time at home and using their music service to relax and to entertain the kids, respectively.

And there's a growing trend moving away from listening to artists and towards listening to playlists: compilations of different tunes grouped around a theme. And with smart speakers, we're getting an insight into people's contexts from the music they ask to play. For example, 'play BBQ music' might not be something you'd try and find on Spotify, but you might ask for it from your smart speaker.

In the age of playlists, mood music and music on demand, how does a record label make sure that its catalogue of music is found and played on smart speakers? Well, that's what we're going to find out in this episode.

In this episode: voice strategy at Sony Music

We're joined by Dennis Kooker, President, Global Digital Business and US Sales, and Achim Matthes, Vice President, Partner Development, at Sony Music Entertainment.

Dennis and Achim walk us through how Sony Music is thinking about voice, some of the behavioural trends they're seeing play out, how they make sure that, when you ask for a Sony artist's song, you get what you've asked for, what's involved in music discoverability, and where they see music and voice heading in future.