Lightspeed Ventures-backed audio platform Pocket FM introduced it has partnered with voice-cloning firm ElevenLabs to rapidly convert textual content content material, resembling script, into audio collection utilizing AI.
Pocket FM, which raised $103 million in Sequence D funding in March, instructed TechCrunch on the time that it was already experimenting with the flexibility to transform textual content content material into audio utilizing ElevenLabs‘ tech. Now, the India-based firm has expanded the partnership to make the conversion device accessible to all creators over the following few weeks.
Within the take a look at part, Pocket FM already produced 30,000 hours of audio collection utilizing ElevenLab’s AI tech. With the brand new roll-out, the startup expects to triple its content material library of over 100,000 hours of audio content material this yr. Pocket FM additionally mentioned that throughout the experimental part, the AI-powered instruments helped it minimize the price of producing audio by 90%.
Pocket FM’s co-founder and CTO Prateek Dixit instructed TechCrunch over a name that with this partnership, the corporate needs to make it simpler for writers to transform their writings into audio collection.
“We have over 250,000 writers (including the ones on the company’s Pocket Novel writing plaform) and this partnership decreases the cost of setting up and recording audio for them,” he mentioned.
“Even with a good set up of recording tools and equipment, writers can produce roughly 30 minutes of high-quality audio content per day. With the AI tools, this output can be 10 times more,” he added.
Pocket FM has constructed a device integrating ElevenLabs tech, via which it’s providing 50 voices for writers who wish to convert their content material. ElevenLabs’ co-founder Mati Staniszewski mentioned that his firm’s device understands the context of the writing and infers feelings via the voice robotically.
“Working with Pocket FM, we are deploying our newer models that understand the genre of writing and are emotionality better,” Staniszewski mentioned.
Dixit famous that based mostly on knowledge from customers’ engagement with this type of content material, the platform additionally plans to counsel voices that work properly for writers in a selected style.
Pocket FM shouldn’t be the one audio collection platform experimenting with AI-powered instruments. Google-backed Kuku FM is utilizing GPT-4, Claude, BandLab and even ElevenLabs to assist its writers with totally different levels of creation, together with refining script, producing thumbnails, including sound results and changing textual content into audio.
Kuku FM instructed TechCrunch that it’s also experimenting with utilizing visible technology instruments resembling Midjourney and Runway to create adverts associated to content material.
High quality of content material and impression on artists
The promise of AI-powered instruments is to generate extra content material sooner, however that doesn’t imply the content material is sweet. Pocket FM’s reply to aiding discovery and surfacing high quality content material is making its discovery algorithm refined and experimenting with person engagement.
“If a writer publishes an audio series, we surface that content to a select number of users and observe engagement metrics. If these metrics are positive, we further propagate that,” Dixit mentioned.
Kuku FM mentioned it’s working with its high quality management crew to make sure solely high-quality content material is promoted on its app, even when creators have used AI within the course of.
“We realized the importance of having a human Quality Control team at the center of our decision-making when it comes to audio content production. We have developed a core team of Content Producers who have high ownership & authority on the artistic standards,” the corporate’s co-foudner and CEO Lal Chand Bisu mentioned.
Using AI might result in faster outcomes and a much bigger content material library for these platforms, however it would additionally scale back the roles of voiceover artists working with them. India’s Affiliation of Voiceover Artists (AVA) has expressed its considerations about AI taking up.
“If AI takes over, we are finished. As voice artists, we need to get some regulation in place so that our livelihood is protected,” Amarinder Singh Sodhi, the affiliation’s normal secretary, instructed Indian publication Scroll.
Sodi additionally instructed Scroll about incidents the place voiceover artists have been known as into the studio to report samples to coach AI with out acquiring their consent or informing them.
“On an emotional level, it scares me. By using AI, you are essentially diluting the human experience of storytelling. You lose out on an emotional connection,” Delhi-based voiceover artist Aditya Mattoo instructed TechCrunch.
He added that giving entry to premium voices to individuals who don’t have the style and ability to supply high quality content material will result in the market getting flooded by dangerous content material.
Voice artists in different elements of the world have additionally raised considerations about AI impacting their jobs. And regardless of working with among the AI corporations, they really feel uncomfortable about their voices being altered.
Once we requested in regards to the impression of AI-powered voice technology on Pocket FM, the corporate didn’t immediately reply the query. Nonetheless, Dixit famous that engagement with AI-generated content material in its experiments is “as good as human voiceover production.” Notably, the corporate can also be engaged on expertise to include a number of voices in a single audio output.
Each Pocket FM and Kuku FM don’t at present label their content material to point if AI has been used within the creation course of.