Technology

Audio platform Pocket FM uses AI tools to help it expand its content catalog

Published

6 months ago

November 27, 2024

IAM

India based audio platform Pocket FM has over 200,000 hours of content on the web site. However, the corporate’s CEO Rohan Nayak believes that the platform still has room to grow when it comes to creating original content and expanding its library to include multiple genres and sub-genres. The fastest way to achieve that is to use artificial intelligence tools that may help produce audio, write strategies, and tailor these stories to different geographies.

“I still believe that our content catalog is not sufficient for our users. There are so many genres and subgenres that we don’t have in our library. I don’t think we have the depth of content that would fall under the adult entertainment category,” Nayak told TechCrunch over the phone.

The company has already established an excellent partnership with ElevenLabs to transform texts into audio series. This resulted in 5x faster production and 30x lower cost compared to professionally generated audio series.

“We have already tested how these AI adaptations perform in various markets and have seen encouraging results. We are still refining our models for errors, but we believe the technology is good enough to be used in program production,” Nayak said.

One of the AI tools Pocket FM is trying out allows stories to be adapted across regions. The company says it has trained internal models that do not do easy translation, but handle cultural nuances when transforming stories from one region to one other.

He added that the difficult task is to solve the models’ hallucinations within the context of stories spanning a whole lot of episodes. Pocket FM found that it had to address the restrictions of context windows of open source models, in addition to construct maps of the relationships between different entities within the story to maintain character consistency.

Another tool the corporate offers for writers is to test their work as a creative assistant, helping them create alternative stories or giving them plot ideas. The company also plans to use some insights from historical data within the tool to show authors what’s working on the platform.

Nayak mentioned that while the tool is in its early stages of development, the corporate wants to open up a author’s room to a solo author who might post an episode on a single day. He noted that a author’s room allows you to brainstorm without having to control your creativity, and that’s the predominant idea of this writing assistant.

Additionally, the corporate is investing in creating a success engine that might be based on insights from the platform about which shows develop into hits.

Pocket FM’s ultimate goal is to scale its catalog, on condition that it produces some content itself and produces programming through its network of writers. But to scale and gain popularity, it needs to create hit shows.

“Blockbusters power every content platform. While we have a good start to the funnel with user-generated content, blockbusters are still hard to come by.”

Pocket FM achieved encouraging results thanks to the implementation of artificial intelligence. It has over 40,000 series on the platform with the help of artificial intelligence in voice creation. What’s more, the corporate generated $3 million in revenue from them. Overall, the platform earned $127 million in fiscal yr 2024.

The company’s most difficult challenge is finding the appropriate balance between AI helping creators and creating content quickly. There is at all times a risk that individuals will use AI to speed up content production and degrade quality. As a result, the platform becomes stuffed with mediocre content and it becomes difficult for the algorithms to distinguish good programs.

Puneet Sharma, a author and lyricist based in India, identified that in a world where a lot work is formulaic, the onus might be on artists to prove the authenticity of their work.

Sharma added that AI tools can help writers generate ideas and learn different styles. However, because of this learning could also be lost through failure within the means of using these tools.

Nayak said some authors and creators are already using AI tools. The company’s idea is to provide tools together with the context of the story and a platform.

Pocket FM has raised $197 million in multiple rounds from backers including Lightspeed Ventures, Tencent and Times Internet. The company competes on multiple fronts with other players resembling Audible, Omidyar Network-backed Pratilipi and Google-backed Kuku FM.

This article was originally published on : techcrunch.com

Technology

The next large Openai plant will not be worn: Report

Published

16 hours ago

May 22, 2025

IAM

Sam Altman speaks onstage during The New York Times Dealbook Summit 2024.

Opeli pushed generative artificial intelligence into public consciousness. Now it might probably develop a very different variety of AI device.

According to WSJ reportThe general director of Opeli, Altman himself, told employees on Wednesday that one other large product of the corporate would not be worn. Instead, it will be compact, without the screen of the device, fully aware of the user’s environment. Small enough to sit down on the desk or slot in your pocket, Altman described it each as a “third device” next to MacBook Pro and iPhone, in addition to “Comrade AI” integrated with on a regular basis life.

The preview took place after the OpenAI announced that he was purchased by IO, a startup founded last 12 months by the previous Apple Joni Ive designer, in a capital agreement value $ 6.5 billion. I will take a key creative and design role at Openai.

Altman reportedly told employees that the acquisition can ultimately add 1 trillion USD to the corporate conveyorsWearing devices or glasses that got other outfits.

Altman reportedly also emphasized to the staff that the key would be crucial to stop the copying of competitors before starting. As it seems, the recording of his comments leaked to the journal, asking questions on how much he can trust his team and the way rather more he will be able to reveal.

(Tagstotransate) devices

This article was originally published on : techcrunch.com

Technology

The latest model AI Google Gemma can work on phones

Published

2 days ago

May 20, 2025

IAM

It grows “open” AI Google, Gemma, grows.

While Google I/O 2025 On Tuesday, Google removed Gemma 3N compresses, a model designed for “liquid” on phones, laptops and tablets. According to Google, available in a preview starting on Tuesday, Gemma 3N can support sound, text, paintings and flicks.

Models efficient enough to operate in offline mode and without the necessity to calculate within the cloud have gained popularity within the AI community lately. They will not be only cheaper to make use of than large models, but they keep privacy, eliminating the necessity to send data to a distant data center.

During the speech to I/O product manager, Gemma Gus Martins said that GEMMA 3N can work on devices with lower than 2 GB of RAM. “Gemma 3N shares the same architecture as Gemini Nano, and is also designed for incredible performance,” he added.

In addition to Gemma 3N, Google releases Medgemma through the AI developer foundation program. According to Medgemma, it’s essentially the most talented model to research text and health -related images.

“Medgemma (IS) OUR (…) A collection of open models to understand the text and multimodal image (health),” said Martins. “Medgemma works great in various imaging and text applications, thanks to which developers (…) could adapt the models to their own health applications.”

Also on the horizon there may be SignGEMMA, an open model for signaling sign language right into a spoken language. Google claims that Signgemma will allow programmers to create recent applications and integration for users of deaf and hard.

“SIGNGEMMA is a new family of models trained to translate sign language into a spoken text, but preferably in the American sign and English,” said Martins. “This is the most talented model of understanding sign language in history and we are looking forward to you-programmers, deaf and hard communities-to take this base and build with it.”

It is value noting that Gemma has been criticized for non -standard, non -standard license conditions, which in accordance with some developers adopted models with a dangerous proposal. However, this didn’t discourage programmers from downloading Gemma models tens of tens of millions of times.

(Tagstransate) gemma

This article was originally published on : techcrunch.com

Technology

Trump to sign a criminalizing account of porn revenge and clear deep cabinets

Published

3 days ago

May 19, 2025

IAM

President Donald Trump is predicted to sign the act on Take It Down, a bilateral law that introduces more severe punishments for distributing clear images, including deep wardrobes and pornography of revenge.

The Act criminalizes the publication of such photos, regardless of whether or not they are authentic or generated AI. Whoever publishes photos or videos can face penalty, including a advantageous, deprivation of liberty and restitution.

According to the brand new law, media firms and web platforms must remove such materials inside 48 hours of termination of the victim. Platforms must also take steps to remove the duplicate content.

Many states have already banned clear sexual desems and pornography of revenge, but for the primary time federal regulatory authorities will enter to impose restrictions on web firms.

The first lady Melania Trump lobbyed for the law, which was sponsored by the senators Ted Cruz (R-TEXAS) and Amy Klobuchar (d-minn.). Cruz said he inspired him to act after hearing that Snapchat for nearly a 12 months refused to remove a deep displacement of a 14-year-old girl.

Proponents of freedom of speech and a group of digital rights aroused concerns, saying that the law is Too wide And it will probably lead to censorship of legal photos, similar to legal pornography, in addition to government critics.

(Tagstransate) AI

This article was originally published on : techcrunch.com