Google created an A.I. tool that transforms hums and text into actual music: ‘This is bigger than ChatGPT to me’

Fortune· Jonathan Raa—NurPhoto via Getty Images
In this article:

The rapid rise of OpenAI's artificial intelligence chatbot ChatGPT has left many wondering what else will be changed by generative A.I. tools. If a Google research paper released this week is anything to go by, songwriting will be—and perhaps the music industry.

The paper describes a tool called MusicLM that “can transform whistled and hummed melodies according to the style described in a text caption.” It can also generate “high-fidelity music from text descriptions such as ‘a calming violin melody backed by a distorted guitar riff.’”

On the paper’s website, examples show results generated by the tool. In one instance, somebody hums “Bella Ciao,” an Italian folk song from the late 19th century. Then, based on that, the tool generates music with various instruments and styles, including guitar solo, string quartet, and jazz with saxophone.

https://twitter.com/bleedingedgeai/status/1619081383477137408

"Whoa, this is bigger than ChatGPT to me. Google almost solved music generation, I'd say," tweeted Keunwoo Choi, an A.I. scientist at Gaudio Lab, an A.I. audio technology company.

“Think of MusicLM as the ChatGPT for music,” tweeted entrepreneur Martin Uetz, adding, “I can't wait for this to go mainstream.”

Generative A.I. vs. artists

Less eager might be musicians who’ve spent decades mastering their instruments, just as illustrators and graphic artists have been angered by A.I. tools that create impressive images from mere text prompts.

Among those A.I. art tools are Midjourney, Stable Diffusion, and DALL-E 2. One man recently used Midjourney to illustrate a children’s book. Impressed with the tool, he shared his experience on social media—and was stunned by the backlash from illustrators. And last year, an image generated with Midjourney won a prize at an art festival, which also angered artists.

The problem artists have with such tools is that they train themselves on a massive collection of digitized artworks without consent. A lawsuit recently filed in San Francisco by working artists describes Stable Diffusion and Midjourney as “collage tools that violate the rights of millions of artists.”

Indeed, copyright concerns are keeping Google AI from releasing MusicLM to the public. But startups might be more willing to release such technology into the wild.

Not that Big Tech isn’t also plowing resources into generative A.I.

DALL-E is offered by ChatGPT maker OpenAI. Microsoft is investing billions into OpenAI and will use its technology in a wide variety of products, including the Bing search engine. That in turn has lit a fire under Google parent Alphabet, which is working on similar tools to answer the challenge.

As a tool, MusicLM is far from perfect, but it hints at where things are headed. The same can be said of ChatGPT itself. As billionaire Mark Cuban recently said of the A.I. chatbot, “Imagine what GPT 10 is going to look like.”

This story was originally featured on Fortune.com

More from Fortune:
Olympic legend Usain Bolt lost $12 million in savings to a scam. Only $12,000 remains in his account
Meghan Markle’s real sin that the British public can’t forgive–and Americans can’t understand
‘It just doesn’t work.’ The world’s best restaurant is shutting down as its owner calls the modern fine dining model ‘unsustainable’
Bob Iger just put his foot down and told Disney employees to come back into the office

Advertisement