AI audio research

Audiobox: Where anyone can make a sound with an idea

Audiobox is Meta’s new foundation research model for audio generation. It can generate voices and sound effects using a combination of voice inputs and natural language text prompts — making it easy to create custom audio for a wide range of use cases. The Audiobox family of models also includes specialist models Audiobox Speech and Audiobox Sound, and all Audiobox models are built upon the shared self-supervised model Audiobox SSL.

Play and create with Audiobox models using the demos below

Capabilities

A series of interactive audio demos to help you understand the unique capabilities of Audiobox. You can experiment with each capability individually.

Audiobox Maker

Express your creativity and make a fun and original audio story with all that Audiobox has to offer. Download and share it with friends.

Research

How does Audiobox work?

Explore the technical details of our foundational audio model and our commitment to making AI safe for everyone.