Meta AI Unveils Voicebox: Revolutionary Generative AI Model Sets New Standards in Speech Synthesis and Multilingual Capabilities
Meta AI has developed Voicebox, which it describes as the first generative AI model for speech that generalizes across tasks with state-of-the-art performance. Voicebox produces high-quality audio clips and can synthesize speech in six languages, as well as perform noise removal, content editing, style conversion, and diverse sample generation. It outperforms VALL-E, the current state-of-the-art English model, and YourTTS on word error rate, and it sets new state-of-the-art results on audio style similarity metrics on both English and multilingual benchmarks. Voicebox is based on a new approach called Flow Matching and was trained on more than 50,000 hours of recorded speech and transcripts from public domain audiobooks in six languages. Meta AI is not releasing the Voicebox model or code publicly because of the potential for misuse, but it is sharing audio samples and a research paper detailing its approach and results. With its versatility across tasks such as in-context text-to-speech synthesis, cross-lingual style transfer, speech denoising and editing, and diverse speech sampling, Voicebox could usher in a new era of generative AI for speech.