资讯

In today's rapidly evolving technology landscape, voice interaction has become the mainstream method of human-computer communication. On September 1, Jumps Star officially launched Step-Audio 2 mini—a ...
Striving for open-source and equitable speech-to-speech translation US technology company Meta has produced an AI model that can directly translate speech in one language to speech in another.
From the voice-to-text feature on your phone to the captions that make videos more accessible, speech transcription is ...
The intelligent voice team at Qifu Technology has brought more good news — the multimodal emotion recognition research paper ...
Apache-licensed plan takes aim at costlier options Mistral has released an open automatic speech recognition (ASR) software bundle called Voxtral in a bid to undercut rivals on price and quality ...
Mozilla’s open source voice recognition tool nears human-like accuracy Mozilla has released an open source voice recognition tool that it says is "close to human level performance," and free for ...
Here are the best open-source and free-to-use AI models for text, images, and audio, organized by type, application, and licensing considerations.
Xiaomi has launched a 7-billion-parameter version of its open-source voice model, MiDashengLM, which incorporates Alibaba's open-source Qwen 2.5 series. This model focuses on in-car systems and ...
Fixie's Ultravox Realtime brings enterprise-grade voice recognition, and a scalable architecture designed to handle thousands of concurrent calls. For Hexaware, it adds another powerful tool to ...