AI and Agents

Speech

Search every project in one place

Press / to search. Tap a tag to filter. Click any row for details.

Search and filter

Filtering for

Results

Row number Tags
A general-purpose automatic speech recognition model trained on 680k hours of multilingual and multitask supervised data.
A family of open-source voice AI models from Microsoft for text-to-speech and long-form speech recognition.
A tokenizer-free text-to-speech foundation model for multilingual speech generation and voice cloning.

Know a project that belongs here?

Tell us what it does and why it stands out.