Каталог AI-сервисов
Полный каталог AI-инструментов с фильтрами по категориям, ценам и рейтингам
Transcribe audio & video with Whisper. Export TXT/SRT/VTT. Auto-delete 24h.
Hailuo Audio 2.5 hd is a high-fidelity TTS model for production-quality narration and content.
High-fidelity Speech-02 model using AR Transformer plus Flow-VAE, aimed at top-tier zero-shot TTS quality and benchmark-leading naturalness.
Cabina transcriber v1 is a real-time speech-to-text model with stable timestamps, punctuation, and optional speaker labels for meetings, calls, and streams.
Orpheus TTS is Canopy Labs’ Llama-based 3B speech LLM for natural, emotionally controllable, multilingual text-to-speech with real-time streaming and voice cloning.
Zonos-v0.1 is Zyphra’s open-weight text-to-speech family, two 1.6B models trained on 200k+ hours of multilingual speech, offering expressive, real-time TTS and high-quality voice cloning.
High-quality version of MiniMax Speech 2.6, focused on ultra-natural voice, Fluent LoRA cloning and robust handling of complex text formats across many languages.
Qwen’s open text-to-speech model supporting multilingual speech generation with custom voice capability.
In-browser speech recognition demo Space for Moonshine ASR, designed to run locally in the browser (WebGPU or WASM).