AI Services Catalog
Complete catalog of AI tools with filters by category, price, and rating
PixVerse V5.5 is PixVerse’s audio-visual text- and image-to-video model. It generates 5-10 s, 1080p multi-shot clips with native speech, music, and SFX, and offers improved motion stability and multi-shot camera control for story-driven, lip-synced short videos.
Text- and image-to-video model with better motion control, identity consistency, and cleaner small details.
Wan2.2-S2V-14B is a speech-to-video model that turns a narrated prompt into a coherent, temporally stable clip. It preserves identity and style from references, follows cues in the narration for timing and motion, and supports targeted edits for production use.
Fast reframing and aspect-ratio changes with composition-aware cropping and fill.