Ai Video Generation
inferen-sh/skillsThis skill enables users to generate videos using over 40 AI models through a command-line interface, supporting various types such as text-to-video, image-to-video, and avatar lipsync. It is designed for developers, creators, and AI enthusiasts seeking to produce high-quality or realistic videos with customizable prompts, images, or audio. Key capabilities include upscaling, merging videos, adding sound effects, and creating AI-driven avatars for diverse media workflows.
AI Video Generation
Generate videos with 40+ AI models via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
infsh). Get installation instructions:npx skills add inference-sh/skills@agent-tools
infsh login
# Generate a video with Veo
infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
Available Models
Text-to-Video
Model
App ID
Best For
Veo 3.1 Fast
google/veo-3-1-fast
Fast, with optional audio
Veo 3.1
google/veo-3-1
Best quality, frame interpolation
Veo 3
google/veo-3
High quality with audio
Veo 3 Fast
google/veo-3-fast
Fast with audio
Veo 2
google/veo-2
Realistic videos
P-Video
pruna/p-video
Fast, economical, with audio support
WAN-T2V
pruna/wan-t2v
Economical 480p/720p
Grok Video
xai/grok-imagine-video
xAI, configurable duration
Seedance 1.5 Pro
bytedance/seedance-1-5-pro
With first-frame control
Seedance 1.0 Pro
bytedance/seedance-1-0-pro
Up to 1080p
Image-to-Video
Model
App ID
Best For
Wan 2.5
falai/wan-2-5
Animate any image
Wan 2.5 I2V
falai/wan-2-5-i2v
High quality i2v
WAN-I2V
pruna/wan-i2v
Economical 480p/720p
P-Video
pruna/p-video
Fast i2v with audio
Seedance Lite
bytedance/seedance-1-0-lite
Lightweight 720p
Avatar / Lipsync
Model
App ID
Best For
OmniHuman 1.5
bytedance/omnihuman-1-5
Multi-character
OmniHuman 1.0
bytedance/omnihuman-1-0
Single character
Fabric 1.0
falai/fabric-1-0
Image talks with lipsync
PixVerse Lipsync
falai/pixverse-lipsync
Realistic lipsync
Utilities
Tool
App ID
Description
HunyuanVideo Foley
infsh/hunyuanvideo-foley
Add sound effects to video
Topaz Upscaler
falai/topaz-video-upscaler
Upscale video quality
Media Merger
infsh/media-merger
Merge videos with transitions
Browse All Video Apps
infsh app list --category video
Examples
Text-to-Video with Veo
infsh app run google/veo-3-1-fast --input '{
"prompt": "A timelapse of a flower blooming in a garden"
}'
Grok Video
infsh app run xai/grok-imagine-video --input '{
"prompt": "Waves crashing on a beach at sunset",
"duration": 5
}'
Image-to-Video with Wan 2.5
infsh app run falai/wan-2-5 --input '{
"image_url": "https://your-image.jpg"
}'
AI Avatar / Talking Head
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Fabric Lipsync
infsh app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
PixVerse Lipsync
infsh app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Video Upscaling
infsh app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
Add Sound Effects (Foley)
infsh app run infsh/hunyuanvideo-foley --input '{
"video_url": "https://silent-video.mp4",
"prompt": "footsteps on gravel, birds chirping"
}'
Merge Videos
infsh app run infsh/media-merger --input '{
"videos": ["https://clip1.mp4", "https://clip2.mp4"],
"transition": "fade"
}'
Related Skills
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@agent-tools
# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video
# Google Veo specific
npx skills add inference-sh/skills@google-veo
# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video
# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
Browse all apps: infsh app list
Documentation
- Running Apps - How to run apps via CLI
- Streaming Results - Real-time progress updates
- Content Pipeline Example - Building media workflows
GitHub Owner
Owner: inferen-sh