Table of Contents
- 1. Happy Horse 1.1 at a Glance
- 2. Text to Video - Up to 15 Seconds at 1080P
- 3. Image to Video - Animate Any First Frame
- 4. Reference to Video - Character-Consistent Generation
- 5. What You Can Build With Happy Horse 1.1 API?
- 6. How Happy Horse 1.1 Compares to Other Pixazo Video Models?
- 7. Accessing Happy Horse 1.1 API on Pixazo
- 8. Happy Horse 1.1 for Developers, Creators, and Platforms
- 9. The Bigger Picture
- 10. Frequently Asked Questions for Happy Horse 1.1 API
Happy Horse 1.1 API is now live on Pixazo API - Alibaba's cinematic video generation model that supports up to 15 seconds of high-definition output, image animation, character-consistent reference generation. Happy Horse 1.1 is designed for high-speed performance at production resolution, making it a strong fit for developers building video pipelines that need both quality and throughput.
This post covers what Happy Horse 1.1 does well, its three API endpoints (text-to-video, image-to-video, reference-to-video), how it fits alongside other video models on Pixazo, and how to get started.
Happy Horse 1.1 at a Glance
Happy Horse 1.1 is Alibaba's cinematic video generation model, made available through Pixazo API. It generates videos up to 15 seconds at up to 1080P resolution, with support for five aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4) - covering landscape, portrait, square, and broadcast formats from a single endpoint.
The model ships with three distinct generation modes:
- Text to Video (T2V) - Generate cinematic clips from a text prompt alone
- Image to Video (I2V) - Animate a provided first-frame image into a video
- Reference to Video (R2V) - Generate character-consistent video using reference images
| Provider | Alibaba |
| Model type | Text-to-Video -· Image-to-Video -· Reference-to-Video - |
| Max clip duration | 15 seconds (range: 3-15 sec) |
| Resolution | 720P or 1080P (default: 1080P) |
| Aspect ratios | 16:9, 9:16, 1:1, 4:3, 3:4 |
| Watermark | None by default |
| Pricing (promotional) | 720P: $0.084/sec -· 1080P: $0.11/sec |
| API access | Pixazo API - one key, unified credits |
Happy Horse 1.1 is the current generation; Happy Horse 1.0 remains available for teams with existing integrations.
Text to Video - Up to 15 Seconds at 1080P
Happy Horse 1.1's T2V endpoint accepts a text prompt and returns a video clip between 3 and 15 seconds. Default resolution is 1080P; 720P is available at a lower cost per second for use cases where full HD isn't required.
resolution: 720P or 1080P (default: 1080P)
ratio: 16:9, 9:16, 1:1, 4:3, 3:4 (default: 16:9)
seed: integer for reproducibility across runs
watermark: false by default - output ships without branding
The prompt field accepts up to 5,000 Latin characters (or 2,500 Chinese characters), giving enough room for detailed scene descriptions across most production briefs without truncation.
Suggested Read: Introducing Happy Horse 1.0 API on Pixazo
Image to Video - Animate Any First Frame
The I2V endpoint takes a first-frame image and animates it into a video. This is useful for product visualization, character animation from existing artwork, and any workflow where the starting visual is already defined and only the motion needs to be generated.
Input image requirements: JPEG, PNG, or WebP; minimum 300Ã300 pixels; aspect ratio between 1:2.5 and 2.5:1; maximum 10MB. The model respects the input image's aspect ratio in the output.
For I2V, the prompt is optional and should describe motion rather than restating the subject - "slow pan right as fog rolls through the forest" performs better than describing what the first frame already shows.
Reference to Video - Character-Consistent Generation
R2V is Happy Horse 1.1's most differentiated mode. It accepts one or more reference images of characters and uses them to maintain visual consistency of those characters across the generated video. Characters are referenced in the prompt using character1, character2, and similar tokens that map to the order of images in the media array.
Use cases where R2V adds the most value:
- Branded character or mascot animation across multiple clips in a campaign
- Consistent protagonist appearance in serialized short-form content
- Product or fashion subject preservation in animated showcase videos
- Multi-character scenes where each character needs to retain their look from reference
What You Can Build With Happy Horse 1.1 API?
The combination of T2V, I2V, R2V makes Happy Horse 1.1 a practical fit across several production workflows:
- Short-form social content - Generate 5-15 second clips in portrait or square format for social platforms directly from briefs or scripts
- Product visualization - Animate product images into showcase videos using I2V, without requiring a full animation pipeline
- Character-led ad campaigns - Use R2V to maintain consistent brand characters or mascots across a campaign's video assets
- Storyboard-to-video pipelines - Convert scene illustrations or storyboard frames into animated clips via I2V
- High-volume video generation - High-speed architecture makes it suitable for batch generation workloads at 720P
Suggested Read: Introducing VEED Fabric 1.0 API on Pixazo
How Happy Horse 1.1 Compares to Other Pixazo Video Models?
Pixazo supports multiple video generation models. Here is how Happy Horse 1.1 sits within that lineup:
| Model | Core Strength | What Happy Horse 1.1 Adds |
|---|---|---|
| Kling AI | Motion quality and physics simulation | R2V character consistency |
| LTX Video | Speed and rapid iteration | Higher-fidelity cinematic output at 1080P, longer clip duration (up to 15 sec) |
| Wan Video | Capable cinematic output | More structured character reference system for multi-character consistency across clips |
| Cosmos | Physically accurate world simulation | Character fidelity and multilingual media production workflows |
Accessing Happy Horse 1.1 API on Pixazo
Happy Horse 1.1 is available on Pixazo API under the same authentication and billing system as all other supported models - one API key, one credit balance, one integration.
Current pricing (40% promotional discount active):
- 720P: $0.084/sec (regular rate $0.14/sec)
- 1080P: $0.11/sec (regular rate $0.18/sec)
Pricing is per second of generated video. A 10-second clip at 1080P costs $1.10 at current promotional rates. New accounts receive $5 free credit on first payment - enough for approximately 45 seconds of 1080P output, or a meaningful test run across all three endpoint modes before committing to a workload.
Suggested Read: Pixazo Launches Wan 2.5 with Audio Video Sync
Happy Horse 1.1 for Developers, Creators, and Platforms
Pixazo wraps Happy Horse 1.1 in the same unified developer layer as all other supported models. That means:
- Developers get a single SDK, single API key, consistent error handling, and webhook callbacks for long-running renders - no per-provider integration overhead
- Creators can experiment between Happy Horse 1.1 and other video models on Pixazo without switching accounts or learning new dashboards
- Platforms and agencies can offer their users multiple video model choices behind a single integration, with usage billing consolidated in one place
Suggested Read: Introducing Gemini Omni API on Pixazo
The Bigger Picture
The video generation model landscape in 2026 is moving fast, and no single model dominates every workflow. Pixazo's video stack now includes models optimized for speed (LTX Video), physics accuracy (Cosmos), motion quality (Kling), and cinematic range (Wan Video). Happy Horse 1.1 adds character-consistent multi-subject generation - capabilities that are harder to replicate through post-production workarounds and genuinely expand what's possible in a pure API pipeline.
For teams building video production workflows that require character consistency across clips or localized multilingual output at production resolution, Happy Horse 1.1 is a meaningful addition to what Pixazo API can support.
Full API documentation, all three endpoint references, and pricing details are at pixazo.ai/models/happy-horse.
Suggested Read: Qwen Image Layered API Now Live on Pixazo
Frequently Asked Questions for Happy Horse 1.1 API
1. What is Happy Horse 1.1 API?
Happy Horse 1.1 is Alibaba's cinematic video generation model, available through Pixazo API. It generates videos up to 15 seconds at 720P or 1080P resolution with support for three modes: text-to-video, image-to-video, and reference-to-video.
2. Who developed Happy Horse?
Happy Horse is developed by Alibaba. Pixazo provides API access to the model through its unified API platform - it does not train or own the underlying model.
3. What are the three generation modes in Happy Horse 1.1?
Text to Video (T2V) generates clips from a text prompt. Image to Video (I2V) animates a provided first-frame image. Reference to Video (R2V) generates character-consistent video using reference images, with characters referenced by token in the prompt.
4. How long can Happy Horse 1.1 generate videos?
The T2V and I2V endpoints support video durations from 3 to 15 seconds. The default duration is 5 seconds. R2V also supports the same 3-15 second range.
5. What resolutions does Happy Horse 1.1 support?
Happy Horse 1.1 supports 720P and 1080P output. The default resolution is 1080P. Five aspect ratios are available: 16:9, 9:16, 1:1, 4:3, and 3:4.
6. What is Reference to Video (R2V) and how does it work?
R2V accepts one or more reference images of characters and uses them to maintain visual consistency of those characters in the generated video. Characters are referenced in the prompt using tokens like character1 and character2, which map to the order of images passed in the API request.
7. How is Happy Horse 1.1 priced on Pixazo?
Pricing is per second of generated video. At current promotional rates (40% off): 720P is $0.084/sec and 1080P is $0.11/sec. A 10-second 1080P clip costs $1.10. New accounts receive $5 free credit on first payment.
8. How do I get started with Happy Horse 1.1 API?
Get an API key at pixazo.ai/models/happy-horse. All three endpoints (T2V, I2V, R2V) use the same key and are documented with request parameters, example requests, and response schemas on model page.
Related Articles
- Introducing Pixazo Free Image generation APIs (Open Beta): Build With Flux Schnell, Stable Diffusion & Inpainting — Free
- Best Image Restoration APIs in 2026
- Flux Schnell API Pricing: Complete Cost Breakdown & The Cheapest Way to Generate Images at Scale
- Best AI Image and Video Generation API Platforms in 2026
- Best Lipsync APIs in 2026
- Best Tools APIs in 2026
- Introducing LongCat-Image API on Pixazo: High-Fidelity, Bilingual Text-to-Image & Editing for Production Workflows
- Best Reference To Image APIs in 2026
- We Ran GPT-Image-2 Against 4 Competitors on 10 Real Pixazo Prompts. Here’s What Arena Didn’t Tell You.
- Introducing Seedance 1.5 API on Pixazo for Cinematic AI Video Generation
- Best Closed Source 3D Model Generation APIs in 2026: In-Depth Comparison Guide
- Introducing Kling Video 3 Turbo Pro API on Pixazo API
- Introducing Ideogram v4 API on Pixazo API
- Best Reference To Video APIs in 2026
- Best Text To Speech APIs in 2026
Most Popular Posts
- Best AI Image and Video Generators in 2026
- Best AI 3D Model Generators in 2026
- Best AI Image Generation Models in 2026
- Best AI Video Generation Models in 2026
- Best Open Source Image Generation Models in 2026
- Best Open Source Video Generation Models in 2026
- Best Prompts to Create Amazing Videos using AI

