I spent an afternoon exploring the capabilities of several text-to-video AI models available in Poe Ai.
My goal is to discover the “best AI video generator” on Poe, so let’s GO!
https://vimeo.com/1061676260?share=copy#t=0
Quick-Jump Links:
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
Starting Prompt
To fairly evaluate each Ai Video generator, every video we generate will use an identical prompt:
“A pullback shot depicting a guitar being strummed by a grunge rock musician standing in front of the reflecting pond at Balboa Park in San Diego, California.”
We include a Subject (guitarist), Actions for the Subject (strumming a guitar), Actions for the Camera (pullback shot), & Environmental details (Balboa Park’s reflecting pond).
We’ll check how well each respective video-generation model is able to pleasingly incorporate all the aspects of our text prompt.
Testing Ai Video Generator Models (Text-to-Video)
We’ll be testing all the Ai Video Generators that are currently available in Poe Ai.
Each video below was generated by a different Ai Video Generator in Poe, and they all used the exact same starting prompt.
1. PIKA 1.0
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 3,750 (Learn more about Poe Ai Compute Points)
- Performance: Poor. Unrecognizable visuals. Essentially unusable.
“There’s nothing. It’s just a weird mass of guitar and there’s some sort of reflection happening. It’s not very useful.”
2. PIKA 1.5
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 11,500
- Performance: Noticeably better, but visuals inconsistent.
“It’s doing a lot of what we asked for. I see there’s a guitar, there’s a musician, there’s a reflecting pond.”
3. Dream Machine
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 12,000
- Performance: Stylized, somewhat unnatural.
“Dream Machine’s got a lot of good accuracy. He’s moving in kind of slow motion, but it does a pullback like we asked.”
4. Kling Pro Version 1.5
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 16,667
- Performance: Realistic drone-like pullback, believable.
“Now this looks, to me, very realistic. I could have shot that if I were at this venue, with myself and my phone.”
5. Hailuo AI
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 16,667
- Performance: Stylized, moderate realism.
“I really like it. It has a level of artifice; it looks a little fake and processed.”
6. Hailuo Live
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 16,667
- Performance: Realistic movements, stable visuals.
“This one seems more realistic compared to the other one’s hyper-stylized look. He’s strumming; he’s doing everything.”
7. PIKA 2.0
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 20,000
- Performance: Good realism, accurate movements.
“This actually looks like a friend of mine. So there we go. ”
8. Runway
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 20,000 (10-sec)
- Performance: Good visuals, “zoom out” rather than “pullback”.
“He’s kind of floating on the water, but maybe his music is just that good. [Cinematography-wise, ]it is not a pullback that I’m seeing. To me it looks like a zoom out.”
9. Google VEO 2
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: 92,000
- Performance: Good realism, maybe, but inaccurate environment.
“That’s certainly not Balboa Park, because you can’t see downtown from Balboa Park. I gotta say I don’t think that was 92,000 points spent very well.”
10. Ray 2 (Initial Test)
Pika 1.0 – Pika 1.5 – Dream Machine – Kling Pro 1.5 – Hailuo Ai – Hailuo Live – Pika 2.0 – Runway – VEO-2 – Ray 2 – Sora
- Compute Points: ~94,000
- Performance: Promising but inconsistent.
“I was interested to see how well it was accurately depicting finger strumming and moving up the guitar neck. He’s kind of wonking out there toward the end.”
11. Open Ai Sora
- Compute Points: N/A – Open Ai uses different metrics
- Performance: Strong color saturation and realism, overeall.
“I think it’s basically the most realistic video of the bunch. I’m only a regular Chat GPT+ user, so I can only generate 5 second clips, and they all have an OpenAi watermark.”
Improving Realism with Image References
For more control over the finished video’s look, we can give most Ai Video Models an “image reference” file, too.
Let’s return to the idea of the Balboa-Park Guitarist; we’ll use an Ai Image Generator in Poe called “Recraft AI” to create a still image, incorporating all aspects from original video prompt.
Let’s use this image reference with each Ai Video Generator again (except VEO-2, which doesn’t support image uploads).
With this still-image as our ‘starting point’, we’ll be able to see how each Ai Video generator does with image-to-video Ai examples in Poe:
This video comparison shows each Ai Video model’s output, in sequence:
( Pika 1.0, Pika 1.5, Dream Machine, Kling Pro, Hailuo Ai, Hailuo Live, Pika 2.0, Runway, Ray 2, Real Life)
We clearly see the fidelity improvements across Pika models and dynamic camera actions emerging in Pika 2.0.
Kling Pro appeared the most realistic, as if recorded by a human.
Ray 2 had the highest accuracy, smoothest action, and most dynamic camera movements overall.
Also, Ray 2 was WAY cheaper to use when I paired it with an image reference file, and used slightly lower resolutions.
Final Verdict
To answer the headline’s question, the best Ai Video Generator on Poe is: Ray 2, particularly when using an image reference.
Ray 2 delivers the most visually pleasing and accurate results, out of the ten video generators in our test.
Of course, my opinion is subjective, but I think you’ll discover that each ai video-generator in Poe excels at different things.
Go try some Ai video generators on Poe, and let me know what you create and how the results turned out.
New to Poe? Learn more about “What is Poe Ai?” here.
Finally, for the statistics fans, here’s a bunch of numbers:
AI Model | Compute Points | Parameters/Notes |
---|---|---|
Pika-1.0 | 3,750 | No special parameters |
Pika-1.5 | 11,500 | No special parameters |
Dream-Machine | 12,000 | –aspect_ratio 16:9 |
Hailuo-Live | 16,667 | No special parameters |
Hailuo-AI | 16,667 | No special parameters |
Kling-Pro-v1.5 | 16,667 | No special parameters |
Pika-2.0 | 20,000 | No special parameters |
Pika-2.0 | 20,000 | –ingredient_mode precise |
Runway | 20,000 | –duration 10 |
Runway | 21,000 | –duration 10 |
Ray2 | 23,500 | Used image-reference upload |
Veo-2 | 92,000 | No special parameters |
Ray2 | 94,500 | –aspect_ratio 16:9 –duration 9s –resolution 1080p |
Summary:
- Most Expensive: Ray2 (94,500 points with full parameters)
- Most Affordable: Pika-1.0 (3,750 points)
- Total Points Spent: 368,251
- Same prompt used across all models: “Pullback shot depicting a guitar being strummed by a grunge-rock musician, standing in front of the reflecting pond at Balboa Park in San Diego, California.”
“
Thanks for the research and comparison
I’m delighted to get to share this kind of info. 🙂