AI picture turbines have been all the fashion in 2023, however now corporations are shifting focus to the subsequent frontier — AI video technology. With OpenAI unveiling its AI text-to-video generator, Sora, in February 2024, it was solely a matter of time earlier than Google did the identical.
On Tuesday, at its annual Google I/O developer convention, Google unveiled Veo, its most superior text-to-video generator, able to producing movies with 1080p decision which can be over one minute lengthy.
Along with the high-quality output, Google says that Veo gives customers with an “unprecedented degree of inventive management.” The AI generator’s deeper understanding of pure language permits Veo to ship extra particulars from longer prompts and to know cinematic phrases like “timelapse” or “aerial pictures.”
Additionally: All the pieces introduced throughout Google I/O 2024: Gemini, Search, Android 15, and extra
Moreover, the video generator can deal with a typical drawback with video technology — the fluidity of pictures. Based on Google, Veo can create constant footage, with completely different topics similar to folks, animals, and objects shifting realistically within the pictures.
Google is not new to video technology. The corporate famous that this mannequin builds on all its prior video-generating initiatives, together with Imagen-Video, VideoPoet, and Lumiere.
Like OpenAI’s Sora, Google’s Veo just isn’t accessible to the general public but. Fairly, Google is sharing Veo first with choose creators in a non-public preview inside VideoFX. Google does, nevertheless, invite that you just be part of a waitlist to ultimately attempt the mannequin.
Moreover, Google unveiled Imagen 3, its highest-quality text-to-image mannequin up to now. Imagen 3, which boasts improved picture high quality and fewer visible artifacts, can be restricted to a non-public preview inside ImageFX for choose creators and has its personal waitlist.