OpenAI has launched a new tool that can instantly make short videos in response to written commands. The Microsoft-backed company has named the text-to-video generator ‘Sora’.
Though tech giants like Google and Meta have demonstrated similar technology in the past, the maker of ChatGPT has pulled well ahead in terms of quality.
To announce the new tool, OpenAI took to its official X (formerly Twitter) handle and wrote, “Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions.”
OpenAI also attached a video generated from the prompt, “Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes.”
Following the announcement, a social media user told OpenAI CEO Sam Altman “Sam please don’t make me homeless”, to which he replied, “will generate you a video, what would you like?” The user then asked for a video of a monkey playing chess in a park. Altman promptly shared a high-quality video made by Sora on the X platform.
However, the tool is not publicly available yet, and OpenAI has revealed limited information about how it was built.
In a statement, OpenAI said that the tool is currently available for red teaming, which helps identify flaws in the AI system, as well as to visual artists, designers, and filmmakers, whose feedback the company is seeking on the model.
“Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” the statement said. Additionally, the tool can create multiple shots within a single video.
Apart from this, Sora can also animate a still image, the company said in a blog post.
Last year, Meta extended its image generation model Emu with two AI-based features that can edit and generate videos from text prompts.