Sora Roundup
Reviewing the leap forward in video models
Here's today at a glance:
Sora Roundup
On Thursday, Feb 15th, OpenAI unveiled its text-to-video model, Sora, and the world moved forward again. It was not completely unexpected, as many, many teams across the industry were working on individual aspects of video generation. But still… it was a great leap forward. It is very hard to generate video for more than 2 seconds, let alone up to a minute, without weird morphing artifacts or features appearing and disappearing.
Comparisons
Capabilities
I just want to take a moment to explore the capabilities of the Sora model. It shows:
Clear signs of having been trained on the output of a 3D engine
It can generate multiple videos in the same "world" at the same time. This means that eventually, you can just imagine a scene from every possible angle, without needing cameras everywhere.
Sequential scene changes in the same story world
Storytelling
Worryingly realistic-looking humans
Sora allows video-to-video editing
Same Data Source
The comparisons between Sora and Midjourney revealed that they seemed to have been trained on the same data. When we dream in latent space, we have similar dreams.
In effect, the similarity in training data causes convergence to the same district of latent space. Another example below:
We Don't Know How To Do This
Meanwhile, Yann LeCun, Facebook's AI chief, had declared in the Middle East just days prior that generative AI would never reach this milestone:
Yann was out and about on Twitter defending his statements, and to be honest, he may still be right in the end, but the juxtaposition is a tad embarrassing.
In any case, there was an incredible amount of cope among real-world animators.
Though everyone should know better at this point.
Build Alpha
The best information on the Sora build came from the co-author of the underlying paper, Saining Xie:
He goes on to speculate that Sora might only be a 3 billion parameter model, which implies:
not that many GPUs utilized for generation
fast inference
cheap
lots more runway to improve
and quickly
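The "cheap and fast" implications can be sanity-checked with simple arithmetic. A minimal back-of-envelope sketch, assuming the speculated 3 billion parameters stored in fp16 (both figures are assumptions, not confirmed specs):

```python
# Rough memory footprint of a hypothetical 3B-parameter model.
# Assumption: fp16 weights, i.e. 2 bytes per parameter.
params = 3e9
bytes_per_param = 2  # fp16
weights_gb = params * bytes_per_param / 1e9
print(f"~{weights_gb:.0f} GB of weights")  # ~6 GB
```

At roughly 6 GB of weights, such a model would fit comfortably on a single consumer GPU, which is what makes the "not many GPUs, fast inference, cheap" chain of reasoning plausible.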
There are real questions on how closely Sora is simulating reality, with some converting Sora video into 3D scrollable representations known as radiance fields:
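The core of a radiance field is volume rendering: sampling density and color along each camera ray and alpha-compositing them into a pixel. A minimal NumPy sketch of that quadrature (the general NeRF-style formulation, not any specific Sora-to-3D pipeline):

```python
import numpy as np

def render_ray(sigmas, colors, deltas):
    """Composite one camera ray.
    sigmas: (N,) densities, colors: (N, 3) RGB, deltas: (N,) segment lengths."""
    alphas = 1.0 - np.exp(-sigmas * deltas)  # opacity of each segment
    # Transmittance: probability the ray reaches each segment unoccluded.
    trans = np.cumprod(np.concatenate([[1.0], 1.0 - alphas[:-1]]))
    weights = trans * alphas
    return (weights[:, None] * colors).sum(axis=0)  # composited pixel color

# Empty first segment, dense green second segment -> pixel is nearly pure green.
rgb = render_ray(
    sigmas=np.array([0.0, 10.0]),
    colors=np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]),
    deltas=np.array([1.0, 1.0]),
)
```

Fitting such a field to frames of a single generated video is what lets people "scroll" through a Sora scene in 3D.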
OpenAI's first intern, Dr. Jim Fan, was roundly shouted down, but persisted in arguing that Sora must be performing both world and physics modeling.
Poor Google
Meanwhile, poor Google achieved 5-second videos in late January and has still not released the model to the public. Compare:
The final Sora rundown
Leveraging spacetime patches, Sora offers a unified representation for large-scale training across various durations, resolutions, and aspect ratios.
It generates high-definition content, showcasing its prowess in handling videos and images with dynamic aspect ratios.
It excels in framing and composition, outperforming traditional square-cropped training methods.
Utilizing descriptive video captions, Sora achieves higher text fidelity, making it adept at following detailed user prompts for video generation.
From animating static images to extending videos, Sora showcases a wide range of editing capabilities.
Sora's training reveals emergent properties like 3D consistency and long-range coherence, hinting at its potential as a simulator for the physical and digital world.
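The "spacetime patches" idea above is essentially ViT-style patchification extended along the time axis: a video tensor is sliced into small (time, height, width) blocks, each flattened into one token. A hedged sketch of that slicing (the patch sizes and shapes here are illustrative assumptions, not Sora's actual configuration):

```python
import numpy as np

def spacetime_patches(video, pt=2, ph=16, pw=16):
    """Slice a video of shape (T, H, W, C) into pt x ph x pw spacetime patches.
    Returns a (num_patches, pt*ph*pw*C) token sequence.
    Assumes T, H, W are divisible by the patch sizes."""
    T, H, W, C = video.shape
    v = video.reshape(T // pt, pt, H // ph, ph, W // pw, pw, C)
    v = v.transpose(0, 2, 4, 1, 3, 5, 6)  # group the three patch-grid axes first
    return v.reshape(-1, pt * ph * pw * C)

# 8 frames of 64x64 RGB -> 4 x 4 x 4 = 64 tokens of dimension 2*16*16*3 = 1536.
tokens = spacetime_patches(np.zeros((8, 64, 64, 3)))
print(tokens.shape)  # (64, 1536)
```

Because the token count just scales with the number of patches, the same representation handles different durations, resolutions, and aspect ratios, which is the flexibility the rundown describes.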
All Known Soras
A supercut of all known and confirmed Sora videos with their associated prompts.
Things Happen
Legendary chip architect Jim Keller responds to Sam Altman's plan to raise $7 trillion to make AI chips: "I can do it for less than $1 trillion." Everyone is targeting chips at this point.
Geoffrey Hinton: 200,000 people a year die of incorrect medical diagnoses in the United States. AI will fix that in the next 10 years.