The model, named Megatron-Turing NLG 530B, is about 3x bigger than GPT-3 (530B vs. 175B parameters).<p>The blog post doesn't provide a lot of numbers, but it looks like the model beats the state of the art on a couple of commonsense reasoning benchmarks.<p>Still, it shows that you can't just keep scaling the models and expect magic.