close
close

Nvidia drops the bomb: New AI model is open, massive and ready to rival GPT-4

Nvidia drops the bomb: New AI model is open, massive and ready to rival GPT-4

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn more


Nvidia It has released a powerful open-source AI model that competes with proprietary systems from industry leaders such as OpenAI and Google.

company new NVLM1.0 Large family of multimodal language models led by 72 billion parameters NVLM-D-72BIt demonstrates exceptional performance on vision and language tasks while also improving text-only abilities.

“We are introducing NVLM 1.0, a family of leading-edge multimodal large language models that rival leading proprietary models (e.g., GPT-4o) and open access models, achieving state-of-the-art results on visual language tasks. ” researchers explain their papers.

By making model weights public and promised to be released training codeNvidia is breaking with the trend of keeping advanced AI systems closed. This decision gives researchers and developers unprecedented access to the latest technology.

Benchmark results comparing NVIDIA’s NVLM-D model to AI giants such as GPT-4, Claude 3.5, and Llama 3-V show NVLM-D’s competitive performance across a variety of visual and language tasks. (Credit: arxiv.org)

NVLM-D-72B: A versatile performer in visual and textual tasks

The NVLM-D-72B model demonstrates impressive adaptability in processing complex visual and textual input. The researchers provided examples that highlight the model’s ability to interpret memes, analyze images, and solve math problems step by step.

NVLM-D-72B appears to specifically improve performance on text-only tasks after multimodal training. While many similar models saw a decrease in text performance, the NVLM-D-72B improved its accuracy by an average of 4.3 points in basic text benchmarks.

“Our NVLM-D-1.0-72B shows significant improvements over the text backbone in text-only math and coding benchmarks,” the researchers say, highlighting a key advantage of their approach.

NVIDIA’s new AI model demonstrates visual humor and the ability to interpret scientific concepts by analyzing a meme comparing academic abstracts to full texts. (Credit: arxiv.org)

AI researchers respond to Nvidia’s open source initiative

The AI ​​community responded positively to the release. Commenting on social media, one AI researcher observed: “Wow! Nvidia just released a 72B model that is on par with the Lama 3.1 405B in math and coding benchmarks and also has vision.”

Nvidia’s decision to make such a powerful model publicly available could accelerate AI research and development in this area. By providing access to a model that competes with the proprietary systems of well-funded tech companies, Nvidia can enable smaller organizations and independent researchers to make more significant contributions to AI advances.

The NVLM project also offers innovative architectural designs, including a hybrid approach combining different multi-modal processing techniques. This development may shape the direction of future research in this field.

NVLM 1.0: A new chapter in open source AI development

Nvidia’s release of NVLM 1.0 marks a pivotal moment in AI development. By open-sourcing a model that rivals proprietary giants, Nvidia is not only sharing code but also challenging the structure of the AI ​​industry.

This move could trigger a chain reaction. Other tech leaders may feel pressure to open up their research, potentially accelerating the advancement of AI overall. It also levels the playing field, allowing small teams and researchers to innovate with tools once reserved for tech giants.

However, the rollout of NVLM 1.0 is not without risks. As powerful AI becomes more accessible, concerns about misuse and ethical consequences will likely increase. The AI ​​community now faces the complex task of encouraging innovation while establishing guardrails for responsible use.

Nvidia’s decision also raises questions about the future of AI business models. Companies may need to rethink how to create value in AI and maintain their competitive advantage if cutting-edge models become freely available.

The true impact of NVLM 1.0 will emerge in the coming months and years. It could be the beginning of an unprecedented era of collaboration and innovation in AI. Or it could force a reckoning with the unintended consequences of widely available advanced AI.

One thing is certain: Nvidia has fired a shot across the bow of the AI ​​industry. The question now is not whether the landscape will change, but how dramatically it will happen and who will adapt quickly enough to this new world of open AI.