Databricks says DBRX, its latest open-source LLM, outperforms its open-source competitors, potentially offering enterprises a low-cost way to train it on their own data for generative AI use cases.
Data lakehouse provider Databricks has released DBRX, a family of open-source large language models (LLMs) that it says outperforms OpenAI's GPT-3.5 and Anthropic's Claude 3, as well as open-source models such as Mixtral, Llama 2, and Grok-1, on standard benchmarking tests.
DBRX can be downloaded for free from GitHub and Hugging Face for research or commercial use.
This gives enterprises the opportunity not only to reduce the cost of developing generative AI use cases with their own enterprise data, but also to avoid the constraints that providers of closed models, such as OpenAI, place on commercial usage.
The strategy behind DBRX can be traced back to April last year, when the company launched its first open-source LLM, Dolly 2.0, to show enterprises that they had alternatives to models such as GPT-3.5 and GPT-4.
DBRX is supported on AWS and Google Cloud, and on Microsoft Azure via Azure Databricks, so enterprises can download the model and run it on graphics processing units (GPUs) wherever they wish.
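For teams taking the do-it-yourself route, pulling the model from Hugging Face and running a prompt locally looks roughly like the sketch below. The repository ID, precision, and device settings are illustrative assumptions rather than details from Databricks, and the model's size means this realistically requires a multi-GPU machine.

```python
# Minimal sketch: load DBRX Instruct from Hugging Face and generate a reply.
# The repo ID "databricks/dbrx-instruct" and the bfloat16 / multi-GPU settings
# are illustrative assumptions; older transformers releases may also need
# trust_remote_code=True.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "databricks/dbrx-instruct"  # DBRX Base is published alongside it

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",           # spread the weights across available GPUs
    torch_dtype=torch.bfloat16,  # the full-precision weights won't fit on one GPU
)

messages = [{"role": "user", "content": "Summarize what a data lakehouse is."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=200)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```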
Alternatively, enterprises can subscribe to DBRX and additional tools, such as retrieval-augmented generation (RAG), for customizing the LLM via Databricks' Mosaic AI Model Serving offering.
Mosaic AI Model Serving connects to DBRX via what the company calls Foundation Model APIs, which allow enterprises to access and query LLMs from a serving endpoint.
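To make the serving flow concrete, here is a minimal sketch of querying such an endpoint from Python. It assumes the Foundation Model APIs' OpenAI-compatible chat completions route; the workspace URL, token variable, and endpoint name are placeholders rather than details drawn from Databricks' documentation.

```python
# Sketch: query DBRX Instruct through a Databricks model serving endpoint.
# The workspace URL, access token, and endpoint name are placeholders, and the
# OpenAI-compatible route is an assumption about the Foundation Model APIs.
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DATABRICKS_TOKEN"],  # Databricks personal access token
    base_url="https://<your-workspace>.cloud.databricks.com/serving-endpoints",
)

response = client.chat.completions.create(
    model="databricks-dbrx-instruct",  # assumed pay-per-token endpoint name
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is DBRX Instruct designed for?"},
    ],
    max_tokens=256,
)
print(response.choices[0].message.content)
```

Because the endpoint speaks the same chat-completions dialect as other hosted LLMs, swapping DBRX in behind existing application code is largely a matter of changing the base URL and model name.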
The Foundation Model APIs are provided in two pricing modes: pay-per-token and provisioned throughput.
While pay-per-token is billed on the basis of concurrent requests, provisioned throughput is billed per GPU instance per hour. Both rates, including cloud instance cost, start at $0.070 per Databricks unit.
The company also provides a pricing band for different GPU configurations.
As part of the LLM launch, Databricks has released two models under an open license with certain restrictions: DBRX Base, a pretrained base model, and DBRX Instruct, a fine-tuned version for few-turn interactions.
DBRX is also expected to be available through the Nvidia API Catalog and supported on the Nvidia NIM inference microservice.
While DBRX outperforms most models available today, according to Databricks' tests, OpenAI's GPT-4 still leaves it behind on most benchmarks.


