Friday, August 22, 2025

AtScale Likes Its Odds in Race to Construct Common Semantic Layer

(Oselote/Shutterstock)

Semantic layers are all of a sudden a scorching commodity because of their functionality to make non-public enterprise knowledge make sense to AI fashions. Databricks and Snowflake are each constructing their very own semantic layers, but when broad business help, common applicability, and the potential to change knowledge lakehouse suppliers are the purpose, then AtScale says it’s forward of the sport.

Over the previous 12 months, the potential of enormous language fashions (LLMs) to generate good high quality SQL code has elevated dramatically, which has spurred nice curiosity in utilizing LLMs as defacto knowledge analysts. The massive hope is that using an LLM to transform a pure language question into SQL will allow many extra individuals, functions, and AI brokers to get entry to enterprise knowledge, thereby attaining (lastly!) the longstanding purpose within the BI group of democratizing entry to knowledge.

That’s the grand plan, anyway, however there’s just a few small particulars to work out–together with the truth that the massive LLMs have (hopefully) by no means seen your non-public database earlier than and subsequently don’t know what the columns, rows, tables, and views truly imply. That’s form of an issue if accuracy is necessary to your board of administrators.

And that’s the place a semantic layer performs an necessary function, by functioning as a translator, if you’ll, between the precise approach you’ve modeled your knowledge in your database–together with the actual measures, dimensions, and metrics that outline your particular person enterprise–and the generic definitions that SQL question engines and AI fashions can learn and perceive.

Semantic layers assist with accuracy with NLQ (Supply: AtScale)

AtScale Co-founder and CTO David Mariani watched as demand elevated for the kind of semantic layer that his firm builds. Initially developed a dozen years in the past to help AtScale’s on-line analytical processing (OLAP) question engine, the corporate’s semantic layer itself has grow to be an enormous gross sales driver and a spotlight for the corporate. That makes the business exercise round semantic layers each good and dangerous, Mariani says.

“It’s like we have been alone in form of shouting from the mountaintops how necessary a semantic layer was, and so now the remainder of the market agrees, in order that’s nice. You’ll be able to’t be a market of 1,” Mariani tells BigDATAwire. “So we’re actually inspired that different individuals are investing on this space. However man, they’ve obtained a whole lot of work in entrance of them. A variety of laborious work.”

There’s no query {that a} semantic layer can enhance the standard of AI-generated BI queries. AtScale lately carried out a check the place it measured the accuracy of SQL queries generated by Google’s Gemini and Snowflake’s Cortex choices. The primary section of the check measured their efficiency on the Transaction Processing Council (TPC) Information Science (DS) benchmark operating as stand-alone merchandise, and the second section measured how they labored utilizing the AtScale semantic layer functioning as a translator. With out the semantic layer, Gemini and Cortex question outcomes have been within the 0% to 30% accuracy vary, relying on schema and query complexity. With AtScale, the scores have been 100%.

Why did the scores enhance a lot? It’s all about understanding how knowledge is saved within the database, which is the place the complexity lives. The TPC DS benchmark simulates a retailer that sells to customers in three manners: in-store, through the Net, and thru a catalog. Gross sales in every of these channels is booked individually within the database, however to grasp what “whole gross sales” means, the individual or software producing the SQL question must know which particular a part of the database has the proper quantity to plug into the equation.

Dave Mariani is the founder and CTO of AtScale

“It’s obtained to look by way of dozens of tables–and these will not be all simply tables, as a result of every of those inexperienced packing containers are dimensions, which itself have a mannequin behind it,” Mariani says. “So it’s immensely complicated. And so to get it proper and to get it proper constantly with out a map–how are you going to get to the vacation spot with out a map?”

One answer can be to easily give your proprietary database to the LLM, which can finally be capable to determine it out. However most organizations are hesitant to do this for safety and privateness considerations. The choice, in fact, is to sit down a semantic layer in between the LLM and your database to operate because the map or the translator.

The query, then, turns into which semantic layer to make use of. Many BI instruments, like Looker, Tableau, and PowerBI, include their very own semantic layers, and datalake suppliers, like Snowflake and Databricks, are additionally constructing semantic layers that perceive knowledge saved on their platforms. Alternatively, clients can select to purchase an unbiased semantic layer that works with a number of front-end BI instruments and backend databases. That is what Mariani and AtScale are constructing: a common semantic layer that works with the whole lot.

“It’s like a Rosetta Stone that means that you can plug various things into it, but it surely nonetheless lives inside your firewall,” Mariani says. “The semantic layer is that firewall, that abstraction layer which permits them to have the independence to change out the again finish or change out the entrance finish. As a result of in the end your corporation logic is identical and your presentation is identical no matter what it’s speaking to.”

AtScale isn’t the one vendor constructing a common semantic layer. Final week we coated the work that its competitor, Dice, is doing. Dbt Labs can be searching for to broaden from its dominant function in knowledge transformation into semantic layers, too.

Mariani respects the work that these distributors are doing, however he additionally insists that AtScale’s semantic layer is extra mature and is best located to grow to be the usual for this area, if one emerges (which isn’t any assure).

LLMs wrestle to make sense of complicated knowledge modeling schemes on non-public knowledge (Picture supply: AtScale)

In 2024, the corporate took a step towards changing into the business commonplace by open sourcing the language it makes use of to outline metrics. Dubbed Semantic Modeling Language (SML), the language is now within the open area. Along with defining metrics, SML can be utilized to translate between different semantic layers, together with help for Snowflake, dbt, PowerBI, and Looker. Mariani says its being donated to the Apache Software program Basis.

Would AtScale take the subsequent step and open supply its semantic engine, as Dice as carried out? That’s not within the playing cards for the time being, Mariani says.

“For now, no, however we’re positively curious about establishing a standard open supply semantic modeling language as a result of, we’re seeing there’s now a whole lot of competing languages,” he says. “We’re not the one recreation on the town. All people’s gotten into it they usually’re all creating their very own languages. And that’s actually form of dangerous for the business, I believe.”

There’s another functionality in AtScale’s semantic layer that may very well be an ace up its sleeve: deep technical help for Microsoft’s knowledge and analytics stack.

“The problem to a common semantic layer is that it’s a must to connect with the whole lot, and that’s the place we now have a bonus. As a result of we’re multi-dimensional, we will help the Microsoft stack by way of and thru,” he says. “Meaning Excel and Energy BI work natively with AtScale, identical to they might work with Microsoft Analytics stack. That’s distinctive to us. And that’s actually, actually, actually laborious as a result of these multidimensional languages will not be meant to be translated right into a tabular SQL language. And we’ve been engaged on that for actually 12 years. Different distributors are going to have a tough time supporting these interfaces.”

As demand for common semantic layers picks up, distributors like AtScale shall be proper within the thick of it. The market hasn’t given a sign but whether or not common semantic layers shall be favored, or whether or not clients shall be happy with utilizing semantic layers tied to explicit BI instruments or knowledge platforms. Within the meantime, larger funding on this space means that extra innovation is on the best way.

Associated Objects:

Past Phrases: Battle for Semantic Layer Supremacy Heats Up

AtScale Claims Textual content-to-SQL Breakthrough with Semantic Layer

Is the Common Semantic Layer the Subsequent Massive Information Battleground?

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles