
Data is the lifeblood of every business, but most of it is underutilized, buried under high costs and complex schema management. Simply funneling data into an analysis platform or cheap storage isn’t enough. Data deserves better.
Cribl set the standard for telemetry pipelines with Cribl Stream. Now, we’re bringing that same innovation to data lake architecture—delivering elastic scalability, automated tiered storage, and unified schema-agnostic data management. Since launching Cribl Lake last April, customer feedback has been overwhelmingly positive. One customer called it “stupid easy” to get started. Another slashed SIEM costs by using Cribl Search to analyze data directly in Lake. But our customers told us they want more — more flexibility, more use cases to tackle, and more ways to collaborate across teams and regions with all their telemetry data.
Traditional log data storage solutions are designed for structured enterprise data, forcing teams to manually define schemas, master SQL, and build parsers just to make data usable. Instead of focusing on insights, they’re stuck managing complex ETL pipelines. As telemetry data explodes in volume, legacy solutions make storing everything “just in case” costly and unsustainable. A new, scalable approach to eliminate these burdens is necessary. IT and security teams want their telemetry data to be instantly accessible for real-time analysis without the overhead and without breaking the budget. That’s why we built Lakehouse: to rethink how organizations store, manage, and analyze telemetry data at scale.
Making Waves: What is Lakehouse?
Cribl’s new Lakehouse is the first lakehouse purpose-built for the dynamic, unpredictable nature of telemetry data. Unlike traditional solutions designed for structured enterprise data, Lakehouse eliminates schema management complexity and manual transformation, delivering elastic scalability, automated tiered storage to optimize costs, and unified data management experience and federated query capabilities across diverse datasets and geographies. Lakehouse empowers organizations to unlock the full value of all their telemetry data by allowing IT and security teams to effortlessly store and analyze massive volumes of evolving telemetry data in real time—without requiring data engineering expertise.

Lakehouse is a next-generation architecture designed to unlock the full potential of telemetry data at scale. As a new feature within Cribl Lake, Lakehouse makes it possible to store massive volumes of ever-changing telemetry data while enabling real-time, high-performance dashboards and analytics.
One Control Plane to Manage It All
Lakehouse removes complex management barriers by offering real-time visibility, instant access to any stored data, and the ability to compose, manage, and query highly distributed datasets across regions — all from a single interface. Users can manage one or many lakes with ease, isolate workloads for performance and security, and dynamically accelerate queries across highly distributed datasets.
Cribl serves as a single, unified control plane for telemetry, eliminating the complexity of juggling multiple tools and data silos. With built-in security and granular access controls, teams can enforce strict role-based permissions, ensuring the right users have the right access without adding operational overhead. By centralizing data management, Cribl streamlines governance, simplifies compliance, and reduces risk—all while giving teams the flexibility to route, transform, and store data wherever it delivers the most value.
Go with the Flow: How Lakehouse Works
Spinning up a Lakehouse in Cribl Lake is super fast, super easy, and takes just a few clicks. Here’s how it works:
Ingest Data: Data flows into a Cribl Lake dataset.
Set up Acceleration: Add dataset to Lakehouse acceleration tier. ALL data will continue to be stored in Lake simultaneously.
Short-term Retention: Data stays in Lakehouse for up to 30 days.
Smart Search: Queries automatically target the fastest option, whether Lakehouse or Cribl Lake.
Transparent to Users: Search runs without extra steps, always choosing the best source based on query time range.
Rehydrate as Needed: Full-fidelity data is always available in Cribl Lake via Replay.
Watch the demo video to see how easy it is to set up:
Not Your Average Lakehouse: What Sets Cribl’s Lakehouse Apart
Lakehouse goes beyond traditional architectures, delivering a cloud-native solution with composable management, self-service access, and elastic resource consumption. Notable distinctions set Lakehouse apart from other solutions out there:
Federated, Distributed Data Management: Users can manage a single lake or multiple lakehouses across regions, enabling isolated workloads to prevent query interference, improve security, and maintain fine-grained RBAC (role-based access control). Regardless of dataset location or speed of access, Cribl provides a unified experience across all lakes and queries.
Schema-Agnostic, No-Code Data Management: Instead of requiring users to manually define schemas like with traditional data lakes, Lakehouse automatically structures telemetry data for exploration and analysis. No need for complex SQL queries, custom parsers, or expensive ETL pipelines—just send the data in, and it’s instantly available for high-performance dashboards and analytics
Composable, Cloud-Native Scalability: Unlike rigid legacy architectures, Lakehouse dynamically provisions resources, allowing organizations to start small and scale on demand. By keeping lakehouses close to data egress points but with a centralized control plane, organizations reduce costs, improve security, and optimize performance for diverse use cases.
Fully Managed Data Experience: Cribl eliminates the need for deep database expertise—users don’t have to manage the nuances of time series databases (TSDB), columnar acceleration, or cloud data warehouses. Instead, Cribl intelligently routes and stores data in the optimal format, reducing costs by 50% compared to traditional solutions while ensuring seamless search and retrieval from a single, unified query interface.
Automated Tiered Storage for Cost Optimization: Users can define storage tiers based on access frequency and retention needs, ensuring real-time access to high-value telemetry data without performance degradation or long retrieval times.
Cribl breaks the traditional storage-analysis lock-in with open formats, flexible performance tiers, and seamless integrations. Lakehouse future-proofs your infrastructure, so you can work smarter with your data, keep costs predictable, and your teams ahead — without expertise, without complex schema management, and without vendor lock-in.
Ready to Join us at the Lakehouse?
Lakehouse is available to current Cribl.Cloud Enterprise customers through early access, reach out to your Cribl account team if you'd like to try it out before general availability in March.
Want to dive deeper into Cribl Lake, Lakehouse, or Cribl Search?