The complexity and extensive processing effort required to serve data-driven and latency-sensitive analytics applications, in terms of time-to-market and cost, create a significant challenge in fully monetizing data assets. When evaluating data virtualization vs data warehouse solutions, most big data infrastructure solutions only optimize one aspect of the problem: either SQL performance tuning or business agility. Optimizing performance requires major data modelling and pre-aggregations which limit the ability to support application development velocity.
Indeed, query engines on top of data lakes can support rapidly changing data requirements but often deliver unacceptable performance and unreasonable costs, due to the reliance on large data scans.
Varada’s cloud data virtualization technology serves as a smart acceleration layer on the data lake, which remains the single source of truth, and runs in the customer cloud environment. Varada enables data teams to democratize data and operationalize the entire data lake while ensuring interactive performance, without the need to move data, model or manually optimize. Our secret sauce is our ability to automatically and dynamically index relevant data, at the structure and granularity of the source. Varada enables any SQL query to meet various performance and concurrency requirements for users and analytics API calls, while keeping costs predictable and under control.
Varada adaptively and dynamically indexes any column on trillions of rows, supporting any ANSI SQL query.
Varada seamlessly connects directly to a wide range of data sources, including the data lake (AWS S3, on-prem Hadoop, etc.), data catalogs (Hive Metastore, AWS Glue) and other sources
(MySQL, PostgreSQL, etc.).
Deliver x100 faster query response time across any data source, using the index to filter, join and aggregate data. Varada automatically and dynamically chooses which queries to accelerate, based on continuous monitoring and priorities set by data teams.
Support hundreds of active users while delivering interactive response time. You can easily support different workload priorities and budgets.
Significantly accelerate analytics time-to-market and enable ad hoc queries by cutting down on manual SQL performance tuning, modeling and data preparation. Varada enables data teams to keep data at its granularity and ensure optimal query flexibility.
Extend your Presto SQL workflows to support low latency and near real-time analytics.
Varada runs in your cloud environment, keeping data in your full control and in your own VPC, so you can employ existing security policies.