Databricks, Snowflake

These are solutions used to build data warehouses and data lakes. Lakehouse, from Databricks, is based on the open-source Apache Spark platform, which enables analytical queries on semi-structured data without a traditional database schema. Snowflake is a data warehouse that supports ETL (Extract, Transform, Load) processes: an ETL tool extracts data from different source systems, transforms it in a staging area, and then loads it into the data warehouse. Databricks started mainly as a data lake company but has since added warehouse features to its product. Snowflake started at the opposite end of the spectrum, as a data warehouse, and is now introducing data lake features.

A data lake is a system or repository of data stored in its natural, raw format. A broad range of analyses can be run directly on that data, and the results can then be used for visualisation (reporting, dashboards), big data processing, real-time analytics and machine learning.
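The Extract, Transform, Load steps described above can be sketched in a few lines of Python. This is only a minimal illustration, not how Snowflake or Databricks implement ETL: the sample data, the `orders` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical raw source data; a real pipeline would read files, APIs, or databases.
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,20.00
3,carol,not_a_number
"""


def extract(text):
    """Extract: read raw rows from a source system (here, an in-memory CSV)."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: clean and validate the rows in a staging step."""
    staged = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # drop rows whose amount cannot be parsed
        customer = row["customer"].strip().lower()
        staged.append((int(row["order_id"]), customer, amount))
    return staged


def load(conn, staged):
    """Load: write the cleaned rows into the (stand-in) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", staged)
    conn.commit()


def run_etl(conn, text=RAW_CSV):
    """Run the three steps in order and return what ended up in the table."""
    load(conn, transform(extract(text)))
    return conn.execute(
        "SELECT customer, amount FROM orders ORDER BY order_id"
    ).fetchall()


if __name__ == "__main__":
    warehouse = sqlite3.connect(":memory:")  # SQLite as a stand-in warehouse
    print(run_etl(warehouse))
```

The point of separating the three functions is that the transform step works on staged data before anything reaches the warehouse table, so malformed rows (such as the third one here) never get loaded.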
