Big-Data
All articles
Cloud Dataproc - Managed Spark and Hadoop Service
Google Cloud Dataproc is a fully managed service for running Apache Spark, Hadoop, Flink, and Presto clusters …Azure Synapse Analytics - Unified Analytics and Data Warehousing
Azure Synapse Analytics is an integrated analytics platform that combines enterprise data warehousing, big …Azure HDInsight - Managed Open-Source Big Data Clusters
Azure HDInsight is a managed cloud service for running open-source big data frameworks including Apache Spark, …Apache Spark - Unified Big Data Processing Engine
Apache Spark is a multi-language engine for large-scale data processing, machine learning, and streaming …Apache Hive - Data Warehouse on Hadoop
Apache Hive is a data warehouse infrastructure built on top of Apache Hadoop that provides SQL-like querying …Apache Hadoop - Distributed Big Data Framework
Apache Hadoop is an open-source framework for distributed storage and processing of large data sets across …Apache Flink - Stateful Stream Processing Framework
Apache Flink is a distributed stream processing framework for stateful computations over unbounded and bounded …Amazon EMR - Big Data Processing for AI
A comprehensive reference for Amazon EMR: managed Spark and Hadoop clusters, large-scale data processing, and …
Open source projects