Big-Data

8 articles
Cloud Dataproc - Managed Spark and Hadoop Service Google Cloud Dataproc is a fully managed service for running Apache Spark, Hadoop, Flink, and Presto clusters …Azure Synapse Analytics - Unified Analytics and Data Warehousing Azure Synapse Analytics is an integrated analytics platform that combines enterprise data warehousing, big …Azure HDInsight - Managed Open-Source Big Data Clusters Azure HDInsight is a managed cloud service for running open-source big data frameworks including Apache Spark, …Apache Spark - Unified Big Data Processing Engine Apache Spark is a multi-language engine for large-scale data processing, machine learning, and streaming …Apache Hive - Data Warehouse on Hadoop Apache Hive is a data warehouse infrastructure built on top of Apache Hadoop that provides SQL-like querying …Apache Hadoop - Distributed Big Data Framework Apache Hadoop is an open-source framework for distributed storage and processing of large data sets across …Apache Flink - Stateful Stream Processing Framework Apache Flink is a distributed stream processing framework for stateful computations over unbounded and bounded …Amazon EMR - Big Data Processing for AI A comprehensive reference for Amazon EMR: managed Spark and Hadoop clusters, large-scale data processing, and …