What It Is
Apache Spark is a distributed data processing engine for large-scale computation. On this site, it is part of the practical toolset behind building systems that are easier to understand, operate, and repeat.
Learn more: https://spark.apache.org/documentation.html