• 1

    Suitable for the Internet of things

    Apache Spark is suitable for the Internet of things, machine learning, cybersecurity. Apache Spark was designed to be suitable for both batch and iterative processing.

  • 2

    “Map-side join” broadcast method

    This method speeds up joins significantly when one of the tables is smaller than the other and can fit in its entirety on individual machines.

  • 3

    Spark has a massive open-source community behind it

    The community improves the core software and contributes practical add-on packages.