If you are a developer or data scientist interested in big data, Spark is the tool for you.
Apache Spark’s ability to speed up analytic applications by orders of magnitude, along with its versatility and ease of use, is quickly winning the market. Because Spark appeals to developers, end users, and integrators who need to solve complex data problems at scale, it is now the most active open source project in the big data community.
Databricks is happy to present this ebook as a practical introduction to Spark. With rapid adoption by enterprises across a wide range of industries, Spark has been deployed at a massive scale, collectively processing multiple petabytes of data on clusters of over 8,000 nodes.
- Learn why Spark is a popular choice for data analytics
- Discover what tools and features are available
- Explore Spark’s basic architecture
- Start right away through interactive sample code