The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In <i>Spark in Action, Second Edition</i>, you''ll learn to take advantage of Spark''s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning.<BR><BR>Unlike many Spark books written for data scientists, <i>Spark in Action, Second Edition</i> is designed for data engineers and software engineers who want to master data processing using Spark without having to learn a complex new ecosystem of languages and tools. You''ll instead learn to apply your existing Java and SQL skills to take on practical, real-world challenges. <BR><BR>Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.