{"product_id":"advanced-analytics-with-pyspark-patterns-for-learning-from-data-at-scale-using-python-and-spark-9781098103651","title":"Advanced Analytics with Pyspark: Patterns for Learning from Data at Scale Using Python and Spark","description":"\u003cp\u003eThe amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming. \u003c\/p\u003e\u003cp\u003e Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing. \u003c\/p\u003e\u003cp\u003e If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis. \u003c\/p\u003e\u003cul\u003e \u003cli\u003eFamiliarize yourself with Spark's programming model and ecosystem \u003c\/li\u003e\n\u003cli\u003eLearn general approaches in data science \u003c\/li\u003e\n\u003cli\u003eExamine complete implementations that analyze large public datasets \u003c\/li\u003e\n\u003cli\u003eDiscover which machine learning tools make sense for particular problems \u003c\/li\u003e\n\u003cli\u003eExplore code that can be adapted to many uses \u003c\/li\u003e\n\u003c\/ul\u003e\u003cbr\u003e\u003cbr\u003e\u003cb\u003eBinding Type:\u003c\/b\u003e Paperback\u003cbr\u003e\u003cb\u003ePublisher:\u003c\/b\u003e O'Reilly Media\u003cbr\u003e\u003cb\u003ePublished:\u003c\/b\u003e 08\/02\/2022\u003cbr\u003e\u003cb\u003eISBN:\u003c\/b\u003e 9781098103651\u003cbr\u003e\u003cb\u003ePages:\u003c\/b\u003e 215","brand":"Akash Tandon, Sandy Ryza, Uri Laserson","offers":[{"title":"Default Title","offer_id":42200679579829,"sku":"9781098103651","price":56.09,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0473\/0804\/6492\/products\/img_825a9fa7-b747-4caa-8c80-144aca5f6238.jpg?v=1655218783","url":"https:\/\/pastforward.org\/products\/advanced-analytics-with-pyspark-patterns-for-learning-from-data-at-scale-using-python-and-spark-9781098103651","provider":"Past Forward","version":"1.0","type":"link"}