Packt – Mastering Big Data Analytics with PySpark

Packt – Mastering Big Data Analytics with PySpark-RiDWARE
English | Size: 1.65 GB
Category: Tutorial


Effectively apply Advanced Analytics to large datasets using the power of PySpark
Learn
Gain a solid knowledge of vital Data Analytics concepts via practical use cases
Create elegant data visualizations using Jupyter
Run, process, and analyze large datasets using PySpark
Utilize Spark SQL to easily load big data into DataFrames
Create fast and scalable Machine Learning applications using MLlib with Spark
Perform exploratory Data Analysis in a scalable way
Achieve scalable, high-throughput and fault-tolerant processing of data streams using Spark Streaming

Cloud Academy – Big Data Analytics on Azure

English | Size: 1.03 GB
Category: Tutorial


Microsoft Azure provides robust services for analyzing big data. One of the most effective ways to do so is to store your data in Azure Data Lake Storage Gen2 and then process it using Spark on Azure Databricks.

Pluralsight – Big Data LDN A GDPR Retrospective

Pluralsight – Big Data LDN A GDPR Retrospective-NOLEDGE
English | Size: 120.09 MB
Category: Tutorial


Big Data LDN 2019 | A GDPR Retrospective: Implementation by a Large-scale Data Organization in Reality | Morri Feldman

May 25, 2018 was a fateful day for many companies that process and store client data, particularly across the EU. On this day, GDPR went into effect, and no one knew quite what its effects would be. This talk takes you through our company’s journey to compliance: the indexers we used to append & delete client data, and a retrospective of how this affected our data processing operations. It walks through design and implementation, as well as expectation vs. real demand. What we imagined would be requested by hundreds of clients at best ended up being requested by tens of thousands and growing. Learning how to manage this new compliance demand alongside our day-to-day data engineering tasks and processes was no easy feat.

Linux Academy – Big Data Fundamentals

Linux Academy – Big Data Fundamentals-BiFiSO
English | Size: 607.48 MB
Category: Tutorial

If you’re completely new to big data and aren’t quite sure what it is, why it’s necessary, and how it works, then this is the course for you! We are going to clarify what big data is (and isn’t), while also defining some other related terms around data characterization and analysis methods. Then, we will talk about some architectural problems with big data and how we solve them with cluster computing, distributed storage, and cluster management. Lastly, we will cover some of the popular technologies and illustrate how big data is used in the real world to hopefully shine a light on how big data is already impacting your daily life – whether you realize it or not. Let’s get started!

Cloud Academy – AWS Big Data Specialty Data Collection

Cloud Academy – AWS Big Data Specialty Data Collection-STM
English | Size: 560.28 MB
Category: Tutorial

In course one of the AWS Big Data Specialty Data Collection learning path, we explain the various data collection methods and techniques for determining the operational characteristics of a collection system. We explore how to define a collection system able to handle the frequency of data change and the type of data being ingested. We identify how to enforce data properties such as order, data structure, and metadata, and how to ensure the durability and availability of our collection approach.

Intended audience: