Get in Touch

Course Outline

Introduction to Programming Big Data with R (bpdR)

  • Configuring your environment to use pbdR
  • Overview of pbdR scope and available tools
  • Commonly used packages with Big Data in conjunction with pbdR

Message Passing Interface (MPI)

  • Utilizing pbdR MPI 5
  • Parallel processing techniques
  • Point-to-point communication
  • Handling Matrix sending operations
  • Matrix summation methods
  • Collective communication strategies
  • Matrix summation using Reduce
  • Scatter and Gather operations
  • Additional MPI communication methods

Distributed Matrices

  • Constructing a distributed diagonal matrix
  • Singular Value Decomposition (SVD) for distributed matrices
  • Building distributed matrices in parallel

Statistics Applications

  • Monte Carlo Integration
  • Loading datasets
  • Reading data across all processes
  • Broadcasting data from a single process
  • Processing partitioned data
  • Distributed Regression analysis
  • Distributed Bootstrap methods
 21 Hours

Testimonials (2)

Related Categories