Fundamental Big Data Engineering

Module 7
Fundamental Big Data Engineering

This course explores introductory topics pertaining to the field of developing data processing solutions–data engineering–in the context of Big Data environments. Specifically it covers concepts, techniques and technologies related to the processing and storage of Big Data datasets including MapReduce and NoSQL. It highlights the unique challenges faced when processing and storing Big Data datasets. The MapReduce data processing engine, which is the de facto framework for batch processing of large amounts of data, is also explained in detail.

The following primary topics are covered:

  • Big Data Engineering – Big Data Engineering Challenges
  • Big Data Storage Terminologies (including sharding, replication, CAP theorem, ACID, BASE)
  • Big Data Storage Requirements
  • On-Disk Storage (including distributed file system – databases)
  • Introduction to NoSQL – NewSQL
  • NoSQL Rationale – Characteristics
  • NoSQL Database Types (including key-value, document, column-family and graph databases)
  • Big Data Processing Requirements
  • Big Data Processing (including batch mode and realtime mode)
  • Introduction to MapReduce for Big Data Processing (batch mode)
  • MapReduce Explained (including map, combine, partition, shuffle and sort, and reduce)

Duration: 1 Day

Taking the Course at a Workshop

This course can be taken as part of public or private instructor-led workshops. Visit the Workshop Calendar page to view the current calendar of public workshops or contact to inquire about private workshop delivery.

The following materials are provided to public and private workshop participants:

Note that as a workshop participant, you may be eligible for discounts on the purchase of the self-study kit and Pearson VUE exam voucher for this course.

Taking the Course using a Self-Study Kit

This course can be completed via self-study by purchasing a self-study kit, which includes the base course materials as well as additional supplements and resources designed specifically for self-paced study and exam preparation.

Fundamental Big Data Engineering Module 7 Self-Study Kit [ order ]

Visit the Self-Study Kits page for pricing information and for details regarding discounted self-study kit bundles for individual certification tracks. The following materials are provided in the self-study kit for this course:

Module 7 Self-Study GuideModule 7
Self-Study Guide

NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence Text Book

Audio Tutor CD 1 Overview Part IAudio Tutor CD 1
Part I
Audio Tutor CD 2 Overview Part IIAudio Tutor CD 2
Part II
Audio Tutor CD 3 Exam PreparationAudio Tutor CD 3
Exam Preparation

Array Flash Cards30 Flash Cards

This self-study kit can be purchased using the Online Store.

Note that by purchasing and registering this self-study kit, you may be eligible for discounts on the registration of this course as part of a workshop.


This course corresponds to Exam B90.07, which is required for the following certifications:

Vendor-Neutral Topic Overview

Note that all BDSCP course modules are focused on vendor-neutral Big Data topics and therefore do not provide detailed coverage of any vendor-specific platforms or technologies. BDSCP courses are intentionally authored this way so as to provide an unambiguous and objective understanding of Big Data practices and technology that can be further complemented with product-specific training.

Fact Sheet

Download a printable PDF document with information about this course module and its corresponding self-study kit.

Pearson VUE Exams

A self-study kit is available for each Pearson VUE exam:

Self-Study Kits

A self-study kit is available for each Pearson VUE exam, allowing you to study remotely and at your own pace. For information about the latest available Self-Study Kits, visit the Self-Study page.

Instructor-Led Workshops

The following public workshops are currently scheduled. Additional workshops are often added on short notice. For information regarding private instructor-led workshops delivered to your location, contact: