Skip to product information
1 of 1
Regular price $199.00 USD
Regular price Sale price $199.00 USD
Sale Sold out
View full details

Complete the Big Data Professional Consulting course and, optionally, get accredited as a Certified Big Data Consultant by passing the certification exam. You can purchase the course now and get the exam later, or you can get them together at a discount as part of the Certification Bundle.

Upon completing the course you will receive a digital certificate of completion, as well as a digital training badge from Acclaim/Credly. Because this course encompasses both the Big Data Professional and Big Data Consultant certifications, upon passing the exam you will also receive official Big Data Professional and Big Data Consultant digital accreditation certificates and certification badges from Acclaim/Credly, along with an account that can be used to verify your certification status.

If you already completed the Big Data Professional course modules, you can purchase a partial course (or a partial bundle) with only the modules specific to the Big Data Consultant track here.

The Big Data Professional Consulting course is comprised of the following 3 course modules, each of which has an estimated completion time of 10 hours:

  • Module 1: Fundamental Big Data Science & Analytics
  • Module 2: Big Data Analysis & Technology Concepts
  • Module 3: Big Data Analysis & Technology Lab
  • Module 4: Big Data Analysis & Science
  • Module 11: Fundamental Big Data Engineering

Choose the Certification Bundle to receive the entire course together with the online-proctored certification exam and a set of practice exam questions, all at a bundle discount.

Upon purchasing this course, you will automatically receive access via the Online Interactive eLearning platform. To provide you with the greatest flexibility, you will also have the option to access the course materials via two additional eLearning formats, at no extra cost. All three eLearning formats are briefly described below. A more detailed comparison can be found here.
  1. For everyday learning: An online interactive eLearning platform with individual lessons, as well as interactive and automatically graded exercises and practice questions.
  2. For learning on-the-go: A study kit platform with access to full course documents that support online/offline synching, annotations, comments, custom bookmarks and cross-document searches.
  3. For your reference: A set of printable PDF documents that you can keep (for all course workbooks and posters).
All three forms of access are subject to Arcitura’s *. Upon purchase, access to the online interactive eLearning platform (1) is provided within one business day. Access to the study kits (2) and the PDF documents (3) is provided upon request.

Shown below are the digital contents and the topic outline for each course module:

Module 1: Fundamental Big Data Science & Analytics

This foundational course module provides a high-level overview of essential Big Data topic areas. A basic understanding of Big Data from business and technology perspectives is provided, along with an overview of common benefits, challenges, and adoption issues. The module content is divided into a series of modular sections, each of which is accompanied by one or more hands-on exercises.

Course Module Contents

  • Workbook Lessons (100+ pages)
  • Video Lessons (for all topics)
  • Mind Map Poster
  • Symbol Legend Poster

  • Patterns and Mechanisms Poster
  • Practice Exam Questions
  • PDFs of Workbook and Posters (printable)

Topics Covered

  • Understanding Big Data
  • Fundamental Big Data Terminology and Concepts
  • Big Data Business Drivers and Technology Drivers
  • Traditional Enterprise Technologies Related to Big Data
  • OLTP, OLAP, ETL and Data Warehouses in relation to Big Data
  • Characteristics of Data in Big Data Environments
  • Dataset Types in Big Data Environments
  • Structured, Unstructured and Semi-Structured Data

  • Metadata and Data Veracity
  • Fundamental Analysis and Analytics
  • Quantitative and Qualitative Analysis
  • Machine Learning Types
  • Descriptive and Diagnostic Analytics
  • Predictive and Prescriptive Analytics
  • Business Intelligence and Big Data
  • Data Visualization and Big Data
  • Big Data Adoption and Planning Considerations

Module 2: Big Data Analysis & Technology Concepts

This course module explores a range of the most relevant topics that pertain to contemporary analysis practices, technologies and tools for Big Data environments. The module content intentionally keeps coverage at a conceptual level, focusing on topics that enable participants to develop a comprehensive understanding of the common analysis functions and features offered by Big Data solutions, as well as a high-level understanding of the back-end components that enable these functions.

Course Module Contents

  • Workbook Lessons (100+ pages)
  • Video Lessons (for all topics)
  • Mind Map Poster

  • Supplement
  • Practice Exam Questions
  • PDFs of Workbook and Poster (printable)

Topics Covered

  • Big Data Analysis Lifecycle (from Business Case Evaluation to Data Analysis and Visualization)
  • A/B Testing and Correlation
  • Regression and Heat Maps
  • Time Series Analysis
  • Network Analysis and Spatial Data Analysis
  • Classification and Clustering
  • Filtering, including Collaborative Filtering and Content-based Filtering
  • Sentiment Analysis and Text Analytics

  • Clusters and Processing Batch and Transactional Workloads
  • How Cloud Computing relates to Big Data
  • Foundational Big Data Technology Mechanisms
  • Big Data Storage Devices and Processing Engines
  • Resource Managers, Data Transfer Engines and Query Engines
  • Analytics Engines, Workflow Engines and Coordinate Engines

Module 3: Big Data Analysis & Technology Lab

This course module presents participants with a series of exercises and problems that are designed to test their ability to apply their knowledge of topics covered in previous modules. Completing this lab will help highlight areas that require further attention and will help prove proficiency in big data analysis and technology and practices as they are applied and combined to solve real-world problems.

Course Module Contents

  • Lab Exercise Booklet
  • Mind Map Poster

  • Practice Exam Questions
  • PDFs of Exercise Booklet and Poster (printable)

Topics Covered

  • Reading Exercise 3.1: Case Study Background PLGM
  • Lab Exercise 3.2: Plan the Big Data BI Environment
  • Lab Exercise 3.3: Analyze Customer Loyalty Data
  • Lab Exercise 3.4: Alleviate Customer Dissatisfaction
  • Lab Exercise 3.5: Improve PLGM’s On-Line Sales
  • Reading Exercise 3.6: Case Study Background LHL
  • Lab Exercise 3.7: Plan the Data Integration and Reporting Environment

  • Lab Exercise 3.8: Develop a Treatment Personalization Capability
  • Lab Exercise 3.9: Enhance LHL’s Research Capability
  • Reading Exercise 3.10: Case Study Background SWP
  • Lab Exercise 3.11: Smart Meter Data Analysis
  • Lab Exercise 3.12: Enhance Electricity Demand Prediction Capability
  • Lab Exercise 3.13: Asset Management and Risk Identification Capability

Module 4: Big Data Analysis & Science

This course module provides an in-depth overview of essential topic areas pertaining to data science and analysis techniques relevant and unique to big data with an emphasis on how analysis and analytics need to be carried out individually and collectively in support of the distinct characteristics, requirements and challenges associated with big data datasets.

Course Module Contents

  • Workbook Lessons (100+ pages)
  • Video Lessons (for all topics)
  • Mind Map Poster

  • Supplement
  • Practice Exam Questions
  • PDFs of Workbook and Poster (printable)

Topics Covered

  • Data Science, Data Mining & Data Modeling
  • Big Data Dataset Categories
  • High-Volume, High-Velocity, High-Variety, High-Veracity, High-Value Datasets
  • Exploratory Data Analysis (EDA)
  • EDA Numerical Summaries, Rules and Data Reduction
  • EDA analysis types, including Univariate, Bivariate and Multivariate
  • Essential Statistics, including Variable Categories and Relevant Mathematics
  • Statistics Analysis, including Descriptive, Inferential, Covariance, Hypothesis Testing, etc.
  • Measures of Variation or Dispersion, Interquartile Range & Outliers, Z-Score, etc.
  • Probability, Frequency, Statistical Estimators, Confidence Interval, etc.
  • Data Munging and Machine Learning

  • Variables and Basic Mathematical Notations
  • Statistical Measures and Statistical Inference
  • Confirmatory Data Analysis (CDA)
  • CDA Hypothesis Testing, Null Hypothesis, Alternative Hypothesis, Statistical Significance, etc.
  • Distributions and Data Processing Techniques
  • Data Discretization, Binning and Clustering
  • Visualization Techniques, including Bar Graph, Line Graph, Histogram, Frequency Polygons, etc.
  • Prediction Linear Regression, Mean Squared Error and Coefficient of Determination R2, etc.
  • Clustering k-means, Cluster Distortion, Missing Feature Values, etc.
  • Numerical Summaries

Module 11: Fundamental Big Data Engineering

This course module covers engineering-related concepts, techniques and technologies for the processing and storage of big data datasets. It highlights the unique challenges faced when processing and storing large, volatile and disparate sets of data. NoSQL is covered and the MapReduce data processing engine is explained in detail as a base framework for high-volume batch data processing.

Course Module Contents

  • Workbook Lessons (100+ pages)
  • Video Lessons (for all topics)

  • Mind Map Poster
  • Practice Exam Questions
  • PDFs of Workbook and Poster (printable)

Topics Covered

  • Big Data Engineering Techniques and Challenges
  • Big Data Storage, including Sharding, Replication, CAP Theorem, ACID and BASE
  • Master-Slave, Peer-to-Peer Replication, Combining Replication with Sharding
  • Big Data Storage Requirements, Scalability, Redundancy and Availability
  • Fast Access, Long-term Storage, Schema-less Storage and Inexpensive Storage
  • On-Disk Storage, including Distributed File System and Databases
  • Introduction to NoSQL and NewSQL
  • NoSQL Rationale and Characteristics

  • NoSQL Database Types, including Key-Value, Document, Column-Family and Graph Databases
  • Big Data Processing Engines
  • Distributed/Parallel Data Processing, Schema-less Data Processing
  • Multi-Workload Support, Linear Scalability and Fault-Tolerance
  • Big Data Processing Requirements, including Batch, Cluster and Realtime Modes
  • MapReduce for Big Data Processing, including Map, Combine, Partition, Shuffle and Sort and Reduce
  • MapReduce Algorithm Design
  • Task Parallelism, Data Parallelism

Learn About Arcitura: Take the Video Tour

Watch these helpful informational videos to learn about Arcitura programs, courses and certifications.

About Arcitura

About Arcitura Courses

About Arcitura Certifications

What’s in an Arcitura Course


Each course provides a comprehensive curriculum with 2-8 modules and 20-80 hours of training.

More Than Just
Video Lessons

In addition to standard video lessons, courses include full-color workbooks and reference posters for all lessons.

Interactive & Graded

Courses also include interactive and graded exercises, interactive and graded self-tests and other supplements.

The Arcitura Difference


  • is authored by a dedicated courseware development team
  • has a self-test, accreditation exam and professional certification
  • is available via two different eLearning platforms


  • undergo a common development process
  • are authored to be consistent in quality, structure and style
  • share a common vocabulary and symbol notation
  • are authored in collaboration with subject matter experts

Take Your Skills Anywhere

Regardless of whether you are an individual looking to boost your career or an organization looking to up-skill a team, Arcitura courses and certifications provide a sound investment.

Because both courses and accreditations are vendor-neutral, they empower you with skills and credentials that you can take to wherever you need to go.

Professional Instructor-Led Training & Coaching



Contact or 604-904-4100 during PT working hours.