Course Outline

Introduction

Overview of Data Cleaning

  • Why is Data Cleaning Important?

Case Study: When Big Data Is Dirty

Developing A Thorough Data Cleaning Strategy

Common Data Cleaning Tools

  • Drake
  • OpenRefine
  • Pandas (for Python)
  • Dplyr (for R)

Achieving High Data Integrity

  • Complete
  • Correct
  • Accurate
  • Relevant
  • Consistent

Automating the Data Cleaning Process

Monitoring Your Data Cleaning System

Summary and Conclusion

Requirements

  • An understanding of data analytics concepts.

Audience

  • Data Scientists
  • Data Analysts
  • Business Analysts
 7 Hours

Number of participants



Price per participant

Testimonials (2)

Related Courses

Excel For Statistical Data Analysis

14 Hours

Data Analytics With R

21 Hours

Data Analysis with Hive/HiveQL

7 Hours

Data Analysis with Python, Pandas and Numpy

14 Hours

Knowledge Discovery in Databases (KDD)

21 Hours

NLP: Natural Language Processing with R

21 Hours

A Practical Introduction to Data Analysis and Big Data

35 Hours

Elasticsearch for Developers

14 Hours

MATLAB Fundamentals, Data Science & Report Generation

35 Hours

Data and Analytics - from the ground up

42 Hours

SQL Advanced level for Analysts

21 Hours

Apache Kylin: From Classic OLAP to Real-Time Data Warehouse

14 Hours

Datameer for Data Analysts

14 Hours

Embedding Projector: Visualizing Your Training Data

14 Hours

kdb+ and q: Analyze Time Series Data

21 Hours

Related Categories