Get in Touch

Course Outline

Introduction

  • The Data Science Lifecycle
  • Roles and responsibilities within Data Science

Setting Up the Development Environment

  • Libraries, frameworks, programming languages, and tools
  • Local development setups
  • Collaborative, web-based development environments

Data Collection

  • Various Data Types
    • Structured Data
      • Local databases
      • Database connectors
      • Standard formats: xlsx, XML, JSON, CSV, etc.
    • Unstructured Data
      • Clickstream data, sensors, smartphones
      • Application Programming Interfaces (APIs)
      • Internet of Things (IoT)
      • Documents, images, videos, audio
  • Case Study: Continuously collecting large volumes of unstructured data

Data Storage

  • Relational databases
  • Non-relational databases
  • Hadoop: Distributed File System (HDFS)
  • Spark: Resilient Distributed Dataset (RDD)
  • Cloud-based storage solutions

Data Preparation

  • Ingestion, selection, cleansing, and transformation processes
  • Ensuring data quality—focusing on accuracy, relevance, and security
  • Generating exception reports

Languages for Preparation, Processing, and Analysis

  • R Language
    • Introduction to R
    • Data manipulation, calculations, and graphical representation
  • Python
    • Introduction to Python
    • Manipulating, processing, cleaning, and analyzing data

Data Analytics

  • Exploratory Data Analysis
    • Fundamental statistics
    • Creating preliminary visualizations
    • Gaining deeper understanding of the dataset
  • Establishing causality
  • Feature engineering and transformations
  • Machine Learning
    • Supervised vs. unsupervised learning
    • Guidelines for selecting the appropriate model
  • Natural Language Processing (NLP)

Data Visualization

  • Industry best practices
  • Matching the right chart type to the data
  • Effective use of color palettes
  • Advancing visualization techniques
    • Interactive dashboards
    • Dynamic visualizations
  • Data storytelling

Summary and Conclusion

Requirements

  • A foundational understanding of database concepts
  • Basic knowledge of statistics
 35 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories