Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
Introduction
- The Data Science Lifecycle
- Roles and responsibilities within Data Science
Setting Up the Development Environment
- Libraries, frameworks, programming languages, and tools
- Local development setups
- Collaborative, web-based development environments
Data Collection
-
Various Data Types
-
Structured Data
- Local databases
- Database connectors
- Standard formats: xlsx, XML, JSON, CSV, etc.
-
Unstructured Data
- Clickstream data, sensors, smartphones
- Application Programming Interfaces (APIs)
- Internet of Things (IoT)
- Documents, images, videos, audio
-
Structured Data
- Case Study: Continuously collecting large volumes of unstructured data
Data Storage
- Relational databases
- Non-relational databases
- Hadoop: Distributed File System (HDFS)
- Spark: Resilient Distributed Dataset (RDD)
- Cloud-based storage solutions
Data Preparation
- Ingestion, selection, cleansing, and transformation processes
- Ensuring data quality—focusing on accuracy, relevance, and security
- Generating exception reports
Languages for Preparation, Processing, and Analysis
-
R Language
- Introduction to R
- Data manipulation, calculations, and graphical representation
-
Python
- Introduction to Python
- Manipulating, processing, cleaning, and analyzing data
Data Analytics
-
Exploratory Data Analysis
- Fundamental statistics
- Creating preliminary visualizations
- Gaining deeper understanding of the dataset
- Establishing causality
- Feature engineering and transformations
-
Machine Learning
- Supervised vs. unsupervised learning
- Guidelines for selecting the appropriate model
- Natural Language Processing (NLP)
Data Visualization
- Industry best practices
- Matching the right chart type to the data
- Effective use of color palettes
-
Advancing visualization techniques
- Interactive dashboards
- Dynamic visualizations
- Data storytelling
Summary and Conclusion
Requirements
- A foundational understanding of database concepts
- Basic knowledge of statistics
35 Hours
Testimonials (1)
workshops, practical examples