This site is a structured, practical guide on learning enough Python, SQL and systems to get a first job in development or as a junior data engineering.
Though the focus is on data engineering, the essential skills taught will support a career in data science or data analysis - though you will need additional mathematics and statistical understanding which this course does not provide.
What this course is not:
A silver bullet, there is no substitute for practise and hours spent coding.
Contents
- Getting started
- UNIX & gitbash
- Navigating a file system
- git 101
- Setting up a project
- Python 101
- Installing Python package with Pip
- Virtual environments
- iPython
- Objects, Data Structures and Data Types
- Functions
- Loops
- Conditional logic
- Errors
- Python_file.py
- Passing Input
- Debugging
- Unit Testing
- Logging
- Collaborating in Software development
- Agile Ways of Working
- Standards
- Writing clean code
- Linting & pre-commit
- git 102
- Shell 101
- Hot keys and shortcuts
- Aliasing commands
- Escape characters & RegEx
- Bash Scripting
- Continuous Integration
- yaml
- GitHub Actions
- Python 102 - Data Handling With Python
- Working with structured and unstructured data
- Data ingestion with Python and Pandas
- Data quality & cleaning
- Joining data sets
- Plotting with Python
- Plotting maps with Python
- Storing Data
- Integration Testing
- APIs
- Databases and Warehousing
- What is a database?
- Setting up a data warehouse with Snowflake
- Getting data into Snowflake
- Querying Data with SQL
- Data Governance
- Role Based Access Control
- Accessing a Database with Python
- Connecting Python to Snowflake
- Setting up the connection
- Building a data warehouse using Python
- Pushing data into Snowflake with Python
- Query data
- Getting data out of Snowflake using Python
- Consuming APIs
- ELT Pipelines
- Using Python to retrieve data from an API
- Data Lakes
- Building the pipeline to Snowflake
- Warehousing the data
- Transforming the data
- Automating a data export using the Gmail API
- Dashboarding and Data Presentation
- Moving to the Cloud
- Amazon s3, Lambda and IAM
- Architecting Systems
- Event Driven Architecture
- Infrastructure as Code
- Containerisation with Docker
- Putting code into Production
- Scheduling jobs
- Continuous Deployment
- Secrets and Configuration Files
- Monitoring & Alerting
- Game day