Back to Projects
LinkedIn - Job Postings Data Pipeline
Python
dbt
PostgreSQL
AWS
Tableau
Overview
A comprehensive data engineering project that automates the collection and processing of job postings from LinkedIn. The pipeline uses a modular architecture with microservices for each stage of the ELT process. Key Features: - Modular ELT system using Python and Airflow - Data enrichment using AI for skills extraction and summary generation - Automated cover letter generation based on job descriptions - Staging and Marts layers implemented with dbt - Interactive visualization in Tableau