Data Engineering Projects
Automated data pipelines and ETL systems for complex, multi-source datasets, built during my work at the Leonard C. Cooper Jnr International Trade Center.
ERS-Cooper: Automated Global Trade Data Pipeline
A Python-based ETL system that synchronizes complex trade datasets from the USDA, WTO, World Bank, and IMF. It features a modular architecture, automated error handling, and data normalization.
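As a rough illustration of how a modular design with automated error handling might fit together, here is a minimal sketch. It is not the ERS-Cooper code itself: the `Source` class, the `normalize` and `run_pipeline` functions, the field names, and the in-memory stub extractors are all hypothetical stand-ins for real download logic.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Record = Dict[str, object]


@dataclass
class Source:
    """One pluggable data source (hypothetical structure)."""
    name: str
    extract: Callable[[], List[Record]]  # returns raw records for this source


def normalize(record: Record, source_name: str) -> Record:
    """Coerce a raw record into one shared schema (assumed field names)."""
    return {
        "source": source_name,
        "country": str(record["country"]).strip().upper(),
        "year": int(record["year"]),
        "value": float(record["value"]),
    }


def run_pipeline(sources: List[Source]) -> Tuple[List[Record], Dict[str, str]]:
    """Run every source independently so one failure does not stop the rest."""
    rows: List[Record] = []
    errors: Dict[str, str] = {}
    for source in sources:
        try:
            raw = source.extract()
            cleaned = [normalize(r, source.name) for r in raw]
        except Exception as exc:  # record the failure and move on
            errors[source.name] = repr(exc)
            continue
        rows.extend(cleaned)
    return rows, errors


def usda_stub() -> List[Record]:
    # Made-up record standing in for a real download.
    return [{"country": " usa ", "year": "2020", "value": "12.5"}]


def wto_stub() -> List[Record]:
    # Simulates a source that is temporarily unavailable.
    raise ConnectionError("simulated outage")


rows, errors = run_pipeline([Source("USDA", usda_stub), Source("WTO", wto_stub)])
```

In this sketch each source is isolated behind its own extractor, so adding a new provider means adding one `Source` entry rather than changing the pipeline loop.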
The Challenge
Agricultural trade researchers at NC A&T were spending 20+ hours per week manually downloading, cleaning, and merging data from multiple international sources.
Modular Architecture
Multi-Source Integration
Measurable Impact
Supporting Academic Research
This pipeline was the foundation of my master's thesis, “Quantifying the Impact of U.S. Free Trade Agreements on Agricultural Exports.”
Additional Data Projects
Country Reference Data Synchronization
Automated system for normalizing country names and codes across multiple international data sources with 99.5% matching accuracy.
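One common way to approach this kind of matching is to reduce each name to a canonical key and look it up in an alias table. The sketch below assumes that approach; the `ALIASES` table, `canonical_key`, and `to_iso3` are hypothetical names, and the table holds only a few sample entries rather than the project's actual reference data.

```python
import re
import unicodedata
from typing import Optional

# Tiny illustrative alias table mapping cleaned names to ISO 3166-1 alpha-3 codes.
ALIASES = {
    "united states": "USA",
    "united states of america": "USA",
    "usa": "USA",
    "korea rep": "KOR",
    "republic of korea": "KOR",
    "south korea": "KOR",
    "cote divoire": "CIV",
    "ivory coast": "CIV",
}


def canonical_key(name: str) -> str:
    """Strip accents, apostrophes, punctuation, and case differences."""
    text = unicodedata.normalize("NFKD", name)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = text.lower().replace("'", "").replace("\u2019", "")
    text = re.sub(r"[^a-z0-9]+", " ", text)
    return " ".join(text.split())


def to_iso3(name: str) -> Optional[str]:
    """Return the ISO alpha-3 code for a name, or None if it is unmatched."""
    return ALIASES.get(canonical_key(name))


samples = ["Côte d'Ivoire", "Korea, Rep.", "  UNITED STATES ", "Atlantis"]
codes = [to_iso3(s) for s in samples]
unmatched = [s for s, code in zip(samples, codes) if code is None]
```

Collecting the unmatched names separately makes it easy to review them by hand and extend the alias table over time.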
Macroeconomic Data Integration
Automated extraction and integration of GDP, exchange rates, and population data from the World Bank and IMF, covering 200+ countries over 20+ years.
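Integration of this kind usually comes down to aligning each indicator on a shared (country, year) key. The following is a minimal sketch under that assumption; the `integrate` and `gdp_per_capita` functions and the indicator names are hypothetical, and the numbers are made-up placeholders rather than real World Bank or IMF figures.

```python
from collections import defaultdict
from typing import Dict, Optional, Tuple

Key = Tuple[str, int]  # (ISO alpha-3 code, year)


def integrate(**indicators: Dict[Key, float]) -> Dict[Key, Dict[str, float]]:
    """Merge several indicator series into one record per (country, year)."""
    panel: Dict[Key, Dict[str, float]] = defaultdict(dict)
    for name, series in indicators.items():
        for key, value in series.items():
            panel[key][name] = value
    return dict(panel)


def gdp_per_capita(row: Dict[str, float]) -> Optional[float]:
    """Derive GDP per capita, or None when an input is missing or zero."""
    if "gdp_usd" in row and row.get("population"):
        return row["gdp_usd"] / row["population"]
    return None


# Placeholder values only, chosen to keep the arithmetic obvious.
gdp = {("USA", 2020): 1000.0, ("MEX", 2020): 400.0}
population = {("USA", 2020): 10.0}
exchange_rate = {("USA", 2020): 1.0, ("MEX", 2020): 20.0}

panel = integrate(gdp_usd=gdp, population=population, exchange_rate=exchange_rate)
usa_gdp_pc = gdp_per_capita(panel[("USA", 2020)])
mex_gdp_pc = gdp_per_capita(panel[("MEX", 2020)])
```

Keeping missing values explicit (here as `None`) avoids silently dropping country-years where one provider has no data.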
Interested in Data Engineering?
I'm passionate about building robust, scalable data systems that enable research and decision-making.