PinnedRamses Alexander Coraspe ValdezinITNEXTBuilding a Schema Inference Data Pipeline for Large CSV filesA parallel implementation with python·6 min read·Jul 9, 2022--1--1
PinnedRamses Alexander Coraspe ValdezinITNEXTBuilding Real-time communication with Apache Spark through Apache LivyDockerizing and Consuming an Apache Livy environment·5 min read·Jun 12, 2022--1--1
PinnedRamses Alexander Coraspe ValdezinITNEXTHow to build a DAG based Task Scheduling tool for Multiprocessor systems using pythonScheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag·14 min read·Jun 7, 2022----
PinnedRamses Alexander Coraspe ValdezinGeek CultureDesign, Development and Deployment of a simple Data PipelineData Engineering technical challenge (part 1)·6 min read·Jun 5, 2022----
PinnedRamses Alexander Coraspe ValdezinAWS in Plain EnglishBuilding an ETL pipeline with Apache Airflow and Visualizing AWS Redshift data using Power BITracking Uber Rides and Uber Eats expenses with Apache Airflow, AWS Redshift and Power BI.·11 min read·Apr 30, 2021--2--2
Ramses Alexander Coraspe ValdezWorking with large CSV files in Python from Scratch5 Techniques·12 min read·Dec 21, 2022----
Ramses Alexander Coraspe ValdezDesigning and Planning an Event Store SystemCQRS and Event Sourcing design patterns·8 min read·Dec 11, 2022----
Ramses Alexander Coraspe ValdezinPython in Plain EnglishBuilding, Preparing and Cleaning a Real Estate DatasetDockerizing a Python Script for Faster Web Scraping·4 min read·Jun 14, 2022----
Ramses Alexander Coraspe ValdezinPython in Plain EnglishHow to Build a Lossless Data Compression and Data Decompression PipelineA parallel implementation of the bzip2 high-quality data compressor tool in Python.·6 min read·Apr 20, 2022----