Ramses Alexander Coraspe Valdez – Medium

Ramses Alexander Coraspe Valdez

Pinned

Ramses Alexander Coraspe Valdez
in
ITNEXT

Building a Schema Inference Data Pipeline for Large CSV files

A parallel implementation with python

6 min read · Jul 9, 2022

Pinned

Ramses Alexander Coraspe Valdez
in
ITNEXT

Building Real-time communication with Apache Spark through Apache Livy

Dockerizing and Consuming an Apache Livy environment

5 min read · Jun 12, 2022

Pinned

Ramses Alexander Coraspe Valdez
in
ITNEXT

How to build a DAG based Task Scheduling tool for Multiprocessor systems using python

Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag

14 min read · Jun 7, 2022

Pinned

Ramses Alexander Coraspe Valdez
in
Geek Culture

Design, Development and Deployment of a simple Data Pipeline

Data Engineering technical challenge (part 1)

6 min read · Jun 5, 2022

Pinned

Ramses Alexander Coraspe Valdez
in
AWS in Plain English

Building an ETL pipeline with Apache Airflow and Visualizing AWS Redshift data using Power BI

Tracking Uber Rides and Uber Eats expenses with Apache Airflow, AWS Redshift and Power BI.

11 min read · Apr 30, 2021

Uber expenses tracking Architecture

Ramses Alexander Coraspe Valdez

Working with large CSV files in Python from Scratch

5 Techniques

12 min read · Dec 21, 2022

Ramses Alexander Coraspe Valdez

Designing and Planning an Event Store System

CQRS and Event Sourcing design patterns

8 min read · Dec 11, 2022

Ramses Alexander Coraspe Valdez
in
Python in Plain English

Building, Preparing and Cleaning a Real Estate Dataset

Dockerizing a Python Script for Faster Web Scraping

4 min read · Jun 14, 2022

Ramses Alexander Coraspe Valdez
in
Python in Plain English

How to Build a Lossless Data Compression and Data Decompression Pipeline

A parallel implementation of the bzip2 high-quality data compressor tool in Python.

6 min read · Apr 20, 2022

Ramses Alexander Coraspe Valdez

Introduction to Apache Spark

Apache Spark

13 min read · Nov 21, 2021

Ramses Alexander Coraspe Valdez

Very passionate about data engineering and technology. I love to design, create, test and write about ideas. I hope you like my articles.
