david-landeo / portfolio
available
get in touch
profile · summary updated · May 2026
DATA ENGINEER · MADRID, ES

I build reliable pipelines
that other engineers don't hate.

3 years designing & optimizing scalable data pipelines and analytics solutions. Currently at Procter & Gamble — building ETL in Python / PySpark / Databricks across Bronze + Silver, shipping Power BI dashboards stakeholders actually open, and enforcing data-quality checks on production pipelines.

📍 Madrid, ES 🎓 MSc Big Data & AI · UB BSc Electronic Eng. · PUCP 🌐 ES native · EN advanced
identity
David Landeo at his desk David Landeo at his desk

David J. Landeo

// Data Engineer @ P&G
status
● open to roles
location
Madrid, ES
timezone
UTC+1 (CET)
remote
EU ✓
01

Key metrics

last 36 months
Years in production
3.0yrs↑ growing
PySpark runtime cut
~1hruntime↓ faster
from several hours → ~1h
BI dashboards owned
2livestable
biz + eng stakeholders
Languages shipped (AI tool)
5localesES·EN·PT·FI·SV
02

About & stack

how I think · what I use
about

I'm an electronic engineer who became a data engineer. Five years as a teaching assistant at PUCP, an exchange semester in Vigo, hardware roles building drones, air-quality monitors and Peru's first oxygen concentrator — then an MSc in Big Data & AI at the University of Barcelona to pivot into data.

Most of what I do now is unglamorous on purpose: reading other people's PySpark, making it faster, and writing the data-quality checks they wish they'd had. I prefer pipelines that recover quietly to pipelines that look impressive in a slide deck.

Friendliest with Databricks / PySpark / Power BI, comfortable across the rest of the modern data stack, and quick to pick up the next tool when the work calls for it.

stack primary · secondary
Data & pipelinescore
PySparkDatabricksPython SQLAirflowDelta LakeBigQuery
Analytics & BIdelivery
Power BITableauExcel Power AutomateVBA
Platform & opsship + own
GitGH Actions DockerAzureFlaskLinux
ML / AIwhen it earns it
scikit-learnPyTorchSeleniumStreamlitNLP
03

Experience

6 roles · data + hardware · newest first
work history · descending
Sep 2023
present

Data Engineer

@ Procter & Gamble · Madrid
Built & maintained ETL pipelines in Databricks (PySpark) across Bronze + Silver.
Refactored a legacy PySpark codebase — several hours → ~1h.
Designed 2 Power BI dashboards used by business + engineering.
Adapted CI/CD bundles · added data-quality checks & alerting.
Designed Power Automate workflows that cut recurring manual ops.
Owned Master Data accuracy & functionality.
PySparkDatabricksPower BI GitPower Automate
Sep 2023
→ Feb 2025

AI & Big Data Analyst

@ Instituto Escalae · remote
AI-driven translation pipeline for files in ES / EN / PT / FI / SV.
Azure-hosted API that transcribes videos to text over HTTP.
Automated certificate delivery to thousands via Open Badge Factory.
PythonAzureFlaskOpenAI
May 2023
→ Jul 2023

Data Engineer · Intern

@ Rentals United · Barcelona
Web scrapers for prospect contact data.
Automations that shaved hours/week off sales-ops admin.
Pipelines wiring Sheets ↔ BigQuery ↔ APIs.
PythonBigQuerySeleniumAPIs
Jan 2022
→ Sep 2022

Electronic Project Engineer

@ Diacsa · Lima
Helped assemble Covox — Peru's first oxygen concentrator.
Selected components & designed schematics in Eagle.
Printed circuit boards for electronic devices.
Assembled finished electronic devices for customers.
EaglePCBHardware
Jan 2021
→ Sep 2021

Head of Hardware Area

@ Qaira · Lima
Led a team of 5 — technicians + engineers.
Trained customers on the qHAWAX air-quality device.
Planned & supervised delivery of customer purchases.
Welding & QA of printed-circuit-board manufacturing.
Team leadqHAWAXPCB QA
Jun 2020
→ Dec 2020

Technology Development Manager

@ Qaira · Lima
Drone flight testing — improved autonomous flight time & video coverage.
3D-printed parts & CNC cuts for the drone gimbal.
Improved the working environment inside the Hardware Area.
Drones3D printingCNC
04

Education & certifications

degrees · courses · newest first
formal education
Oct 2022
→ Jul 2023

MSc · Big Data & Artificial Intelligence Solutions

@ Barcelona Technology School · Universitat de Barcelona
Master's thesis & capstone projects across Spark, ML, NLP. Career pivot from hardware into data.
Jan 2019
→ Jun 2019

Student Exchange Program

@ Universidad de Vigo · Galicia, ES
Semester abroad in Electronic Engineering — first time outside Peru on an academic track.
Mar 2013
→ Jun 2019

BSc · Electronic Engineering

@ Pontificia Universidad Católica del Perú (PUCP) · Lima
Bachiller in Electronics. 5 years as teaching assistant, Student Representative before the University Assembly.
certifications
01Databases & SQL for Data ScienceIBM · Coursera
02Python for Data Science & AIIBM · Coursera
03Python programmingPUCP
04Intermediate Excelcert · ES
05PLC · programmable logic controllerscert · ES
06English · advancedcert · ES
05

Selected projects

click email for case-study deep-dive
case study · 02

Power BI dashboards

Two dashboards tracking pipeline health and business KPIs. Modeled the semantic layer, owned the data contracts, set up alerting.

Power BIDAXSQL
2dashboards · biz + eng
case study · 03

Contact scraper · RU

Selenium + Sheets/BigQuery/APIs glue and tiny automations that compounded into real hours back to the sales-ops team.

SeleniumBigQueryAPIs
~hrs/week saved
— lab notebook small projects · learning & play
L.01
AI-powered app

Wrapper UI on top of an LLM — prompt scaffolding, response formatting, light persistence.

Python · LLM
L.02
WhatsApp word cloud

Generator that turns an exported chat into a visual word cloud — with stopword cleanup & emoji handling.

Python · NLP
L.03
Chat sentiment analysis

NLP notebook scoring sentiment across WhatsApp chats over time — who's the optimist?

Jupyter · NLP
L.04
Used-car price model

ML regression to predict used-car prices from listing features — baseline + feature engineering iterations.

scikit-learn
L.05
Bedroom-price scraper

Web scraper analysing shared-bedroom listing prices across neighbourhoods — a personal house-hunting tool that escaped.

Selenium
06

Beyond data

the non-linear part
2014 — 2019

Teaching assistant

5 years explaining electronics at PUCP. Learned how to translate dense things into "I get it now."

2015 — 2018

Water polo · National Team

3 years on Peru's NT. Recognized as a qualified athlete. I bring team-sport rhythm to standups.

student years

Student Representative

Elected to the University Assembly at PUCP. Also swam on the university swim team — early lessons in showing up.

ongoing

Coach

Coached swimming and water polo. A good coach asks better questions than a junior data engineer.

07

Get in touch

replies within 24h on weekdays
currently open

Let's talk about data engineering.

Open to Data Engineer roles in Madrid or fully-remote across the EU. Email is fastest — happy to share a case-study deep-dive if you want one.

[email protected]