<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Projects on Data Trenches</title><link>https://data-trenches.leandrof.space/projects/</link><description>Recent content in Projects on Data Trenches</description><generator>Hugo -- gohugo.io</generator><language>en-us</language><managingEditor>leandrojlfernandes@gmail.com (Leandro Fernandes)</managingEditor><webMaster>leandrojlfernandes@gmail.com (Leandro Fernandes)</webMaster><lastBuildDate>Sun, 04 Jan 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://data-trenches.leandrof.space/projects/index.xml" rel="self" type="application/rss+xml"/><item><title>Real-time ML Model Serving</title><link>https://data-trenches.leandrof.space/projects/realtime-ml-serving/</link><pubDate>Sun, 04 Jan 2026 00:00:00 +0000</pubDate><author>leandrojlfernandes@gmail.com (Leandro Fernandes)</author><guid>https://data-trenches.leandrof.space/projects/realtime-ml-serving/</guid><description>&lt;h2 id="the-challenge">The Challenge&lt;/h2>
&lt;p>Deploying machine learning models to serve real-time inference requests for client-facing applications with strict latency requirements.&lt;/p>
&lt;h2 id="the-solution">The Solution&lt;/h2>
&lt;p>Built and deployed low-latency inference services using modern microservices architecture:&lt;/p>
&lt;ul>
&lt;li>FastAPI-based REST endpoints&lt;/li>
&lt;li>Docker containerization for consistency&lt;/li>
&lt;li>Load balancing and auto-scaling&lt;/li>
&lt;li>Health monitoring and logging&lt;/li>
&lt;/ul>
&lt;h3 id="technologies-used">Technologies Used&lt;/h3>
&lt;ul>
&lt;li>FastAPI&lt;/li>
&lt;li>Docker&lt;/li>
&lt;li>Machine Learning Deployment&lt;/li>
&lt;li>API Development&lt;/li>
&lt;/ul>
&lt;h2 id="impact">Impact&lt;/h2>
&lt;ul>
&lt;li>Real-time model inference capabilities&lt;/li>
&lt;li>Low-latency responses for client applications&lt;/li>
&lt;li>Scalable architecture handling varying load&lt;/li>
&lt;li>Easy model updates and rollbacks&lt;/li>
&lt;li>Production-grade reliability&lt;/li>
&lt;/ul>
&lt;p>This project showcased the ability to bridge the gap between ML models and production applications, ensuring models could be consumed by real users with minimal latency.&lt;/p></description></item><item><title>NLP Analytics Engine</title><link>https://data-trenches.leandrof.space/projects/nlp-analytics-engine/</link><pubDate>Thu, 18 Sep 2025 00:00:00 +0000</pubDate><author>leandrojlfernandes@gmail.com (Leandro Fernandes)</author><guid>https://data-trenches.leandrof.space/projects/nlp-analytics-engine/</guid><description>&lt;h2 id="the-challenge">The Challenge&lt;/h2>
&lt;p>Building a production-grade NLP analytics engine capable of processing semantic data from 25,000 daily targets while maintaining high availability and delivering actionable insights to enterprise clients.&lt;/p>
&lt;h2 id="the-solution">The Solution&lt;/h2>
&lt;p>Designed and implemented an end-to-end pipeline from model training to deployment, including:&lt;/p>
&lt;ul>
&lt;li>Data ingestion and preprocessing pipeline&lt;/li>
&lt;li>Model training infrastructure&lt;/li>
&lt;li>Inference serving layer&lt;/li>
&lt;li>Monitoring and alerting system&lt;/li>
&lt;/ul>
&lt;h3 id="technologies-used">Technologies Used&lt;/h3>
&lt;ul>
&lt;li>Python&lt;/li>
&lt;li>Machine Learning/NLP libraries&lt;/li>
&lt;li>Distributed processing&lt;/li>
&lt;li>Containerization (Docker)&lt;/li>
&lt;li>API development (FastAPI)&lt;/li>
&lt;/ul>
&lt;h2 id="impact">Impact&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>$700k recurring revenue&lt;/strong> generated from the analytics solution&lt;/li>
&lt;li>Processes semantic data from &lt;strong>25,000+ daily targets&lt;/strong>&lt;/li>
&lt;li>Production-grade reliability and performance&lt;/li>
&lt;li>Real-time analytics delivery to clients&lt;/li>
&lt;/ul>
&lt;p>This project demonstrated the full lifecycle of deploying ML models in production, from data pipeline to client-facing application. The atual output of this project can&amp;rsquo;t be shared publicly given it was trained with confidential data.&lt;/p></description></item><item><title>PySpark Infrastructure Optimization</title><link>https://data-trenches.leandrof.space/projects/pyspark-optimization/</link><pubDate>Sat, 17 Feb 2024 00:00:00 +0000</pubDate><author>leandrojlfernandes@gmail.com (Leandro Fernandes)</author><guid>https://data-trenches.leandrof.space/projects/pyspark-optimization/</guid><description>&lt;h2 id="the-challenge">The Challenge&lt;/h2>
&lt;p>Handling massive-scale data processing while maintaining reasonable query latency and managing compute resource costs in a distributed environment.&lt;/p>
&lt;h2 id="the-solution">The Solution&lt;/h2>
&lt;p>Architected distributed processing jobs using PySpark with multiple optimization strategies:&lt;/p>
&lt;ul>
&lt;li>Algorithmic improvements to reduce computational complexity&lt;/li>
&lt;li>Storage optimization using Trino and Hive&lt;/li>
&lt;li>Query execution plan optimization&lt;/li>
&lt;li>Resource allocation tuning&lt;/li>
&lt;li>Data partitioning strategies&lt;/li>
&lt;/ul>
&lt;h3 id="technologies-used">Technologies Used&lt;/h3>
&lt;ul>
&lt;li>PySpark&lt;/li>
&lt;li>Apache Hadoop&lt;/li>
&lt;li>Trino&lt;/li>
&lt;li>Hive&lt;/li>
&lt;li>Distributed Systems&lt;/li>
&lt;/ul>
&lt;h2 id="impact">Impact&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>25% reduction&lt;/strong> in query latency&lt;/li>
&lt;li>&lt;strong>25% decrease&lt;/strong> in resource consumption&lt;/li>
&lt;li>Improved processing efficiency for massive datasets&lt;/li>
&lt;li>Significant cost savings on compute resources&lt;/li>
&lt;/ul>
&lt;p>This optimization effort required deep understanding of distributed systems, Spark internals, and data storage patterns to achieve measurable performance gains.&lt;/p></description></item><item><title>nAttrMon Open Source Contribution</title><link>https://data-trenches.leandrof.space/projects/nattrmon-contribution/</link><pubDate>Tue, 07 May 2019 00:00:00 +0000</pubDate><author>leandrojlfernandes@gmail.com (Leandro Fernandes)</author><guid>https://data-trenches.leandrof.space/projects/nattrmon-contribution/</guid><description>&lt;h2 id="the-challenge">The Challenge&lt;/h2>
&lt;p>Improve system observability across distributed clusters by developing custom plugins for the &lt;a href="https://github.com/OpenAF/nAttrMon">nAttrMon&lt;/a> monitoring tool.&lt;/p>
&lt;h2 id="the-solution">The Solution&lt;/h2>
&lt;p>Contributed code and developed custom monitors to detect and visualize real-time system bottlenecks:&lt;/p>
&lt;ul>
&lt;li>Real-time system monitoring&lt;/li>
&lt;li>Custom monitoring architecture&lt;/li>
&lt;li>Alerting and notification systems&lt;/li>
&lt;li>Performance metrics collection&lt;/li>
&lt;/ul>
&lt;h3 id="technologies-used">Technologies Used&lt;/h3>
&lt;ul>
&lt;li>Java&lt;/li>
&lt;li>Open Source Development&lt;/li>
&lt;li>System Monitoring&lt;/li>
&lt;li>Distributed Systems&lt;/li>
&lt;/ul>
&lt;h2 id="impact">Impact&lt;/h2>
&lt;ul>
&lt;li>Enhanced observability across distributed clusters&lt;/li>
&lt;li>&lt;strong>Reduced mean-time-to-resolution (MTTR)&lt;/strong> for outages&lt;/li>
&lt;li>Better system performance insights&lt;/li>
&lt;li>Community contribution to open source project&lt;/li>
&lt;li>Improved system stability for carrier networks&lt;/li>
&lt;/ul>
&lt;p>This open source work demonstrated the ability to understand and contribute to complex systems while providing practical value to the community.&lt;/p></description></item></channel></rss>