Featured Projects
Video Skip Intro/Outro Intelligence
An end-to-end computer vision and audio matching microservice to automatically identify skip markers in streaming content, scaling across millions of active playback sessions.
Low-Latency Inference System
Low-latency inference server serving models with sub-15ms P99 responses. Integrated Docker, Kubernetes scaling pods, and Grafana validation dashboards for extreme resilience.
Network Resilience Simulator
A backend framework designed to emulate chaotic network conditions and high-concurrency request loads to test the stress limits of critical microservices.
RAG Knowledge Pipeline
A retrieval-augmented generation system with vector search, embedding pipelines, and LLM orchestration for intelligent document Q&A at enterprise scale.
GitHub Repositories
Personal portfolio website — built with vanilla HTML, CSS & JS, featuring interactive ML pipeline simulations and terminal.
View on GitHubUniversity coursework and backend engineering projects from Computer Science curriculum.
View on GitHubSudoku solver using backtracking algorithm — a clean implementation of constraint satisfaction.
View on GitHubProfessional Experience
Technology Lead / Applied ML Engineer
- Scaled ML-powered product features across 7 European markets, impacting high-volume consumer applications.
- Designed and automated Airflow pipelines handling 300–500+ DAG runs/day across batch and real-time inference.
- Productionized ML models into low-latency inference APIs for seamless product integration.
- Developed computer vision/video intelligence features (skip intro/outro) across millions of playback sessions.
- Improved system reliability and SLA tracking via SLIs/SLOs, monitoring dashboards (Prometheus/Grafana), and validation frameworks.
- Built resilient infrastructure using Docker and Kubernetes, ensuring fault tolerance and high availability.
Senior Systems Engineer
- Designed and developed high-performance backend systems and APIs for data-intensive applications.
- Built a robust traffic simulation and network emulation framework for resilience testing under extreme network load.
- Improved legacy system performance and scalability through modernization, caching, and code refactoring efforts.
Systems Engineer
- Developed automated data processing and engineering pipelines to support downstream analytics and ML workflows.
- Optimized SQL database query performances and built schedulers for ETL batch processing.
Education
B.Tech, Computer Science
SRM Institute of Science and Technology
About Me
With over 8 years of experience in engineering, I specialize in the convergence of machine learning models and high-performance backend systems. I bridge the gap between data science and product engineering, translating complex research models into performant, production-ready microservices.
Currently based in Amsterdam, Netherlands, my focus lies in refining real-time inference systems, designing large-scale Airflow pipelines, and implementing retrieval architectures (RAG) using modern LLMs and vector search.
I build software with the assumption of scale, resilience, and change.
Skills Grid
Applied Machine Learning
Generative AI & Search
Backend & Distributed Systems
Cloud & Observability
Get In Touch
I am currently open to new opportunities, technical discussions, or collaboration projects. Reach out via email:
yashgpt2894@gmail.com