About Me:

I enjoy coding, understanding, and explaining. Thinking about software/data architecture solutions and implementing them. In fact, regardless of the language and tools, I learn quickly and enjoy picking my brain on difficult engineering problems.

  • Hard skills
    ·Python
    ·Kafka
    ·GenAI
    ·Self-hosting of LLMs in production env
    ·MLOPS
    ·Machine Learning / Deep Learning
    ·CI/CD
    ·Kubernetes
    ·Docker
    ·Spark
    ·RAG Systems
    ·REST API
  • Soft skills
    ·Excellent communication
    ·Problem Solving
    ·Leading by example
    ·Teamwork and Collaboration
    ·Attention to Detail
    ·High adaptation and learning abilities
  • Language
    ·Fluent French and English
    ·Intermediate Chinese and Japanese
  • 03/2024-current
    Data Engineer, Scientist / GenAI Developer at RTBF

    · Delivering personalized recommendations to over 5 million users across media types including news articles, VOD, podcasts, and live TV, spanning two websites.

    · Maintaining over 100 microservices within the recommendation architecture.

    · Developed a uniform logging system for all recommendation APIs to support operational use and ensure compliance with EU/Belgian laws.

    · Performed graph query optimization to enhance API response times using Cypher language in the Neo4j graph database.

    · Contributed to the development of an internal framework, structured as a versioned Python library, and distributed via CodeArtifact.

    ·Technical stack : Python, Kafka, Neo4j, Kubernetes (EKS), Docker, Redis, AWS, Redshift, Postgres, Terraform, GitLab, CloudWatch, Grafana.
  • 01/2024-current
    Data Engineer Consultant at Amaris

    · Leading the development of a live pipeline that generate video summary of sports live stream for BeinAsia.

    · Joining the development of the internal Data Platform solution, creating the end-user config builder interface from scratch. Engineering a DAG solution for lineage querying and building intricate features like user query validation.

    · Participating in the creation of an easily deployable RAG, leveraging open source projects.

    · Building a GenAI augmented scrapper, to scrap data from targeted website and previous research on efficient prompting to perform batch evaluation and classification with LLM calls
  • 07/2022-12/2023
    Junior Data Engineer at Pragaz

    ·Developed and implemented interactive dashboards : using Dash, leveraging years of delivery data to visualize client locations and extract valuable business insights.

    ·Data-driven business development initiatives : including the creation, testing, and deployment of a web application and ETL pipeline, automating data scraping processes to streamline lead generation efforts.

    ·Technical stack : AWS (EC2, S3, RDS), Docker, Dash, Streamlit, PostgreSQL, Selenium, Beautiful Soup, Github Actions, Pytest, Pandas, SQLAlchemy
  • 2023-2024
    Master's degree in Data Science
    OpenClassroom - CentraleSupélec
  • 2022-2023
    Bachelor's degree in Data analysis
    OpenClassroom - ENSAE
  • 2012-2015
    Bachelor's degree in Video Editing
    University of Rennes 2
  • 2007-2011
    High school diploma in science

Key career projects :

Full Data Lifecycle Pipeline

Jul 2022 - Dec 2023

Led the design and deployment of a complete data lifecycle: from web scraping and ingestion to transformation, storage, and dashboard-based analytics—empowering business decisions at Pragaz.

Lakehouse Data Platform Accelerator

Jan 2024 - Mar 2024

Helped build a Lakehouse data platform accelerator using Databricks and Azure stack, designing user-friendly configuration interfaces and DAG-based lineage querying.

GenAI Scraper and Email Alert

Mar 2024

Created a GenAI-enhanced scraper to analyze websites, rank information by relevance, and email key insights to stakeholders automatically.

Video Content Enrichment Pipeline

Apr 2025 - May 2025

Built a GenAI-powered pipeline for RTBF to enrich video content through transcription, summarization, and entity extraction using fully self-hosted LLMs.

Scalable Personalized Recommendations

Jun 2025 - Dec 2025

Engineered and scaled a metadata-driven recommendation system for auvio.be, generating and caching over 2 million daily recos for optimized frontend performance.

CMS-Integrated Tag Generation

Feb 2025

Developed a CMS-integrated tagging solution using GenAI to help RTBF journalists assign accurate local labels to news articles based on content.

Live Sport Stream Summarization R&D

Mar 2025

Led R&D efforts for BeinAsia to summarize live sports streams in real time using GenAI, improving viewer engagement through digestible highlight reels.

Contact Me

jbgidrol@hotmail.fr

+33698192570

Download CV