Turning Environmental Data into Business Decisions
Senior Data Scientist with 7+ years helping research institutes and enterprises build ML pipelines, interactive dashboards, and time-series models that ship. Specialized in climate risk, geospatial analytics, and forecasting.
Data science services that
move your business forward
I don't just build models — I deliver business outcomes. Every project starts with your goals and ends with measurable impact.
Environmental & Geospatial Analytics
Satellite data analysis, climate risk modeling, ESG reporting, and environmental monitoring. 2TB+ satellite data management experience.
Time-Series Modeling & Forecasting
Demand forecasting, anomaly detection, and predictive analytics. Full lifecycle from preprocessing to production deployment with LSTM, XGBoost, and Prophet.
Interactive Dashboards & BI
Interactive analytics dashboards with country-level maps, KPI tracking, and real-time monitoring. 5+ production dashboards delivered.
MLOps & Data Pipelines
End-to-end ML infrastructure: automated training, CI/CD, Docker containerization, cloud deployment, and ETL pipelines at scale.
Statistical Consulting & R Packages
Advanced multivariate analysis, time-series modeling, and custom R package development. Published author on CRAN.
NLP & Computer Vision
Text classification, document extraction, image recognition using state-of-the-art transformer and CNN architectures.
Tools & Technologies
From research to production
A track record spanning marine research, freelance consulting, and industry ML engineering.
— Data Science Team- ML infrastructure for peril risk assessment
- Engineered end-to-end ML pipeline
- 3 interactive dashboards with country-level maps
- Automated data management for 2TB+ satellite data
- Customer churn models (XGBoost), price forecasting (LSTM)
- Full ML lifecycle: preprocessing to deployment
— 400+ hour program- 10 projects: regression, classification, clustering, NLP, CV
- Deployed 3 ML models via REST APIs with CI/CD
- Interactive dashboards with Streamlit and Dash
Boulogne-sur-Mer- Automated 500GB+ data extraction — saved 50h/month
- 70% computation time reduction via script optimization
- Mentored 4 researchers in R, reproducible research
Boulogne-sur-Mer- Bloom detection algorithm on 10+ years satellite data
- 50,000+ km² coastal analysis (MODIS, Sentinel)
- Published BDAlgo R package on CRAN
- 2 peer-reviewed publications, 1 dataset
- Statistical consulting and data visualization for 15+ clients
- Script optimization: 40-60% performance gains
Centre Brest- Analyzed 15+ years satellite data (1TB+)
- Multivariate techniques (PCA, niche modeling)
- Mentored 2 Master's students
- 2 first-author papers, 1 dataset
- Novel statistical framework (WitOMI) for niche shifts
- Analyzed 20+ years monitoring data (10M+ records)
- 3 first-author papers, 2 CRAN packages (subniche)
Projects with measurable outcomes
Real deliverables, real impact — not just notebooks.
Satellite Image Analysis Platform
Deep learning system for land use classification and change detection from satellite imagery using PyTorch with ResNet/EfficientNet and interactive Folium maps.
→ Multi-temporal change detectionTime Series Demand Forecasting
Multi-model forecasting combining Prophet, LSTM, and XGBoost for retail demand prediction with seasonality detection, anomaly handling, and ensemble forecasts.
→ Ensemble forecast with backtestingBI Dashboard Suite
Interactive business intelligence dashboard with Plotly Dash and DuckDB featuring KPI cards, cross-filtering, drill-down charts, role-based access, and PDF/CSV export.
→ Multi-page BI dashboard with RBACReal-time Fraud Detection System
Production-grade ML pipeline for credit card fraud detection with real-time streaming inference, SHAP explainability, A/B testing, and a Streamlit monitoring dashboard.
→ Multi-page real-time dashboardIoT Anomaly Detection System
Unsupervised anomaly detection for manufacturing IoT sensors using Isolation Forest, PyTorch Autoencoders, and DBSCAN with real-time scoring and alerting.
→ Real-time anomaly scoring pipelineETL Orchestration Pipeline
Airflow-based ETL pipeline with multi-source extraction, dbt transformations on DuckDB warehouse, SCD Type 2 snapshots, data quality checks, and lineage tracking.
→ Automated ETL with data qualityClimate Risk Data Infrastructure at Raincoat LLC
End-to-end ML pipeline, 3 interactive dashboards, and automated satellite data management for parametric insurance — serving 10+ countries.
Published, peer-reviewed, open source
Contributing to science and the data community through packages and publications.
BDAlgo — Bloom Detection Algorithm
subniche — Ecological Niche Modeling
Environmental Impact on Harmful Species
Harmful Algae Niche Responses
Realized Niche Analysis of Phytoplankton
Within Outlying Mean Indexes
Environmental Impact on Harmful Species
Harmful Algae Niche Responses
Harmful Algal Blooms Dynamics
Estuarine & Coastal Shelf Science
Ecohydrology & Aquatic Ecosystems
Trusted by researchers & engineers
From question to solution
Discovery Call
We discuss your data, goals and constraints. Free, 15 min, no commitment.
Scoping & Proposal
Clear approach, timeline, deliverables, and pricing — fixed or time-and-materials.
Build & Iterate
Weekly check-ins, shared repos, transparent progress. You're never in the dark.
Deliver & Support
Polished deliverables, documentation, and handoff support to your team.
Let's talk about your data
Book a free 15-minute discovery call. No pitch, no pressure — just an honest conversation about how data science can help your business.
Book a Discovery Call → Email Me Download CV