Alex Molas CV

Summary

8+ years building production ML systems: ranking, LTV, experimentation, and LLM applications across subscriptions, marketplaces, and logistics.

Spent the last eight years convincing computers to make decent predictions and executives that nobody can predict next week's lottery numbers. Expert at explaining why everyone's favorite metric is wrong and why that 99.9% accuracy model definitely won't work in production. Fluent in the art of setting realistic expectations and building pipelines that don't wake the team at 3am. Have mastered the dark art of productive async collaboration across time zones without losing sanity.

Experience

Senior Data Scientist · RevenueCat · Apr 2025 - present
- Owned multiple ML initiatives end-to-end as the sole data scientist on a cross-functional team, from opportunity sizing and product definition to deployment.
- Built and productized a per-user subscription signal (7-day subscription probability), scored at install time across all apps and exposed to developers; on holdout, top-decile predictions converted at 1.46x the average (uplift@10).
- Improved experiment winner prediction accuracy by +12pp by shipping an end-to-end predicted LTV feature (survival modeling, training pipelines, backend, and frontend integration).
- Quantified ~$960K/year in unrealized revenue across 11K+ experiments through counterfactual simulation, then designed a Thompson Sampling allocation approach with inverse probability weighting.
- Led the architecture of a real-time behavioral event pipeline to capture granular engagement signals at scale; validated that top-intent segments converted at up to 5x the baseline paid conversion rate.
Senior Data Scientist · Wallapop · Sep 2023 - Mar 2025
- Led ML initiatives for the Solr-based search team, focusing on matching, ranking, and software best practices.
- Trained a ranker ML model and deployed it to Solr, improving search-to-transaction ratio by +1.5%, adding 600K€ annually.
- Trained and deployed a BERT model for a query classification service, improving search-to-transaction ratio by +1%, adding 400K€ annually.
- Trained a PoC ranker model that used real-time features such as item popularity. Improved offline NDCG metrics by +10%.
- Developed PoC solutions for query understanding (intent extraction from queries and structured attribute extraction from descriptions) using LLMs.
- Refactored ETLs from manually executed notebooks to Spark jobs. Reduced execution time from days to hours, improving developer experience and scalability.
- Organized events to increase machine learning visibility: internal hackathons, Meetups, and conferences.
Senior Data Scientist · Stuart · Nov 2019 - Sep 2023
- Deployed a service that improved ETA accuracy by +30% using a deep learning model. Achieved a +28% improvement in cold-start locations.
- Designed and developed pipelines to automatically train, evaluate, and deploy ETA models.
- Built a distributed pipeline to process, on a daily basis, all events dumped from Kafka to S3, allowing data scientists to analyze them and train models on them.
- Designed an experimental dispatcher engine to solve the assignment problem using Python and OR-Tools.
- Mentored a senior software engineer who wanted to specialize in machine learning and data science.
Additional Data Science Experience
- 21 Buttons (Jun 2019-Oct 2019): Built a recommender system with implicit data, and an image + text-based clothing classifier.
- Privalia (Veepee) (May 2018-Jun 2019): Built a forecasting model for clearance sales. Created a pricing engine on top of the model.
- Gauss&Neumann (Oct 2017-Feb 2018): Developed tools for monitoring and optimizing SEM campaigns using Google AdWords and Python.

Talks, writing, and projects

www.alexmolas.com: data science blog since 2020 (60k visits in 2025). Notable posts: A search engine in 80 lines of Python (2024) and Semantic Unit Testing (2025).
Teaching Data Engineering at the EAE Master's in Big Data and Analytics (2022-2024).
"Static Typing in Python", workshop at PyDay 2024. Repo with code and slides.
"How to beat your friends in fantasy football", PyDay 2022. Blog and Slides.
1st place at Novartis Datathon (2021) and Aily Datathon (2022). Participated in a dozen other datathons.
"Field theory for recurrent mobility". Nature Communications 10, 3895 (2019).
"Streak Camera Calibration Using RF Switches". 5th IBIC, MOPG55 (2017).
"Social network analysis of communities in literature" poster. Won the IFISC Best Poster Award.

Education

MSc. Physics for Complex Systems · CSIC-IFISC 2016-2017
BSc. Physics · UAB 2012-2016
