Spanish version here
Freelance computer engineer with 15 years of experience. Specialized in data projects: data development and cloud infrastructure. Proficiency in Python for backend development, data analytics, machine learning and AI processes. Working as an individual contributor or with my team (skilled freelance data scientists and web devs). CTO and Tech Lead in several startups and projects.
π§ Email: luzfrias at circiter dot es
π¦ Twitter: koldLight
ππ± Github: koldLight
π LinkedIn: Luz Frias
π Company website: Circiter
Roya - USA marketing startup
Designed and built an AI-powered pipeline to summarize marketing data and provide an interactive chatbot for business insights. The chatbot enables users to ask questions about sales trends, conversion performance, and top- and bottom-performing products.
Technologies: OpenAI platform integrated in R and Shiny.
Decedario β Spanish startup
Recording of EEG signals using an OpenBCI signal acquisition device, along with the software needed to log sessions and provide an analytics dashboard for neuropsychology professionals. The system is designed to evaluate sessions conducted by professionals with patients with neurodivergences or brain injuries.
I supported the startup throughout the process of selecting the device, applying to accelerator programs, building PoCs, and developing and deploying the final system.
Technologies: Python, OpenAI, Streamlit, MNE.
Hareas β Spanish startup
Responsible for the backend, database, and infrastructure of a system providing enriched information on rural land parcels in Spain. The platform is designed for land purchase and sale professionals, farmers, and other stakeholders. Serverless infrastructure design to handle computing and traffic peaks and reduce costs.
Technologies: AWS, Lambda, Martin, Python.
Oscar Car Rental - Dutch startup
Complete replacement of the productβs vehicle rental search system. Achievements:
Technologies: Python, Polars, Kubernetes.
Instituto de Salud Carlos III (ISCIII) - Spanish public health organization
Automation of the process of mortality data ingestion from civil registries, processing, modeling and generating reports and alerts from MOMO. The objectives were to alert and measure excesses and attribute mortality to events such as heat waves or flu.
MOMO received a lot of attention during the Coronavirus health crisis and it was a key project to estimate the real impact of the epidemic on Spanish mortality.
π MOMO
π Kairos
During the health crisis, I also developed the ISCIII data publication and monitoring panel, and the calculation of the reproducibility index by region, which was reported to other organizations.
π COVID-19 Dashboard
Main technologies: R, Python, Docker, Flask, Flexdashboard.
Modeling techniques: GAM, mixed-effects models, survival analysis.
Instituto de Salud Carlos III (ISCIII) - Spanish public health organization
Design and build the new surveillance system for the endemic phase of the COVID-19, and the rest of existing respiratory infections such as the influenza virus. The main objective of this application is to unify the data provided from the different autonomous communities of Spain and run automatic rules and validations to ensure data integrity. This system also enables the administrators from changing the required data without code upgrades, so it adapts quickly to regulation changes.
π System information (spanish only)
Main technologies: Python, Docker, Flask, MySQL.
Roya - USA marketing startup
Create a demo platform for the product. The client was a NY based startup with a strong analytical product for optimizing marketing processes in big companies. This included:
Main technologies: R, Shiny, Shiny Server, flexdashboard, Python, Flask, Linux, nginx.
Atlantica - Renewable energy
The main objective was the development of a heating alarm system on the electric transformers installed in multiple wind plants in South-America. The project included a detailed Exploratory Data Analysis, data inconsistency detection, and the heating model. The biggest challenge was the feature engineering, to take into account the physical rules and the heat behavior over time.
Main technologies: Python, Jupyter Notebooks.
Modeling techniques: GAM.
Johnson Controls - Buildings infrastructure and IoT
Development of a system to analyze people transit in physical spaces through cameras. My main objectives were:
Main technologies: R Shiny, Python Flask, Bootstrap, Google Cloud (Compute Engine, Cloud Functions and BigQuery).
Modeling techniques: trigonometry, minimization of error functions.
Acciona Motosharing - Mobility
Design, develop and deploy the Data Warehouse to analyze the e-scooter sharing quality service, follow the vehicles incidences, the customers behavior and take strategic decisions. I led the tech team.
The goals we achieved:
Main technologies: Python, Flask, Dash, PostgreSQL, GCP.
MDirector - Email marketing platform
Development of a prediction system for openings and clicks in an email marketing system.
The objectives:
Main technologies: Python, Docker, AWS, MySQL
Modeling techniques: random forests, linear optimization.
OrbitalAds - SEM optimization startup
Creation of machine learning processes to increase the performance of Google Ads campaigns. Specialization in:
Main technologies: Python, Docker, Google Cloud (Compute Engine, Cloud Build, BigQuery, GKE, datastore).
Modeling techniques: TFIDF, cosine distance and other NLP and other common techniques in NLP.
Johnson Controls - IoT for big corporations
Development of a management application for large IoT systems in buildings to measure supplies use, temperature, air quality and others. Integrated in Metasys, a worldβs leading building automation system.
π Metasys
Main technologies: Python, Flask, React.js, PostgreSQL.
Reny Picot - Dairy industry
The objective was to propose quantities to be produced per product to minimize losses due to stock breakage or expiration. We develop a demand forecasting model, a stock evolution simulator and a web panel to view and download the results.
Main technologies: R, Shiny.
Modeling techniques: simulation and optimization techniques.
Kren4 - Renewable-energy power plants management platform
Modeling of production in wind and solar generators based on sensor readings to detect machine failures. Integration with the customer IoT system and report automation. Development of an API to order actions and view system results.
Main technologies: ElasticSearch, Python, AWS, MySQL.
Modeling techniques: random forests.
EAE Business School, EDIX, Bank of Spain, Acciona and other companies
Associate professor of the Big Data master at EAE. I also usually teach programming training in R and Python oriented to data analysis in other companies.
This is the tool set I'm used to work with:
Complex queries and optimization of these in relational databases (MySQL, Oracle, Teradata, ...). PostgreSQL at a deeper level, such as optimization of index types, DB parameters, or use of PostGIS.
Sporadic use of non-relational databases like Mongo.
Python (pandas, scikit-learn, numpy, ...) and R.