Ondřej Měkota
Senior Data Scientist at ShipMonk. I’ve read computational linguistics at Charles University in Prague.
You can find me on Github, LinkedIn or you can send me an email.
Experience
- December 2024 - now ShipMonk; Senior Data Scientist
Working on various optimisations in logistics, warehouses, packagings - August 2021 – November 2024 Heureka Group; Machine Learning Engineer
Designed and maintained a machine learning pipeline for offer/product matching. I worked with neural networks, transformer language models, random forest models. We trained and deployed using Gitlab CI/CD, Kubernetes, Google Cloud Platform, MLFlow, Neptune, DVC, Ray. - December 2021 – November 2024 Czechitas; Lecturer
Czechitas is non-profit organisation; taught Python courses - July 2020 – June 2021 SAP; Data Science Intern
I worked on pattern mining in logs stored in an ElasticSearch cluster. - October 2020 – February 2021 University of Jena; Contract Software Developer
Short-term project. I developed a pipeline for morphological tagging, lemmatization and alignment in 30+ languages.
Education
-
2019–2021, Master’s (Mgr.) in Computational Linguistics
Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University. My thesis is on link prediction in inferred social network. I graduated summa cum laude. -
2016–2019, Bachelor’s (Bc.) in Computer Science
Faculty of Mathematics and Physics, Charles University. My thesis is on GAN-based anomaly detection.
Skills
- various ML methods (NLP “relatively ok”), reinforcement learning, …
- programming: mainly Python, common data science, ML libraries), to some extent C++, C, bash, Go, PHP
- devops stuff: k8s, helm, docker, prometheus, terraform, GCP, Gitlab CI/CD,..
- DBs: ElasticSearch, BigQuery, PostgreSQL, Redis, general principles like data mesh
Projects
- vim-python-docstring
An open source plugin for Vim to automatically generate docstrings for Python source code.
Downloaded by several thousands of users every year.