ML Infrastructure Engineer


Start ASAP
Full-time / Contract
Competitive salary reflecting your motivation and maturity (fixed monthly salary + stock options)
Bratislava/Brno/Prague/Vienna/Warsaw or relocation to Bratislava

We are building a world-class platform right out of Slovakia for clients on 5 continents. We enable e-commerce companies to show relevant, personalized content for the ultimate shopping experience. With the core of the architecture done, we are adding several key components – and you can help us do it well.

What you will do:

  • As an ML Infrastructure Engineer, you’ll be responsible for the infrastructure Data Scientists use to run and scale machine learning models.
  • Run Kubernetes and Hadoop (Spark) clusters using mainly Google Cloud Platform.
  • Build tools to improve experiment iteration time, help design and run experiments across hundreds of machines and perform capacity planning.
  • Automate and monitor hundreds of deployed online machine learning models.
  • Manage continuous integration, deployment and rollbacks of malfunctioning or degraded models.
  • Closely partner with Data Scientists and ML Engineers.



  1. Model deployment at scale. We’re currently running hundreds of live production ML models. How do we roll out new versions? How do we roll back when model quality decreases?
  2. Model performance monitoring. How do we evaluate the performance of models in production?
  3. Microservice architecture. How do we automate deployment, monitoring and alerting for hundreds of production microservices?
  4. Scaling. How do we scale deployed machine learning models according to current load and keep response times below 50 ms?

Our expectations of you:

  1. Experience in building and monitoring large-scale consumer-facing software in a cloud environment.
  2. Willingness to debug problems across the stack (network, performance problems or memory leaks).
  3. Machine learning or high-performance computing experience is a plus.
  4. Willingness to improve your skills and learn new things fast.
  5. Experience with cloud services, particularly Google Cloud Platform, is a plus.

Our tech stack and product:

  • We mostly use Python, but there are opportunities to use C++, Java and Go.
  • For data storage and manipulation we’re using Apache Hadoop, Kafka and Spark.
  • Python libraries: Flask, Gensim, Pandas and NumPy.
  • Google Cloud infrastructure with Kubernetes for container orchestration and GitLab for CI/CD.
  • For ad-hoc analysis and model training you can use BigQuery, Google ML Engine, Datalab or Dataflow.

What you will like about Exponea:

  • A great deal of freedom and trust. At Exponea we don’t clock in and out, and we have neither corporate rules nor long approval processes. We are interested in results from day one. The freedom goes hand in hand with responsibility.
  • Learning, personal development and professional growth.
  • Flexible working hours. Remote work and home office are possible.
  • Reward based on your commitment and capabilities.
  • The possibility to co-own the company through an employee share program, to which we have allocated over 50 per cent of shares.
  • An easy-to-access downtown office.
  • An array of fringe benefits including hardware (pick the machine and tools you like), books, gym classes and language classes.

Like this role? Apply here.

If you have any questions, please contact us at career@exponea.com