Java Engineer (Backend)

Mid level Developer

Ref: 218Thursday 19 May 2022

Great opportunity to join a globally distributed team of over 190  working from over 28 countries who are on a mission to enable customers to extract the data they need to continue to innovate and grow their businesses.

You could join a successful team that has led the way in building powerful, easy-to-use tools to collect, format, and deliver web data, quickly, dependably, and at scale. Currently, over 2,000 companies and 1 million developers rely on their tools and services to get the data they need from the web.

Their new SaaS-based extraction tool provides an API for automated e-commerce and article extraction from web pages using Machine Learning.  You will use your creative skills to work on this distributed application written in Java, Scala and Python.  The components communicate via Apache Kafka and HTTP and are orchestrated using Kubernetes.

Working at the backend you will be designing and implementing distributed systems: large-scale web crawling platforms, integrating Deep Learning-based web data extraction components, working on queue algorithms, large datasets, creating a development platform for other company departments.   The technical challenges will provide you with a very stimulating engineering environment!


  • Work on the core platform: develop and troubleshoot Kafka-based distributed application, write and change components implemented in Java, Scala and Python.
  • Work on new features, including design and implementation. You will have ownership of the complete lifecycle of your features and code.
  • Solve distributed systems problems, such as scalability, transparency, failure handling, security, multi-tenancy


  • 1+ years of experience building large scale data processing systems or high load services
  • Strong background in algorithms and data structures.
  • Strong track record in at least two of these technologies: Java, Scala, Python, C++. 3+ years of experience with at least one of them.
  • Experience working with Linux and Docker.
  • Good communication skills in English.
  • Computer Science or other engineering degree.


  • Kubernetes experience
  • Apache Kafka experience
  • Experience building event-driven architectures
  • Understanding of web browser internals
  • Good knowledge of at least one RDBMS.
  • Knowledge of today’s cloud provider offerings: GCP, Amazon AWS, etc.
  • Web data extraction experience: web crawling, web scraping.
  • Experience with web data processing tasks: finding similar items, mining data streams, link analysis, etc.
  • History of open source contributions

The company is based in Ireland but the role is offered as a fully remote opportunity and as the company has operated this model for over 10 years they have well-established processes and procedures for working successfully in a remote environment.  An attractive salary and package will be offered to the successful candidate!