Be the Owner of our Data Pipelines.
We use Airflow as our ETL, a very powerful Python framework that allows us to break down any complex problem/process into smaller ones. Each component can be implemented directly in Python, or in any other language since they could be docker images executed in ECS instances. We also have a Streaming processing pipeline to collect events and allows our ML models to use real-time data.
Overall, we can resume the main responsibilities as follow:
Designing and implement the data processing pipelines that better fit each given problem.
Investigating, evaluating and proposing different "data solutions", e.g., select the best DB for a given type of data, implement a Spark code to speed-up processes, implement parts of DAG in different languages if needed, etc.
Improving the performance of our ETL DAGs with high-quality code.
Automatize the training, building, and deployment of a series of Machine Learning algorithms that are used for very different applications.
You will be working in the Data Science team (read the last blog posts), together with Full-stack Engineers, Data Scientists and Data Analysts.
Ideally, we are looking for a senior profile that is also interesting in:
Supervising mid- and junior- Python/Data Engineer: given the great impact of our ETL to the entire company, we want to grow a team dedicated to it, capable of improving, maintaining, and boosting it even further.
Sharing her/his knowledge inside and outside Jobandtalent, raising the quality standards of the entire team with the aim of growing together.
Contributing to Open Source projects: we are using different Open Source frameworks and libraries, and one of our wishes would be to contribute to some of those projects, dedicating part of our time when possible.
Requirements and Skills
Bachelor’s degree in Math, Engineering, Stats or Quantitative field.
4+ years of proven experience programming in Python.
Different kinds of databases (e.g., RDBMS, NoSQL, Big-data frameworks).
Container development with Docker or Kubernetes.
The leadership of projects, services or products.
Extremely skilled programmer (e.g., unittest, production/staging experience).
Excellent verbal and written communication skills; ability to communicate effectively with different levels of management, as well as the business and technical communities.
(Nice to have) Previous experience with
ETL, data pre-processing, or data analysis.
Supervising junior and mid-level developers.
Stream processing framework (e.g., RabbitMQ, Kafka, Spark, Flink).
A valid work permit to be employed in Spain.
Fluent in English is a must.
Offer in Short
Ownership of projects with direct impact on the product.
Salary in the 45k-65k range (wide range, depending on experience).
Transparent Equity package.
Discount on Health insurance.
A yearly budget for Conferences/meetup/self-learning.
Working in the sunny Barcelona ~2 minutes walk to the city center.
Free fruit, free coffee and free lunch on Fridays!
Join us in one of the hottest startups in Spain, breaking into a new market worldwide, learning and contributing with your expertise in innovating the entire job market!