Were looking for a Senior Data Engineer in the Data Collection Ingest team to contribute to the design and coding of data ingestion services & pipelines. This role involves working as part of a team that handles millions of requests per minute across multiple servers, and is responsible for a wide-range of data pipelines, processing billions of events each day.
Why is this role so important?
As the most trusted platform for measuring online behavior, millions of people rely on insights daily as the ground truth for their knowledge of the digital world. Producing these insights requires large scale raw data to be ingested reliably in high scale to provide stable signals for analysis. As a Data Engineer you will have the opportunity to perform hands-on work and own raw data ingestion pipeline end-to-end. Your work will have a direct impact on the quality and reliability of our data and the insights that our products are delivering to our customers.
Why is this role so important?
As the most trusted platform for measuring online behavior, millions of people rely on insights daily as the ground truth for their knowledge of the digital world. Producing these insights requires large scale raw data to be ingested reliably in high scale to provide stable signals for analysis. As a Data Engineer you will have the opportunity to perform hands-on work and own raw data ingestion pipeline end-to-end. Your work will have a direct impact on the quality and reliability of our data and the insights that our products are delivering to our customers.
Requirements:
Has 5+ years of experience in developing code for big data infrastructure. Proficiency in technologies such as: Databricks, Spark, Airflow, Firehose, SQS, or other similar tools.
Proven experience working with high scale on AWS or any other cloud provider. Experience in architecture and design of large-scale and high performance production systems.
Comfortable taking challenges and learning new technologies.
Excellent communication skills with the ability to provide constant dialog between teams.
Ability to take business requirements and translate them to technical alternatives by performing risk management and evaluating tradeoffs.
Has 5+ years of experience in developing code for big data infrastructure. Proficiency in technologies such as: Databricks, Spark, Airflow, Firehose, SQS, or other similar tools.
Proven experience working with high scale on AWS or any other cloud provider. Experience in architecture and design of large-scale and high performance production systems.
Comfortable taking challenges and learning new technologies.
Excellent communication skills with the ability to provide constant dialog between teams.
Ability to take business requirements and translate them to technical alternatives by performing risk management and evaluating tradeoffs.
This position is open to all candidates.


















