ABOUT THIS JOB
In your role you will be part of a development team responsible for developing, improving and maintaining our identity graph, which deterministically and probabilistically groups media consumption devices into persons, and persons into households. This data structure powers our marketing activation business, and helps our clients convey their advertising messages in the device- person- or household- level.
The construction of the identity graph involves Apache Airflow-based data pipelines, triggering AWS EMR-based Scala-developed Apache Spark jobs handling tens of TBs of data.