Requirements
● Advanced SQL knowledge and experience with relational databases, including query authoring and working familiarity with a variety of database systems.
● Experience building and optimizing ‘big data’ data pipelines, architectures, and data sets.
● Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
● Strong analytical skills related to working with unstructured datasets.
● A successful history of manipulating, processing, and extracting value from large disconnected datasets.
● Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
● More than five years of related experience.
● Bachelor's or Master's degree in Computer Science, Statistics, Informatics, Information Systems, or another related quantitative field.
● Experience with big data tools: Hadoop, Spark, Kafka, etc.
● Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
● Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
● Experience with AWS cloud services: EC2, EMR, RDS, and Redshift.
● Experience with stream-processing systems: Storm, Spark Streaming, etc.
● Experience with object-oriented and functional programming languages: Python, Java, C++, Scala, etc.