Posted about 6 days ago

Sr Data Engineer


●Working experience with Redshift and knowledge of best practices for tuning its performance

●Advanced knowledge of Python

●Experience writing and understanding complex SQL queries

●Advanced knowledge of AWS Big Data services, such as EMR, Glue, Athena

●Working experience orchestrating data pipelines with tools like Airflow

●Experience with stream-processing systems such as Apache Spark Streaming, Kinesis, Apache Kafka

●Working experience with RDS, Elasticsearch, Lambda functions, EC2, S3

●Working knowledge of messaging and data pipeline tools like Apache Kafka, Kinesis, SNS, SQS

●Working experience with logging and monitoring tools like Elasticsearch Service, CloudWatch

●Knowledge of infrastructure as code and CloudFormation

●Working experience automating the deployment and operation of data pipelines

●Industry Experience: 

○Developed at least one data pipeline that collected or streamed, stored, and processed (ETL) data for various business use cases.

○Experience with large structured, semi-structured, and unstructured data sets from real-time/batch streaming data feeds.

Personal Traits Required:

●Experience working at a product company, or comparable product-style software development experience

●Comfortable working in a small, fast-moving, ambitious start-up environment

●Comfortable in an environment where people hold ambitious expectations of themselves and each other

●Flexible mindset and able to deal with ambiguity

●Excellent communication in person, on the phone, and in writing

●Comfortable communicating with a wide range of individuals, including peers, juniors, seniors, and executives

●Intellectually curious, forward-thinking, and willing to suggest and try new technologies and creative approaches to problems