SRE Big Data Lead

Key Responsibilities

● Responsible for support and operations of several 100’s of critical Data Analytics Applications, Machine Learning Models and APIs, Microservices built on open source and big data platform based on Hadoop, Yarn, Spark, Airflow, Kafka, Aerospike, MariaDB, EDB, Hbase, etc. running on on-premises Open Shift VM’s and PCF Containers.

Responsible for big data applications operations architecture, observability automation, capacity planning, cost optimization to continuously improve stability, efficiency and service level objectives.

Providing best-in-class user support for the Big Data Analytics and Streaming Applications running on our Hadoop ecosystem.

Troubleshoot incidents, facilitate blameless post-mortems and ensure appropriate remediation

Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes. Identify application patterns and analytics in support of better service level objectives  Design and implement auto scaling, self-healing and resiliency patterns

Design and implement fully automated software and product upgrades, change management, and release management solution for continuous integration and delivery.

Develop and implement migration to public cloud AWS or GCP.

Coach and lead teams by s haring in the collective team vision and successfully promoting the why and how to all teams

Requirements :

Overall 15+ years of experience with at least 4+ years of leading technical teams

Engineering/Computer Science degree or equivalent experience

5+ years of scripting/automation experience (bash, python or perl)

Strong programming experience in one or more of: Java, Python, Scal

1. Hadoop/Yarn 2. Spark Compute 3. Airflow 4. Kafka streaming 5. Hbase, Maria DB, EDB 6. Redis, Aerospike 7. Presto 8. QlikView, Tableau  Good understanding of cloud native architecture, microservices, data management principles, big data, middleware technologies distributed computing  Experience deploying & supporting microservices and cloud native applications  Extensive experience in application/system/network performance and availability monitoring (ELK stack, Grafana, Tivoli, Splunk, etc..)  Proven technical leadership experience, including the ability to quickly understand an issue, appropriately / efficiently troubleshoot to detailed levels and direct swift resolution.  Strong ownership, collaboration, and communication skills Desired  DevOps testing and release techniques (e.g, A/B Testing, Blue / Green Deployments and Canary Release, etc).  Experience with CI/CD pipelines tooling (e,g,: Jira, Jenkins, Maven SonarQube, Fortify, NexusIQ etc.).  Expertise in monitoring and scaling environments. DevOps tools/technologies (Docker, Kubernetes, OpenShift) will be preferred.  Expertise in public/private cloud IT infrastructure preferably in OpenShift, AWS or GCP.

Apply for this position

Allowed Type(s): .pdf, .doc, .docx