Apache Zookeeper Consultant – REMOTE

Job Overview

logoWe are seeking an experienced Apache Zookeeper Optimization Consultant to enhance theresiliency and performance of our distributed systems infrastructure. The ideal candidate willpossess deep expertise in Zookeeper configuration, tuning, and troubleshooting, with a strongunderstanding of distributed systems, high-availability requirements, and related technologiessuch as RabbitMQ, Redis, and Kafka.
Key Responsibilities:Performance Optimization:● Analyze the current Zookeeper setup and identify bottlenecks affecting performance.● Implement tuning measures for read/write latency, throughput, and leader election times.● Optimize JVM parameters and Zookeeper settings (e.g., tick time, heap size).Resiliency Enhancement:● Architect solutions for fault tolerance and disaster recovery.● Design and implement multi-region and multi-data center deployments.● Establish robust configurations for quorum consistency and failover mechanisms.Monitoring and Alerting:● Review monitoring tools (e.g., Prometheus, Grafana) to track Zookeeper health forresiliency.● Develop custom alerts for potential issues such as latency spikes, memory usage, andconnection limits.Collaboration:● Work closely with engineering teams to ensure Zookeeper is optimized and resilientalongside other components like Kafka, RabbitMQ, Redis, and custom services.● Conduct capacity planning to ensure scalability for future workloads.Qualifications:Experience:● 10+ years of hands-on experience managing and optimizing Apache Zookeeper inproduction environments at large scale.● Proven track record of designing resilient distributed systems.● Experience with RabbitMQ, Redis, and Kafka in distributed architectures.Technical Expertise:● Deep understanding of distributed systems, including Zookeeper internals (leaderelection, session management, quorum design).● Expertise in associated technologies like RabbitMQ, Redis, and Kafka, with anunderstanding of their integration into distributed environments.● Proficiency in monitoring and troubleshooting tools such as Prometheus, Grafana, orsimilar.Skills:● Strong scripting skills (e.g., Bash, Python) for automation.● Excellent problem-solving and communication abilities.Certifications (optional):● Relevant certifications in distributed systems, messaging technologies, or DevOpspractices are a plus.

Job Detail
Shortlist Never pay anyone for job application test or interview.