Thursday, 26 June 2025

Role: SRE || FL-Remote (EST Timezone only)

0 comments
Hello,


Role: SRE
Location: Miami, FL (Remote Ok - EST Timezone Only)

Job Summary:
We are seeking a proactive and detail-oriented Site Reliability Engineer (SRE) with 4–7 years of experience to support and optimize large-scale systems in the telecom domain. The ideal candidate will have strong hands-on experience across backend technologies, workflow engines, and monitoring tools, and will play a critical role in maintaining the availability, performance, and scalability of production systems.
You will collaborate with development, infrastructure, and DevOps teams to automate operations, resolve complex issues, and build resilient systems.
________________________________________
Key Responsibilities:
•    Monitor, support, and maintain the reliability of production systems and services across distributed environments.
•    Design and implement observability, alerting, and dashboards using Grafana, Elasticsearch, and Prometheus.
•    Develop and maintain SRE tools and scripts using Java, Spring Boot, and supporting technologies.
•    Implement robust workflows and orchestration using Camunda Zeebe.
•    Ensure efficient message streaming and data pipelines using Apache Kafka.
•    Optimize caching, session management, and data access using Redis, MongoDB Enterprise, and MySQL.
•    Collaborate with developers to build scalable and fault-tolerant microservices.
•    Lead root cause analysis, performance tuning, and incident management processes.
•    Participate in on-call rotations and establish best practices for high availability and recovery.
________________________________________
Required Skills:
•    4–7 years of hands-on experience in SRE, production support, or backend engineering roles.
•    Strong experience in Java and Spring Boot for building and maintaining backend services.
•    Expertise in Apache Kafka, Elasticsearch, and Redis for distributed systems.
•    Experience with Camunda Zeebe for workflow automation and orchestration.
•    Practical knowledge of MySQL and MongoDB Enterprise.
•    Proficiency in building dashboards and alerts using Grafana and log analysis tools.
•    Experience with container platforms (Docker) and basic knowledge of orchestration.

Technology version Chart:
 
Technology
Version
Kafka
3.2.3
Camunda Zeebe
8.0.5
Elasticsearch
8.4
Redis
7.4.1
Java
Java 17
Spring boot
3.3.3
Grafana
9.5
MySQL
8
MongoDB Enterprise
6.0.x
Rancher
2.10

________________________________________
Preferred / Optional Skills:
•    Familiarity with Rancher, Kubernetes, or other container orchestration platforms.
•    Understanding of CI/CD practices and tools such as Jenkins or GitLab.
•    Experience working in telecom projects or large-scale enterprise environments.
•    Knowledge of incident management and SLO/SLA practices.
________________________________________
Soft Skills:
•    Strong analytical and problem-solving skills.
•    Excellent communication and cross-functional collaboration.
•    Ability to work in fast-paced, 24/7 environments with a focus on service uptime.

--
You received this message because you are subscribed to the Google Groups "Latest C2C Requirements2" group.
To unsubscribe from this group and stop receiving emails from it, send an email to latest-c2c-requirements2+unsubscribe@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/latest-c2c-requirements2/CAMjeKS93m3drtDDFyY4Pp19iPunUJSvH2iRjQb0CK-oP3-xftw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

No comments:

Post a Comment