Title: Site Reliability Engineer with Kubernetes
Location: SFO, CA (THESE ARE 100% REMOTE POSITIONS. NO NEED TO BE ONSITE)
Duration: Long Term
Major skills: AWS or Azure, Docker, Kubernetes, Terraform
PLEASE SUBMIT RELIABLE CANDIDATES.
A Day in the Life:
Responsible for infrastructure maintenance, availability, performance, and cost reduction.
Dive deep to resolve problems at their root and troubleshoot services related to the big data stack in our AWS/Linux infrastructure.
Develop software tools to give insights into costs & utilization patterns.
Enhance and maintain our monitoring infrastructure.
Develop automation tools for managing our cloud infrastructure.
Improve engineering standards, tooling, and processes
Participate in an on-call rotation alongside the engineers who build our production backends
What You Need:
You should have 5+ years of experience managing & troubleshooting large-scale distributed systems, and bring a start-up mentality.
Familiarity with container and infrastructure provisioning tools like Docker, Kubernetes, Ansible, Chef, CloudFormation & Terraform.
Excellent Linux and troubleshooting skills
You have a passion for solving problems using open source software
You are an expert in Python/Bash and you are proficient in Linux.
Familiarity with the big data stack: HDFS, HBase, YARN clusters, Elasticsearch
Strong experience working in an AWS environment and with other server virtualization technologies
Experience working with a monitoring stack such as Sensu
Bachelor's degree in computer science
Knowledge of SQL, AWS Redshift & AWS EMR
Title: Sr. DevOps Engineer with very strong Kubernetes experience (Principal Engineer level)
Location: SFO, CA (THESE ARE 100% REMOTE POSITIONS. NO NEED TO BE ONSITE)
Duration: Long Term
10+ years of software delivery experience, including 3+ years in a DevOps role with Kubernetes
Major skills: Kubernetes, Argo, Git, Jenkins, Terraform, Maven, JUnit, Docker, JMeter, Artifactory
Extensive experience in cloud, specifically containerized infrastructure – Docker, Kubernetes
Expert level knowledge of automated builds and deployments – Git, Jenkins, CI/CD flows, etc.
A strong understanding of networking infrastructure and protocols
Understand how to secure and protect the application and infrastructure end to end.
Excellent troubleshooting and problem-solving skills
Integrating modern SaaS solutions and vendors with existing systems and services
Continuously learning and expanding your skills to help build a well-rounded team
Automating yourself out of today's job so that you can move on to the next big challenge
Delivering system automation by setting up continuous integration/continuous delivery pipelines