Site Reliability Engineer - AWS/Hybrid Cloud/Terraform/CI-CD/Python/Linux

Title: Site Reliability Engineer


Who we are

Today’s business environment is more than challenging: it is a period of disruption driven by the pandemic, global business change, and internal process complexity. To focus on simplicity and the best customer experience, we need great talent and the right skill sets. This is now a mantra for the Cisco leadership team and for us.


Cisco is transforming its platforms to run the next generation of cloud-native and multi-cloud services. This role offers an opportunity to transform how infrastructure platforms are developed and managed: fully automated in software and, at the same time, highly available, with self-healing, full-lifecycle monitoring, and management capabilities.


Who you will work with

Cloud Infrastructure Platform Services (CIPS), part of Hybrid Cloud Infrastructure and Operations, is responsible for the architecture, design, build, and operation of private cloud (OpenStack, VMware, and various PaaS platforms) and public cloud services (AWS and GCP), helping clients choose the right IaaS and PaaS offerings for their workloads. CIPS also provides technical consultation on architecture, deployment options, and managed services, supporting clients from onboarding to decommissioning through a GitOps operating model. The organization is currently focused on strengthening governance, security, and observability so that client workloads are fully visible, secure, and manageable, and can be supported reliably.

You will work alongside other Site Reliability Engineers who are passionate about cloud-based applications and about pushing forward the way we use multicloud platforms in our business. You will help build new microservices and infrastructure to improve our workflows. Our team works in a fast-paced, agile environment and is ready to learn new things at a moment's notice. We make use of the best Cisco has to offer, integrating these products with a wider spectrum of third-party services to provide a best-in-class cloud service.

What you’ll do


You will be a member of a site reliability engineering team that uses tools and integrations for a portfolio of cloud infrastructure services to deploy and manage Cisco’s critical business services through GitOps. We are looking for an enthusiastic individual with extensive experience in DevOps and GitOps to join a dynamic, agile team of talented engineers who enable customers to move their workloads to a cloud-native hybrid cloud model using both private and public clouds.


· Write Terraform automation for customers' infrastructure and application deployments in AWS and GCP

· Integrate the observability stack and manage the lifecycle and operations of hybrid cloud infrastructure

· Ensure the quality, performance, robustness, and scalability of the services that are implemented; fix bugs and triage issues

· Automate the development, testing, and deployment processes through CI/CD pipelines (GitHub, GitHub Actions, Jenkins, Helm, ArgoCD)

· Champion and drive the adoption of Infrastructure as Code (IaC) practices and mindset

· Work across the full software development lifecycle (Python), including design, development, testing, packaging, deployment, upgrade, and support

· Collaborate with other core services team members to define roadmaps, write clear user stories with well-defined acceptance criteria, design, and build solutions

· Apply global knowledge of IT infrastructure to develop standard solutions that can be leveraged across multiple areas; contribute to the development of new technical principles and concepts

· Evaluate new and emerging technologies and determine their applicability to the group

· Proactively engage with and/or create cross-functional teams to solve problems or add business value

· Generate ideas and/or technical strategies and present them to peers for feedback

· Influence others to support and implement ideas and/or technical strategies through collaboration with managers and peers in the organization

· Create standards and policies, and influence technology decisions beyond your own functional area or project; practice DevOps, supporting applications from development through the operations lifecycle

· Determine and set SLOs, and create adequate monitoring and logging for features so that the SLOs can be measured

Job Posted: 10 months ago

Experience Level: 3-7 years

Location: Bengaluru, Karnataka, India
