Operations & Site Reliability Engineer


People at Apple don’t just build products; they craft the kind of experience that has revolutionised entire industries. The diverse collection of our people and their ideas encourages innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Every single day, people do amazing things at Apple.

Join Apple’s Service Management team as an Operations and Site Reliability Engineer. You will inspire the team toward operational excellence and improve the availability, scalability and security of multiple highly scalable, fault-tolerant, business-critical global applications in the Apple Service Management space. Lead operational planning, readiness, monitoring, measurement of system health, incident management and communication for these enterprise-level applications. Build and manage systems, infrastructure and applications through automation. Develop tools that bring operational parity across all applications to improve the team’s efficiency. The candidate’s skills will be a strong blend of operations leadership and engineering.

Key Qualifications

  • Strong sense of ownership, customer service, and integrity demonstrated through clear communication
  • Experience leading and driving operations teams for large-scale, critically important applications in a 24x7 operations and on/off-shore support model
  • Experience in strategizing and achieving operational excellence in global distributed systems
  • Strong knowledge of production support practices for managing web and iOS applications
  • Experience in fixing, analyzing logs, building metrics and operational dashboards
  • Passion for eliminating repetitive manual processes using automation
  • Experience in interpreting data from systems like Hubble, ExtraHop, Splunk and other monitoring tools
  • Fundamental understanding of distributed systems, including microservices, message brokers and versioning
  • Experience in Java, JEE, REST, Swift/Objective-C, database schema design and data access technologies
  • Deep understanding of programs written in a high-level programming language such as C, Java, Ruby, Python, or Perl
  • Experience managing large numbers of diverse systems with containers (Docker), build systems (Jenkins, Ansible, Spinnaker), and infrastructure as a service (Kubernetes, AWS)
  • Understanding of the Linux Operating System, including Kernel, Memory, Process, Threads, Static / Shared Libraries, IPC, Signals
  • Understanding of standard networking protocols and components such as HTTP, DNS, TCP/IP, ICMP, the OSI model, subnetting and load balancing is a plus
  • Experience in ethical hacking, system security and fraud monitoring is an added advantage
  • Self-starter, flexible, motivated to learn in a fast-paced environment and comfortable working as part of a team of versatile engineers
  • Excellent communication and leadership skills
  • Excellent organizational and documentation skills
  • Passion for quality and the optimal user experience


We are looking for a highly technical and motivated individual who will own ultimate responsibility for the operations of Service systems, working with teams to ensure 24x7 operations. The role also covers ensuring smooth rollout of the applications our customers use every day, improving our tool suite, and developing new tools to improve operational efficiency and product quality.

  • Identify and handle key performance indicators for global applications. Drive operational improvements, metrics tracking and implementation of standard methodologies through level-one production support and engineering teams.
  • Handle the production backlog with the business team and prioritize fixes in planned releases. Keep a close tab on all product releases and ensure smooth and safe deployments in production. Drive and handle product rollouts and partner/retail onboardings.
  • Lead the Production Support team to ensure all servers and applications are monitored on an ongoing basis, with alerts covering CPU, memory, and storage utilization, as well as network and security issues, and performance tuning. Monitor the production footprint and lead the effort for capacity planning.
  • Keep track of and interact with the Data Center, Network and other system teams to plan OS patches, system upgrades and maintenance.
  • Drive the team to build and implement automated application health checks that ensure the high availability of applications.
  • Along with applying your technical skills, you will have the opportunity to let your creative juices flow. You will work very closely to design, develop and operate the best development support and automation tools you can imagine.

Education & Experience

Bachelor’s degree or equivalent




Job Posted

9 months ago

Experience Level

3-7 years


Hyderabad, Telangana, India




Related Jobs

Lead Site Reliability Engineer

JPMorgan Chase & Co.

Hyderabad, Telangana, India

Posted: 8 months ago

Lead Site Reliability Engineer at JPMorgan Chase within the Consumer and Community Banking of Infrastructure and Production Management Team. Hold a leadership role, demonstrate strong knowledge across multiple technical domains, and advise others on technical and business issues. Lead resiliency design reviews, act as a technical lead, and provide mentoring. Champion site reliability culture and practices, improve reliability and stability, and identify and solve technology-related bottlenecks. Required qualifications include formal training or certification, deep proficiency in reliability and scalability, fluency in programming language, proficiency in observability and CI/CD tools, experience with containers and troubleshooting networking technologies.

Site Reliability Engineer III

JPMorgan Chase & Co.

Hyderabad, Telangana, India

Posted: 9 months ago

JOB DESCRIPTION

There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community Banking of Infrastructure and Production Management, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.

Job responsibilities

  • Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
  • Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
  • Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
  • Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
  • Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
  • Develop, test and debug automated tasks (Apps, Systems, Infrastructure)
  • Troubleshoot priority incidents, facilitate blameless post-mortems

Required qualifications, capabilities, and skills

  • Minimum 7 years of overall experience in IT industry
  • Formal training or certification on site reliability engineering concepts and 3+ years applied experience
  • Proficient in at least one programming language such as Python, Java/Spring Boot
  • Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
  • Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
  • Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
  • Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
  • Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker

Preferred qualifications, capabilities, and skills

  • Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm
  • Adept in the development of automated tools, systems, and services in multiple technology domains
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage and networks)
  • Excellent debugging and troubleshooting skills

ABOUT US

JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management. We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.

ABOUT THE TEAM

Our Consumer & Community Banking division serves our Chase customers through a range of financial services, including personal banking, credit cards, mortgages, auto financing, investment advice, small business loans and payment processing. We’re proud to lead the U.S. in credit card sales and deposit growth and have the most-used digital solutions – all while ranking first in customer satisfaction.