Oracle

Principal Service Reliability Engineer

Job Description

Summary

Own and scale mission-critical ERP/SaaS services while building intelligent, cloud-native capabilities. This role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments.

 

Key Responsibilities

- End-to-end service ownership: design for telemetry, security, resiliency, scalability, and performance; lead sizing/architecture; drive service health reviews and process simplification.

- Incident management and prevention: lead postmortems/RCAs, coordinate fixes, define repair items, and implement data-driven prevention and continuous improvement.

- AI/ML and GenAI delivery: design and integrate solutions with LLMs, RAG, agentic workflows, and conversational AI; build low-latency model serving and retraining pipelines.

- Application engineering: develop performant microservices for distributed, containerized, cloud-native systems.

- Automation: eliminate toil by automating operational workflows, recovery procedures, code delivery, and configuration management; build internal tools and reusable scripts/services to accelerate delivery and reduce errors.

- Observability: define and implement monitoring, logging, alerting, and tracing strategies; establish SLOs/SLIs/error budgets; improve diagnostics and performance visibility for rapid triage.

- Cross-functional collaboration: partner with product, operations, and data teams to translate requirements into secure, scalable solutions; communicate effectively with technical and non-technical stakeholders.

 

Minimum Qualifications

- BS/MS in Computer Science or related field; 10+ years of software engineering in cloud environments.

- Strong in distributed systems/microservices using java / python; SQL/data modeling; python for AI/automation.

- SRE/DevOps expertise: systems and networking fundamentals, application security, observability, performance analysis, and incident response.

- Proven SDLC excellence: code quality, reviews, version control, CI/CD, testing, and release engineering.

- Excellent written and verbal communication; English fluency.

 

Preferred/Technical Skills

- AI/ML/GenAI: experience with foundational models, RAG, agentic architectures; model deployment, optimization, monitoring, and retraining.

- Cloud and containers: experience with containerization, orchestration, and resilient, fault-tolerant microservices.

- Observability: hands-on experience designing dashboards, alerts, traces, logs, and metrics; defining SLOs/SLIs and error budgets; on-call readiness and runbook quality.

- Operations: performance tuning across java / python and SQL for large-scale enterprise applications; strong Linux/Unix expertise; capacity planning and reliability reviews.

- Automation and scripting: proficiency in scripting to automate operational workflows, build tooling, and CI/CD tasks (e.g., shell scripting, python, configuration-as-code, task runners).

- Familiarity with enterprise ERP applications and standard DevOps tooling and practices.


Jobs at Hyderabad

Amazon

HR Partner, Amazon International St…

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Feb. 3, 2026

Amazon

Software Development Engineer, AM T…

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Jan. 27, 2026

Amazon

Financial Analyst II AP, FinOps AP …

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Feb. 3, 2026

Amazon

Risk Specialist DE, SPIV-IPI

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Feb. 3, 2026

Amazon

Program Manager II-PCMO, Audits and…

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Feb. 3, 2026

Oracle

Principal Consultant

Professional

Hyderabad, Telangana

View Details

Last Date: May 12, 2026

Accolite

Senior Python Developer

8 - 12 Years Exp.

Hyderabad, Telangana

View Details

Last Date: Jan. 24, 2026

KPMG

Oracle Apps Technical

KI Professional

Hyderabad, Telangana

View Details

Last Date: Jan. 28, 2026

Amazon

Tririga Solution Architect, Close S…

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Jan. 27, 2026

Oracle

Software Developer 3

Professional

Hyderabad, Telangana

View Details

Last Date: Jan. 30, 2026

Amazon

Implementation Manager, JWO

Freshers/Experienced

Hyderabad, Telangana

View Details

Last Date: Feb. 3, 2026

Oracle

Consulting Technical Manager

Professional

Hyderabad, Telangana

View Details

Last Date: July 12, 2026




More Jobs at Oracle

Oracle

Site Reliability Developer 3

Professional

Bengaluru, Karnataka

View Details

Last Date: April 1, 2026

Oracle

Software Developer 3

Professional

Bengaluru, Karnataka

View Details

Last Date: April 21, 2026

Oracle

Principal Consultant

Professional

Bengaluru, Karnataka

View Details

Last Date: April 8, 2026

Oracle

Principal Consultant

Professional

Bengaluru, Karnataka

View Details

Last Date: June 15, 2026

Oracle

OCI Cloud Sr Engineer

Professional

Bengaluru, Karnataka

View Details

Last Date: June 16, 2026

Oracle

Senior AI Application Engineer

Professional

Bengaluru, Karnataka

View Details

Last Date: July 6, 2026

Oracle

Principal Software Developer - Orac…

Professional

Bengaluru, Karnataka

View Details

Last Date: March 16, 2026

Oracle

Business Analyst

Professional

Bengaluru, Karnataka

View Details

Last Date: Feb. 10, 2026

Oracle

Software Developer 3

Professional

Bengaluru, Karnataka

View Details

Last Date: Feb. 9, 2026

Oracle

Software Developer 3(Java Automatio…

Professional

Bengaluru, Karnataka

View Details

Last Date: April 25, 2026

Oracle

Software Development Snr Manager

Professional

Bengaluru, Karnataka

View Details

Last Date: June 2, 2026

Oracle

Senior Cloud Operations Engineer

Professional

Bengaluru, Karnataka

View Details

Last Date: March 3, 2026




Actively Recruiting Companies at Hyderabad, Telangana