Free cookie consent management tool by TermsFeed Principal Software Engineer - Network Reliability Engineering - AI/ML | Antal Tech Jobs
Back to Jobs
1 Week ago

Principal Software Engineer - Network Reliability Engineering - AI/ML

decor
Chennai, tennessee, India
Information Technology
Full-Time
Oracle

Overview

Job Description

Oracle Cloud Infrastructure (OCI) provides mission-critical cloud services to enterprises worldwide. The Network Reliability Engineering(NRE) Automation, Reporting, and Tooling team builds innovative solutions that boost the productivity and efficiency of the Global Network Operations Center (GNOC). Our tooling empowers the GNOC and Network Reliability Engineering (NRE) teams with observability, automation, and actionable insights at hyperscale.

As a Principal Software Developer, you will design, build, and deliver scalable automation frameworks and advanced platforms leveraging AI/ML to drive operational excellence across OCI’s global network. This includes building network event driven data (such as failures), hybrid classification, and both training and inference. You are passionate about developing software that solves real-world operational challenges, thrive in a fast-paced team, and are comfortable working with complex distributed systems. You value simplicity, scalability, and collaboration.

Responsibilities:

  • Architect, build, and support distributed systems for process control and execution based on Product Requirement Documents (PRDs).
  • Develop and sustain DevOps tooling, new product process integrations and automated testing.
  • Develop ML in Python 3; build backend services in Go (Golang); create command-line interface (CLI) tools in Rust or Python 3; and integrate with other services as needed using Go, Python 3, or C.
  • Build and maintain schemas/models to ensure every platform and service write is captured for monitoring, debugging and compliance
  • Build and maintain dashboards that monitor the quality and effectiveness of service execution for "process as code" your team delivers.
  • Build automated systems that route code failures to the appropriate oncall engineers and service owners.
  • Ensure high availability, reliability, and performance of developed solutions in production environments.
  • Support serverless workflow development for workflows which call and utlize the above mentioned services support our GNOC, GNRE, and onsite operations and hardware support teams.
  • Participate in code reviews, mentor peers, and help build a culture of engineering excellence.
  • Operate in an Extreme Programming (XP) asynchronous environment (chat/tasks) without daily standups, and keep work visible by continuously updating task and ticket states in Jira.


Required Qualifications:

  • 8 - 10 years of experience in process as code, software engineering, automation development, or similar roles
  • Bachelors in computer science and Engineering or related engineering fields
  • Strong coding skills in Go and Python3
  • Experience with distributed systems, micro-services, and cloud-native technologies
  • Proficiency in Linux environments and scripting languages
  • Proficiency with database creation, maintenance and code using SQL and Go or Py3 libraries
  • Understanding of network operations or large-scale IT infrastructure
  • Excellent problem-solving, organizational, and communication skills
  • Experience using AI coding assistants or AI-powered tools to help accelerate software development, including code generation, code review, or debugging.


Preferred Qualifications:

  • Process engineering experience (control systems, proportional integral derivative's (pid), statistical process control (SPC))
  • Proficiency with data modeling, data analysis, and reporting frameworks (e.g., SQL, Spark, Prometheus, Grafana, etc.)
  • Experience with C, Cpp, Java, or Rust
  • Experience developing automation and tools for network or scale cloud operations
  • Background in creating dashboards, alerts, and real-time reporting platforms
  • Familiarity with workflow automation (e.g., Apache Airflow), CI/CD pipelines, or infrastructure as code
  • Previous experience supporting or building tools for (any) hyperscale or scale could network, compute, or storage operations.
  • Knowledge of REST APIs, remote procedure calls (RPCs), and service oriented architectures (SOA)
  • Familiarity with eXtreme programming (xp), agile, and devops process
  • Experience with ticketing and version control systems (e.g., Jira, Git)


Qualifications

Career Level - IC4

About Us

As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Share job
Similar Jobs
View All
16 Hours ago
Principal Architect - DotNet
Healthcare & Life Sciences
  • 15 - 20 Yrs
  • Chennai, Hyderabad
Summary role description: Hiring Principal Architect – .NET Full Stack in the Healthcare Technology provider. Company description: Our client is a global technology and services provider with operations across the U.S. a...
decor
16 Hours ago
Principal Architect - JAVA
Healthcare & Life Sciences
  • 14 - 20 Yrs
Hiring for the Principal Architect - Java Full Stack for a healthcare technology leader advancing U.S. healthcare through AI and cloud innovation. Company description: Our client is a leading healthcare technology and clinical services ...
decor
16 Hours ago
Full Stack Developer
Information Technology
  • 5 - 8 Yrs
  • Thane
About the Role We are building advanced AI-powered enterprise products and are looking for a Node.js + UI Developer (React) to join our engineering team. This role involves end-to-end development of high-performance web applications, from backend ...
decor
1 Day ago
Sr Technical Consultant
Information Technology
  • 7 - 23 INR - Annual
  • 5 - 8 Yrs
  • Pune
Position: Sr. Technical Consultant (Dotnet 6.0+) Experience: 5+ Years Job Title: ASP.NET Core 6.0 / Full stack Developer for Pune Location We are looking for a seasoned ASP.NET Core 6.0 / MVC Developer to join our innovative team. This ro...
decor
1 Day ago
Mobile Engineer (React Native)
Information Technology
  • 1200000 - 1800000 INR - Annual
  • 3 - 6 Yrs
  • Chennai
Job Description About the Role We are looking for a React Native Engineer to join our team in building robust, scalable, and high-performance mobile applications. You will work closely with engineers, designers, and product managers to deliver se...
decor
1 Day ago
Senior AI/ML Engineer
Information Technology
  • 2000000 - 2500000 INR - Annual
  • 4 - 8 Yrs
  • Chennai, Hyderabad
Role : Senior AI/ML Engineer Experience : 4 - 8 years Location: Chennai/Hyderabad Work Mode: WFO  Roles & Responsibilities: Design, implement, and deploy Machine Learning solutions to solve complex problems and deliver real busine...
decor
1 Day ago
Junior Automation Tester - Selenium/Cypress
Information Technology
  • Chennai, Tamil Nadu, India
DescriptionWe are seeking a motivated and enthusiastic Junior Automation Tester to join our Quality Assurance (QA) team.This role is ideal for recent graduates or those early in their career who have a foundational understanding of testing principle...
decor
1 Day ago
Senior AI/Cloud Engineer
Information Technology
  • Chennai, Tamil Nadu, India
Job DescriptionTechnical Expertise Experience: 5+ years of hands-on experience in cloud infrastructure engineering. IaC: Expert-level experience in writing and managing Terraform scripts/modules. Automation: Proficient in scripting with Python and B...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media