Senior Software Engineer - Real-Time Workflows & ML Serving | Antal Tech Jobs
4 Days ago

Senior Software Engineer - Real-Time Workflows & ML Serving

Space Exploration & Research, Information Technology
Full-Time
Microsoft

Overview

JOB DESCRIPTION

Modern ads platforms run on always-on, real-time data: streaming events, feature computation, near-real-time aggregations, and low-latency serving to power ML models that operate at massive scale under strict freshness, cost, and reliability requirements.

Microsoft Ads builds and operates large-scale, latency-sensitive systems that serve billions of requests. We are looking for a Senior Software Engineer who is hands-on with production coding and system design to build the real-time data pipelines and feature/embedding materialization systems that feed online stores and caches and integrate tightly with ML inference serving.

This Role Is Ideal For Engineers Who Enjoy

  • Building robust streaming and ETL systems (correctness, idempotency, backfills, late data).
  • Owning SLOs with strong observability and operational maturity.
  • Optimizing end-to-end performance and cost across compute, storage, and serving integrations.

Primary success metrics are freshness, correctness, latency, reliability, and cost in production.

Responsibilities

  • Design and implement real-time streaming ETL / feature pipelines (e.g., Flink or Spark Structured Streaming) that meet strict freshness and correctness constraints.
  • Build and operate reliable messaging and ingestion with Kafka/Pulsar (partitioning strategy, retries, ordering guarantees, DLQs, backpressure handling).
  • Own data contracts between producers, pipelines, and consumers: schema evolution, versioning, compatibility, validation, and safe rollout.
  • Implement production-grade backfill/replay workflows.
  • Define and meet SLOs using OpenTelemetry/Prometheus/Grafana for metrics, tracing, dashboards, alerting, and incident response readiness.
  • Integrate pipelines with online stores/caches and ML consumers (feature stores, embedding pipelines, LLM API calls, online/offline consistency patterns).
  • Partner with applied scientists on feature/embedding definitions, validation, and end-to-end quality measurement.
  • Optimize end-to-end performance and efficiency: CPU/memory/I/O, serialization, caching, network overhead, concurrency, and pipeline compute cost.
  • Contribute to serving/inference integrations where needed (e.g., Triton/ONNX Runtime/TensorRT) including batching and latency/cost tradeoffs.
  • Ship safely with CI/CD, automated testing (unit/integration/data quality), and operational playbooks/runbooks.

Qualifications

Required Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Electrical/Computer Engineering, or a related field, with 6+ years of related experience.
  • Strong programming skills in at least one of C++, C#, or Python.
  • Hands-on experience in one or more of the following:
    • Building and operating streaming data pipelines in production (Flink or Spark Structured Streaming).
    • Distributed systems engineering with strong reliability and operational rigor.
    • Messaging systems such as Kafka/Pulsar.
  • Experience operating services with Kubernetes/containers and production readiness practices (deployments, scaling, rollbacks).
  • Experience with observability stacks such as OpenTelemetry, Prometheus, Grafana.

Preferred Qualifications

  • Experience with feature stores, embedding pipelines, and online/offline consistency (freshness guarantees, correctness validation).
  • Experience with data lakehouse/table formats and optimizations, e.g., partitioning, compaction, and incremental processing.
  • Experience with GPU inference serving (Triton, ONNX Runtime/TensorRT) and performance techniques (batching, request shaping, tail-latency reduction).
  • Background in cost/performance modeling, capacity planning, and reliability improvements for high-scale data platforms.
  • Experience in ads, search, recommendations, or other high-scale systems where freshness, latency, and cost are important.

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
